Intel Contributes AI Acceleration to PyTorch 2.0
March 17, 2023 | Intel
In the release of PyTorch 2.0, contributions from Intel using Intel® Extension for PyTorch, oneAPI Deep Neural Network Library (oneDNN) and additional support for Intel® CPUs enable developers to optimize inference and training performance for artificial intelligence (AI).
As part of the PyTorch 2.0 compilation stack, the TorchInductor CPU backend optimization by Intel Extension for PyTorch and PyTorch ATen CPU achieved up to 1.7 times faster FP32 inference performance when benchmarked with TorchBench, HuggingFace and timm [1]. This update brings notable performance improvements to graph compilation over PyTorch eager mode.
Other optimizations include:
- Improved message passing between adjacent neural network nodes to support graph neural networks in PyTorch Geometric (PyG), for enhanced inference and training performance on Intel CPUs.
- New x86 quantization backend – a combination of FBGEMM (Facebook General Matrix-Matrix Multiplication) and oneDNN backends – replaces FBGEMM as the default quantization backend for x86 CPU platforms to enable better end-to-end int8 inference performance.
- Extended use of oneDNN with oneDNN Graph API to maximize efficient code generation on AI hardware by automatically identifying the graph partitions to be accelerated through fusion. BFloat16 (BF16) and Float32 data types are supported, but only inference workloads can be optimized, and BF16 is optimized only on machines with AVX512_BF16 ISA support.
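For the oneDNN Graph item above, a minimal Float32 inference sketch might look like the following. It relies on torch.jit.enable_onednn_fusion together with TorchScript tracing and freezing; the small convolutional model and the input shape are hypothetical and were picked only to give the fuser something to partition. This is a sketch under those assumptions, not Intel's benchmark setup.

```python
import torch
import torch.nn as nn

# Opt in to oneDNN Graph fusion for TorchScript inference.
torch.jit.enable_onednn_fusion(True)

# Hypothetical toy model; conv + ReLU pairs are typical fusion candidates.
model = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.Conv2d(8, 8, kernel_size=3, padding=1),
    nn.ReLU(),
).eval()

x = torch.rand(1, 3, 32, 32)  # Float32 input (shape chosen arbitrarily)

with torch.no_grad():
    reference = model(x)                # eager-mode result for comparison
    traced = torch.jit.trace(model, x)  # capture the graph
    traced = torch.jit.freeze(traced)   # inline weights so fusion can apply
    traced(x)                           # warm-up runs: graph optimization
    traced(x)                           # happens during the first calls
    out = traced(x)

print(out.shape)
```

Only inference is covered here, in line with the limitation stated above; the BF16 path additionally depends on AVX512_BF16 hardware support and is not shown.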
Copyright © 2024 I-Connect007 | IPC Publishing Group Inc. All rights reserved.