AMD Announces World’s Fastest HPC Accelerator for Scientific Research
November 16, 2020 | AMDEstimated reading time: 2 minutes
AMD announced the new AMD Instinct™ MI100 accelerator – the world’s fastest HPC GPU and the first x86 server GPU to surpass the 10 teraflops (FP64) performance barrier. Supported by new accelerated compute platforms from Dell, Gigabyte, HPE, and Supermicro, the MI100, combined with AMD EPYC CPUs and the ROCm™ 4.0 open software platform, is designed to propel new discoveries ahead of the exascale era.
Built on the new AMD CDNA architecture, the AMD Instinct MI100 GPU enables a new class of accelerated systems for HPC and AI when paired with 2nd Gen AMD EPYC processors. The MI100 offers up to 11.5 TFLOPS of peak FP64 performance for HPC and up to 46.1 TFLOPS peak FP32 Matrix performance for AI and machine learning workloads2. With new AMD Matrix Core technology, the MI100 also delivers a nearly 7x boost in FP16 theoretical peak floating point performance for AI training workloads compared to AMD’s prior generation accelerators.3
“Today AMD takes a major step forward in the journey toward exascale computing as we unveil the AMD Instinct MI100 – the world’s fastest HPC GPU,” said Brad McCredie, corporate vice president, Data Center GPU and Accelerated Processing, AMD. “Squarely targeted toward the workloads that matter in scientific computing, our latest accelerator, when combined with the AMD ROCm open software platform, is designed to provide scientists and researchers a superior foundation for their work in HPC.”
Open Software Platform for the Exascale Era
The AMD ROCm developer software provides the foundation for exascale computing. As an open source toolset consisting of compilers, programming APIs and libraries, ROCm is used by exascale software developers to create high performance applications. ROCm 4.0 has been optimized to deliver performance at scale for MI100-based systems. ROCm 4.0 has upgraded the compiler to be open source and unified to support both OpenMP® 5.0 and HIP. PyTorch and Tensorflow frameworks, which have been optimized with ROCm 4.0, can now achieve higher performance with MI1007,8. ROCm 4.0 is the latest offering for HPC, ML and AI application developers which allows them to create performance portable software.
“We’ve received early access to the MI100 accelerator, and the preliminary results are very encouraging. We’ve typically seen significant performance boosts, up to 2-3x compared to other GPUs,” said Bronson Messer, director of science, Oak Ridge Leadership Computing Facility. “What’s also important to recognize is the impact software has on performance. The fact that the ROCm open software platform and HIP developer tool are open source and work on a variety of platforms, it is something that we have been absolutely almost obsessed with since we fielded the very first hybrid CPU/GPU system.”
Suggested Items
Real Time with... IPC APEX EXPO 2024: Integrating Automation into the North American PCB Market
04/18/2024 | Real Time with...IPC APEX EXPODan Beaulieu introduces James Chien from SAA Symtek Automation Asia and Jason Perry from Technica, who are bringing automation to the North American PCB market. They discuss their expertise in automation, equipment sets, and growing opportunities. The focus is on integrating automation into existing shops, considering hardware, software, and flexibility. They also discuss the challenges of modernizing domestic shops and the potential of expanding automation to other markets.
Mobileye EyeQ6 Lite Launches to Speed ADAS Upgrades Worldwide
04/17/2024 | BUSINESS WIREMobileye announced it has delivered the first production-candidate hardware and software of its new EyeQ™6 Lite system-on-chip to its customers, which will power advanced driver-assistance systems in multiple models launching this year.
Yamaha to Showcase Latest-generation Assembly Equipment and Software Tools at SMTconnect
04/16/2024 | Yamaha Robotics SMT SectionYamaha Robotics SMT Section will team with its distributor ANS Elektronik to showcase innovations for high-speed surface mount assembly at SMTconnect 2024.
Marantz Electronics EZPro Software Solution: Streamlining Production Preparation for Cost-Efficient Manufacturing
04/16/2024 | Mek (Marantz Electronics)Marantz Electronics is proud to announce the launch of EZPro Software, Automatic Optical Inspection (AOI) machine programming that harnesses the power of Artificial Intelligence (AI).
FPT Unveils Strategic Directions, “All In” on AI, Automotive and Semiconductor
04/15/2024 | BUSINESS WIREFPT Corporation (FPT) announced its strategic directions for the 2024-2026 period at the 2024 Annual General Meeting, with five focused areas defined as Artificial Intelligence (AI), Automotive, Semiconductor, Digital Transformation, and Green Transformation.