Home Update New #1 Supercomputer: Fugaku in Japan, with A64FX, take Arm…

New #1 Supercomputer: Fugaku in Japan, with A64FX, take Arm…

298
New #1 Supercomputer: Fugaku in Japan, with A64FX, take Arm...


High efficiency computing is now at some extent in its existence the place to be the primary, you want very highly effective, very environment friendly {hardware}, a number of it, and plenty of functionality to deploy it. Deploying a single rack of servers to whole a few thousand cores isn’t going to chop it. The former #1 supercomputer, Summit, is constructed from 22-core IBM Power9 CPUs paired with NVIDIA GV100 accelerators, totaling 2.Four million cores and consuming 10 MegaWatts of energy. The new Fugaku supercomputer, constructed at Riken in partnership with Fujitsu, takes the highest spot on the June 2020 #1 listing, with 7.Three million cores and consuming 28 MegaWatts of energy.

The new Fugaku supercomputer is greater than Summit in virtually each method. It has 3.05x cores, it has 2.8x the rating within the official LINPACK checks, and consumes 2.8x the facility. It additionally marks the primary time that an Arm primarily based system sits at primary on the highest 500 listing.

Due to the onset of the Coronavirus pandemic, Riken accelerated the deployment of Fugaku in latest months. On May 13th, Riken introduced that greater than 400 racks, every that includes a number of 48-core A64FX playing cards per server, have been deployed. This was a course of that had began again in December, however they have been so eager on getting the supercomputer up and operating to help with the R&D as quickly as potential – the server racks didn’t have their official entrance panels after they began working. There are nonetheless further assets so as to add, with full operation scheduled to start in Riken’s Fiscal 2021, suggesting that Fugaku’s compute values on the highest 100 listing are set to rise even increased.

Alongside being #1 within the TOP500, Fugaku enters the Inexperienced500 List at #9, simply behind Summit, and under the Fugaku Prototype set up which sits at #4.

At the center of Fugaku is the A64FX, a customized Arm v8-A CPU-based chip optimised for compute. The whole configuration makes use of 158,976 of those 48+4-core playing cards, operating at 2.2 GHz peak efficiency (48 cores for compute, Four for help). This permits for some substantial Rpeak numbers, similar to 537 PetaFLOPs of FP64, the standard TOP500 metric. But A64FX additionally helps quantized fashions with decrease precision, which is the place we get into some enjoyable numbers for Fugaku:

  • FP64: 0.54 ExaFLOPs
  • FP32: 1.07 ExaOPs
  • FP16: 2.15 ExaOPs
  • INT8: 4.30 ExaOPs

Due to the design of the A64FX, it additionally permits for a complete reminiscence bandwith of 163 PetaBytes per second.

To date, the A64FX compute card is the one implementation of Arm’s v8.2-A Scalable Vector Extensions (SVE). The objective of SVE is to permit Arm’s clients to construct {hardware} with vector models starting from 128-bit to 2048-bit, such that any software program that’s constructed to run on SVE will routinely scale whatever the SVE execution unit dimension. A64FX makes use of two 512-bit huge pipes per core, with 48 compute cores per chip, and in addition provides in 4 Eight GiB HBM2 hyperlinks per chip to be able to feed the models for 1 TiB/s of whole bandwidth into the chip.

As listed above, the unit helps INT8 by FP64, and the chip has an on-board customized Tofu interconnect, supporting as much as 560 Gbps of interconnect to different A64FX modules. The chip is constructed on TSMC’s N7 course of, and is available in at 8.79 billion transistors. 90% execution effectivity is claimed for DGEMM sort workloads, and extra mechanisms similar to mixed collect and unaligned SIMD loading are used to assist preserve throughput excessive. There can also be further tuning that may be achieved on the energy stage for optimization, and in depth inside RAS (over 128ok error checkers in silicon) to make sure accuracy.

Details on the A64FX chip have been disclosed at Hot Chips in 2018, and we noticed wafers and chips at Supercomputing in 2019. This chip is anticipated to be the primary in a sequence of chips from Fujitsu alongside the same HPC…



Source

LEAVE A REPLY

Please enter your comment!
Please enter your name here