Thursday, August 11, 2022

7 nm Ampere GPU architecture detailed, NVIDIA A100 GPU launched

- Advertisement -

Finally, at the 2020 NVIDIA GTC keynote, NVIDIA’s CEO Jensen Huang has launched the new 7 nm based Ampere GPU architecture. Instead of launching the GeForce RTX 3000 gaming GPUs, NVIDIA has launched the high performing A100 GPU that is the company;s new 7nm GPU offering.

The newer GPUs by the company will be based on the same architecture as Huang clarified that “there’s great overlap in the architecture, but not in the configuration.” From GeForce gaming GPUs to Quadro professional GPUs, the same tech behind the A100 GPU unveiled will be used in future.

NVIDIA’s new NVIDIA A100 Tensor Core GPU is based on the new NVIDIA Ampere GPU architecture and builds upon the capabilities of the prior NVIDIA Tesla V100 GPU. It adds many new features and delivers significantly faster performance for HPC, AI, and data analytics workloads. 

A100 provides strong scaling for GPU compute and DL applications running in single– and multi-GPU workstations, servers, clusters, cloud data centres, systems at the edge, and supercomputers. The A100 GPU enables building elastic, versatile, and high throughput data centres.

The new NVIDIA A100 GPU fabricated on the TSMC 7nm N7 manufacturing process, the NVIDIA Ampere architecture-based GA100 GPU that powers A100 includes 54.2 billion transistors with a die size of 826 mm2. The new chip also features the third generation of Tensor Cores that have a new numerical format called Tensor Float 32 (TF32) replacing the older FP32.

7 nm Ampere GPU architecture detailed, NVIDIA A100 GPU launched
NVIDIA A100 GPU on the new SXM4 module

The NVIDIA A100 GPU has 40 GB of high-speed HBM2 memory with 1555 GB/sec of memory bandwidth—a 73% increase compared to Tesla V100. The A100 GPU has significantly more on-chip memory including a 40 MB Level 2 (L2) cache—nearly 7x larger than V100.

The new Multi-Instance GPU (MIG) feature allows the A100 Tensor Core GPU to be securely partitioned into as many as seven separate GPU Instances for CUDA applications, providing multiple users with separate GPU resources to accelerate their applications. 

7 nm Ampere GPU architecture detailed, NVIDIA A100 GPU launched
GA100 Full GPU with 128 SMs. The A100 Tensor Core GPU has 108 SMs.

The third-generation of NVIDIA high-speed NVLink interconnect implemented in A100 GPUs and the new NVIDIA NVSwitch significantly enhances multi-GPU scalability, performance, and reliability. NVIDIA is supposed to be offering 8 A100 GPUs combined in a single DGX A100 node with a peak performance of 5 PFLOPs of computing performance for US$200,000.

- Advertisement -

The A100 GPU will also support PCI Express Gen 4 with faster bandwidth than before and the A100 GPUs will be able to connect to PCIe 4.0-capable CPUs, currently on AMD Zen-2 based CPUs. It seems strange that NVIDIA did not launch the new RTX 3000 gaming GPUs, well they might be waiting for its competitor AMD to launch RDNA2 based gaming GPUs first.

Source: NVIDIA

Do check out:

😎TechnoSports-stay UPDATED😎

- Advertisement -
Raunak Saha
Raunak Saha
A cs engineer by profession but foodie from heart. I am tech lover guy who has a passion for singing. Football is my love and making websites is my hobby.


Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Related Articles

Bitdefender WW

More To Consider

Stay Connected

Ajio [CPS] IN

Hot Topics


Latest Articles


Bigrock [CPS] IN


Adblocker detected! Please consider reading this notice.

We've detected that you are using AdBlock Plus or some other adblocking software which is preventing the page from fully loading.

We don't have any banner, Flash, animation, obnoxious sound, or popup ad. We do not implement these annoying types of ads!

We need money to operate the site, and almost all of it comes from our online advertising.

Please add to your ad blocking whitelist or disable your adblocking software.