NVIDIA A100 TENSOR CORE GPU

A100 introduces groundbreaking new features to optimize inference workloads. It brings unprecedented versatility by accelerating a full range of precisions, from FP32 to FP16 to INT8 and all the way down to INT4. Multi-Instance GPU (MIG) technology allows multiple networks to operate simultaneously on a single A100 GPU for optimal utilization of compute resources. And structural sparsity support delivers up to 2X more performance on top of A100’s other inference performance gains.

NVIDIA already delivers market-leading inference performance, as demonstrated in an across-the-board sweep of MLPerf Inference 0.5, the first industry-wide benchmark for inference. A100 brings 20X more performance to further extend that leadership.
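To make the structural sparsity claim above more concrete, here is a minimal sketch of the kind of pruning it relies on. This page does not name the pattern; the sketch assumes the 2:4 fine-grained pattern described in NVIDIA's Ampere architecture material, where at most two values in every group of four weights are non-zero. The function name `prune_2_of_4` is hypothetical, and a real workflow would typically also fine-tune the network after pruning and store the weights in a compressed format, neither of which is shown here.

```python
def prune_2_of_4(weights):
    """Zero the two smallest-magnitude values in each group of four.

    Illustrative only: assumes a 2:4 sparsity pattern and a flat list
    of weights whose length is a multiple of four.
    """
    if len(weights) % 4 != 0:
        raise ValueError("length must be a multiple of 4")
    pruned = []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Indices of the two largest-magnitude entries in this group.
        keep = sorted(range(4), key=lambda i: abs(group[i]), reverse=True)[:2]
        pruned.extend(v if i in keep else 0.0 for i, v in enumerate(group))
    return pruned


row = [0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.4, 0.0]
sparse_row = prune_2_of_4(row)
print(sparse_row)
```

Because half of every group is known to be zero, hardware that understands the pattern can skip those multiplications, which is where a theoretical ceiling of 2X comes from.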

About

The Most Powerful End-to-End AI and HPC Data Center Platform

A100 is part of the complete NVIDIA data center solution that incorporates building blocks across hardware, networking, software, libraries, and optimized AI models and applications from NGC™. Representing the most powerful end-to-end AI and HPC platform for data centers, it allows researchers to deliver real-world results and deploy solutions into production at scale.

Specifications

Peak FP64: 9.7 TFLOPS

Peak FP64 Tensor Core: 19.5 TFLOPS

Peak FP32: 19.5 TFLOPS

Peak TF32 Tensor Core: 156 TFLOPS | 312 TFLOPS*

Peak BFLOAT16 Tensor Core: 312 TFLOPS | 624 TFLOPS*

Peak FP16 Tensor Core: 312 TFLOPS | 624 TFLOPS*

Peak INT8 Tensor Core: 624 TOPS | 1,248 TOPS*

Peak INT4 Tensor Core: 1,248 TOPS | 2,496 TOPS*

GPU Memory: 40 GB

GPU Memory Bandwidth: 1,555 GB/s

Interconnect: NVIDIA NVLink 600 GB/s; PCIe Gen4 64 GB/s

Multi-Instance GPU: Various instance sizes with up to 7 MIGs @ 5 GB

Form Factor: 4/8 SXM on NVIDIA HGX™ A100

Max TDP Power: 400 W

* With sparsity
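A few relationships between these figures are easier to see with a short calculation. The sketch below copies a subset of values from the specifications above and derives three numbers: the ratio between each pair of Tensor Core figures, the arithmetic intensity at which dense FP16 compute and memory bandwidth balance (a simple roofline-style estimate), and the total memory covered by seven 5 GB MIG instances. It assumes the starred value in each pair is the figure with structural sparsity, which is consistent with the "up to 2X" claim earlier on the page. All variable and function names are illustrative, and peak figures are theoretical upper bounds rather than measured results.

```python
# Values copied from the specifications (first value | starred value).
TENSOR_PEAKS = {
    "BFLOAT16": (312, 624),   # TFLOPS
    "FP16": (312, 624),       # TFLOPS
    "INT8": (624, 1248),      # TOPS
    "INT4": (1248, 2496),     # TOPS
}
MEMORY_GB = 40
BANDWIDTH_GB_S = 1555
MIG_INSTANCES = 7
MIG_INSTANCE_GB = 5


def sparsity_speedups(peaks):
    """Ratio of the starred peak to the unstarred peak for each precision."""
    return {name: starred / base for name, (base, starred) in peaks.items()}


def ridge_point(peak_tflops, bandwidth_gb_s):
    """Operations per byte at which peak compute equals peak bandwidth."""
    return (peak_tflops * 1e12) / (bandwidth_gb_s * 1e9)


speedups = sparsity_speedups(TENSOR_PEAKS)
fp16_ridge = ridge_point(TENSOR_PEAKS["FP16"][0], BANDWIDTH_GB_S)
mig_total_gb = MIG_INSTANCES * MIG_INSTANCE_GB

print(speedups)
print(f"FP16 ridge point: {fp16_ridge:.1f} FLOPs per byte")
print(f"MIG: {mig_total_gb} GB of {MEMORY_GB} GB across {MIG_INSTANCES} instances")
```

A kernel that performs fewer operations per byte of memory traffic than the ridge point is limited by the 1,555 GB/s bandwidth rather than by the Tensor Core peak, which is worth keeping in mind when reading the headline TFLOPS numbers.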

Related products

NVIDIA RTX A6000-48GB

NVIDIA RTX A5000

GEFORCE RTX3090-24GB