NVIDIA A100 80GB

The performance flagship for the largest AI models and most massive datasets - 80GB of HBM2e and the world's fastest GPU memory bandwidth, available on demand from our Tier IV Australian cloud.

Available now 80GB HBM2e Over 2 TB/s bandwidth MIG @ 10GB 95% DRAM efficiency
 
EOFY SaleLimited Offer

GPU Compute

NVIDIA A100 Tensor Core GPU

Cost Effective Resources & Infrastructure

Reduce your cloud costs without sacrificing performance or reliability. mCloud delivers enterprise grade cloud infrastructure at a fraction of the cost of AWS, Google Cloud, or Azure.

Fault-Tolerant Tier IV Data Centre

Micron21 operates Australia’s first Tier IV-certified data centre, offering 100% uptime, redundant power, and high availability architecture.

24/7 Australian-based Expert Support

Our cloud specialists provide 24/7 Australian-based support, ensuring seamless deployments and efficient troubleshooting.

NVIDIA A100 (80 GB)

High-Performance Computing

$3,230 AUD / month

Minimum Specifications

Provided as a High-Availability mCloud Virtual Cloud Server

  • GPU: NVIDIA A100 (80 GB)
  • GPU Compute: Dedicated
  • vCPU: 12 Cores - XEON Gold
  • RAM: 64 GB - DDR4
  • Storage: 500 GB - NVMe SSD
  • Bandwidth: 2 TB p/m
  • IP Address: Included
  • DDoS Protection: Shield

Overview

Maximum Memory, Maximum Throughput

The A100 80GB pairs NVIDIA's Ampere architecture and third-generation Tensor Cores with 80GB of HBM2e memory and the world's fastest GPU bandwidth - over 2 TB/s. That keeps the largest models and most massive datasets resident on the GPU, speeding time to solution for the most demanding AI and HPC workloads, while Multi-Instance GPU partitions one card into seven isolated 10GB instances.

80GB HBM2e
GPU memory
2,039 GB/s
Memory bandwidth (over 2 TB/s)
7
MIG instances @ 10GB
95%
DRAM utilisation efficiency

Capabilities

What Makes the A100 80GB Different

Third-Gen Tensor Cores

Up to 312 TFLOPS of deep-learning performance and 20× the Tensor throughput of the previous Volta generation for training and inference.

Multi-Instance GPU (MIG)

Partition a single card into as many as seven fully isolated 10GB instances, each with its own memory, cache, and compute - right-sized acceleration at scale.

Structural Sparsity

Tensor Cores exploit sparsity in AI models to deliver up to 2× higher performance, most notably for inference but also during training.

Next-Gen NVLink

Connect two GPUs over an NVLink bridge at 600 GB/s - double the previous generation's throughput - for workloads that span multiple cards.

80GB HBM2e Memory

The world's fastest GPU memory - over 2 TB/s of bandwidth at 95% DRAM utilisation efficiency, and 1.7× the bandwidth of the previous generation.

Every Math Precision

One accelerator for every job - FP64 and TF32 for HPC and training, BF16/FP16 for deep learning, and INT8 for high-throughput inference.

Specifications

Technical Specifications

Compute & Tensor Cores

  • FP649.7 TFLOPS
  • FP64 Tensor Core19.5 TFLOPS
  • FP3219.5 TFLOPS
  • TF32 Tensor Core156 TFLOPS 312 TFLOPS with sparsity
  • BFLOAT16 Tensor Core312 TFLOPS 624 TFLOPS with sparsity
  • FP16 Tensor Core312 TFLOPS 624 TFLOPS with sparsity
  • INT8 Tensor Core624 TOPS 1,248 TOPS with sparsity

Memory, Platform & Form Factor

  • GPU memory80GB HBM2e
  • Memory bandwidth1,935 GB/s
  • Max thermal design power300W
  • Multi-Instance GPUUp to 7 @ 10GB
  • InterconnectNVLink 600 GB/s PCIe Gen4 64 GB/s
  • Form factorPCIe
  • GPU architectureNVIDIA Ampere

Specifications per the NVIDIA A100 Tensor Core GPU datasheet (r4). Peak rates marked “with sparsity” require structural-sparsity-enabled models.

Performance

Built for the Largest Workloads

Each figure compares the A100 against a different reference point, as published by NVIDIA - note the baseline beneath each number.

2 TB/s

Memory bandwidth

Over 2 TB/s of HBM2e bandwidth: the fastest GPU memory available.

249×

AI inference throughput

BERT-Large inference vs a CPU-only server (INT8 with sparsity).

20×

Higher performance

vs the prior NVIDIA Volta generation, across training and inference.

95%

DRAM utilisation

HBM2e keeps the memory system working at near-peak efficiency.

Where It Fits

The Right Card for the Job

Why run it on mCloud

Australian, on-demand, supported

  • Tier IV data centre - Australia's first, with redundant power and high availability.
  • Fast underlying platform - NVMe storage and 100Gbps networking feed the GPU.
  • 24/7 Australian support - local cloud specialists for deployment and troubleshooting.
  • Pay for what you use - scale instances up or down to match demand.
  • OpenStack & IaC ready - automate provisioning with the API and Terraform.

Deploy the A100 80GB today

Spin up flagship GPU compute on our Tier IV Australian cloud, or talk to our specialists about sizing the right configuration for your workload.

 

Sign up for the Micron21 Newsletter