NVIDIA A100 · High-Performance Computing
Minimum specification. Scale CPU, RAM, and storage in the calculator.
Minimum 12-month commitment
NVIDIA A100 · Largest Models & Datasets
Doubles GPU memory to 80 GB HBM2e for the largest models and datasets.
Minimum 12-month commitment
Introduction
We use NVIDIA A100 Tensor Core GPUs to give our users high-performance compute for deep learning, HPC, and data analytics.
Representing the most powerful end-to-end AI and HPC platform for data centres, the A100 allows technologists and researchers to deliver real-world results and deploy solutions into production at scale.
With Multi-Instance GPU (MIG), the A100 can scale up efficiently or be divided into seven isolated GPU instances, offering a versatile platform that adapts dynamically to changing workload demands.
Features
The A100 family shares the same Ampere architecture and third-generation Tensor Cores. The difference between the two cards is memory.
From the smallest job to the biggest multi-node workload, MIG and NVLink let the A100 handle any acceleration need, maximising the utility of every GPU around the clock.
Up to 312 TFLOPS of deep-learning performance, 20× the Tensor throughput of NVIDIA Volta for both training and inference.
NVLink in the A100 delivers 2× the throughput of the previous generation; with NVSwitch, up to 16 A100 GPUs interconnect at up to 600 GB/s for peak application performance.
Partition one A100 into up to seven hardware-isolated instances, each with its own memory, cache, and cores: right-sized acceleration for every job.
Up to 80 GB of HBM2e delivers the world's fastest GPU memory bandwidth, over 2 TB/s, with 95% DRAM utilisation efficiency and 1.7× the memory bandwidth of the previous generation.
Tensor Cores exploit sparsity in AI models for up to 2× higher performance, most notably for inference, and also during training.
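To make the sparsity feature concrete: NVIDIA describes the pattern these Tensor Cores accelerate as 2:4 structured sparsity, where at most two of every four consecutive weights are non-zero. The sketch below, in plain Python, prunes a row of weights to that pattern by keeping the two largest-magnitude values in each group of four. The helper name `prune_2_of_4` is hypothetical, and magnitude-based selection is a common heuristic rather than a description of any specific NVIDIA tool; the code illustrates the pattern only and does not run on a GPU.

```python
from typing import List


def prune_2_of_4(weights: List[float]) -> List[float]:
    """Zero the two smallest-magnitude values in every group of four.

    Illustrative only: `prune_2_of_4` is a hypothetical helper, not an NVIDIA API.
    """
    if len(weights) % 4 != 0:
        raise ValueError("length must be a multiple of 4")

    pruned: List[float] = []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Indices of the two largest-magnitude entries in this group.
        keep = sorted(range(4), key=lambda i: abs(group[i]), reverse=True)[:2]
        pruned.extend(w if i in keep else 0.0 for i, w in enumerate(group))
    return pruned


row = [0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.4, 0.01]
sparse_row = prune_2_of_4(row)
print(sparse_row)  # [0.9, 0.0, 0.0, -0.7, 0.0, 0.3, -0.4, 0.0]
```

Half of the values in each group are now zero, which is the regular structure the hardware can skip over; whether a given model keeps its accuracy after pruning like this has to be checked case by case.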
Two Cards, One Platform
Same Ampere compute, different memory. Explore the full specifications and selling points of each card on its dedicated page.
Best for value & density
The price-to-performance choice: 40 GB HBM2 with 1,555 GB/s of bandwidth for production inference, fine-tuning, and multi-tenant workloads that fit within 40 GB.
Best for the largest workloads
The performance flagship: 80 GB HBM2e and the world's fastest GPU bandwidth (over 2 TB/s) for large-scale training, massive datasets, and memory-bound HPC.
40 GB vs 80 GB
Same Ampere compute; the difference is memory, bandwidth, and power. Here is the head-to-head at a glance.
| Specification | A100 40 GB | A100 80 GB |
|---|---|---|
| GPU memory | 40 GB HBM2 | 80 GB HBM2e |
| Memory bandwidth | 1,555 GB/s | Over 2 TB/s |
| Max TDP (PCIe) | 250 W | 300 W |
| MIG instances (max × size) | 7 × 5 GB | 7 × 10 GB |
| Best for | Cost-efficiency, inference & MIG multi-tenancy | Largest models, massive datasets & memory-bound HPC |
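As a rough way to read the table against your own workload, the sketch below encodes the memory, MIG, and TDP figures from the comparison and picks the smallest card whose memory covers a given requirement. The class and function names are hypothetical, and `required_gb` is your own estimate of the working set; usable memory on a real card is somewhat below the nominal figure, so treat the result as a starting point rather than a sizing guarantee.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class A100Card:
    name: str
    memory_gb: int
    mig_instances: int
    mig_slice_gb: int
    max_tdp_w: int  # PCIe figure from the comparison table


# Values copied from the comparison table; nothing here is measured.
CARDS = [
    A100Card("A100 40 GB", 40, 7, 5, 250),
    A100Card("A100 80 GB", 80, 7, 10, 300),
]


def smallest_card_that_fits(required_gb: float) -> Optional[A100Card]:
    """Return the lowest-memory card whose nominal memory covers the requirement."""
    for card in sorted(CARDS, key=lambda c: c.memory_gb):
        if required_gb <= card.memory_gb:
            return card
    return None  # Needs more than one GPU, or a different card.


def fits_in_mig_slice(card: A100Card, required_gb: float) -> bool:
    """True if the job fits inside one of the card's smallest MIG instances."""
    return required_gb <= card.mig_slice_gb


choice = smallest_card_that_fits(48)
print(choice.name if choice else "no single card fits")  # A100 80 GB
print(fits_in_mig_slice(CARDS[0], 8))  # False: 8 GB exceeds a 5 GB slice
print(fits_in_mig_slice(CARDS[1], 8))  # True: 8 GB fits a 10 GB slice
```

Note that seven slices account for 35 GB and 70 GB respectively, so a fully partitioned card does not expose its entire nominal memory to MIG instances.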
Cloud GPUaaS
Our mCloud platform integrates powerful NVIDIA GPUs directly into your virtual machines through GPU passthrough, giving each virtual machine access to the full capabilities of a physical GPU at near-native performance.
Through time-sliced access to GPU compute with guaranteed minimums, clients with non-time-critical workloads or limited budgets can get affordable access to cloud-based GPU compute.
For those who require their own dedicated GPUs, our platform supports NVIDIA A100 40GB, NVIDIA A100 80GB, NVIDIA RTX A6000, NVIDIA H100, and NVIDIA H200 GPUs, all designed to support any workload you need to run in the cloud.
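With GPU passthrough, the card should appear inside the virtual machine much as it would on a physical host. One way to confirm that from within a VM is to query the driver with `nvidia-smi`; the sketch below wraps that call and parses its CSV output. It assumes the NVIDIA driver and `nvidia-smi` are already installed in the guest. The function names are hypothetical, and the sample line is illustrative rather than captured from this platform.

```python
import subprocess
from typing import List, Tuple

# nvidia-smi query for GPU name and total memory (MiB, units suppressed).
QUERY = [
    "nvidia-smi",
    "--query-gpu=name,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_gpu_lines(output: str) -> List[Tuple[str, int]]:
    """Turn 'name, memory' CSV lines into (name, memory_mib) tuples."""
    gpus: List[Tuple[str, int]] = []
    for line in output.strip().splitlines():
        name, mem = line.rsplit(",", 1)
        gpus.append((name.strip(), int(mem.strip())))
    return gpus


def list_gpus() -> List[Tuple[str, int]]:
    """Run nvidia-smi in the VM; raises if the tool is missing or fails."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_lines(result.stdout)


# Illustrative sample line, not output captured from a real instance.
sample = "NVIDIA A100 80GB PCIe, 81920\n"
print(parse_gpu_lines(sample))  # [('NVIDIA A100 80GB PCIe', 81920)]
```

Calling `list_gpus()` inside a provisioned VM would return one tuple per visible GPU; an empty list or an error suggests the device or driver is not visible to the guest.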
Why Micron21
As Australia's first Tier IV data centre, our GPU Compute offerings provide reliable, secure, high-calibre performance.
Intel Xeon Gold CPUs and ultra-fast NVMe SSDs deliver high-performance compute and rapid access to resources for the most demanding applications.
Every GPU Cloud Server is protected by our comprehensive DDoS platform, inspecting and filtering traffic across our global scrubbing centres.
Tier IV is the highest uptime accreditation a data centre can hold, and we're Australia's first Tier IV-accredited facility.
ISO 27001, 27002, 27018 and 14520 certified, PCI compliant, and IRAP assessed, up to date with the latest security standards.
Proudly 100% Australian owned and operated. The physical sovereignty of your data is the ultimate peace of mind.
Our dedicated Australian-based technicians are located in the Micron21 data centre, with 24/7 access available.
Take a Virtual Tour
Take a look at the difference a Tier IV Data Centre can make to the reliability and availability of your services.
Micron21 is an Australian first and a leader in data centre infrastructure redundancy, exceeding even the highest accreditations available for the protection of power and cooling.
Build and price your exact configuration in minutes with the calculator, or talk to our Australian-based specialists about matching the right A100 to your workload.