mCloud GPU - NVIDIA A100 (80GB) - mCloud - Enterprise

GPU Compute

NVIDIA A100 Tensor Core GPU

Cost Effective Resources & Infrastructure

Reduce your cloud costs without sacrificing performance or reliability. mCloud delivers enterprise grade cloud infrastructure at a fraction of the cost of AWS, Google Cloud, or Azure.

Fault-Tolerant Tier IV Data Centre

Micron21 operates Australia’s first Tier IV-certified data centre, offering 100% uptime, redundant power, and high availability architecture.

24/7 Australian-based Expert Support

Our cloud specialists provide 24/7 Australian-based support, ensuring seamless deployments and efficient troubleshooting.

NVIDIA A100 (80 GB)

High-Performance Computing

$3,230 AUD / month

Minimum Specifications

Provided as a High-Availability mCloud Virtual Cloud Server

GPU: NVIDIA A100 (80 GB)
GPU Compute: Dedicated
vCPU: 12 Cores - XEON Gold
RAM: 64 GB - DDR4
Storage: 500 GB - NVMe SSD
Bandwidth: 2 TB p/m
IP Address: Included
DDoS Protection: Shield

Overview

Maximum Memory, Maximum Throughput

The A100 80GB pairs NVIDIA's Ampere architecture and third-generation Tensor Cores with 80GB of HBM2e memory and the world's fastest GPU bandwidth - over 2 TB/s. That keeps the largest models and most massive datasets resident on the GPU, speeding time to solution for the most demanding AI and HPC workloads, while Multi-Instance GPU partitions one card into seven isolated 10GB instances.

80GB HBM2e

GPU memory

2,039 GB/s

Memory bandwidth (over 2 TB/s)

MIG instances @ 10GB

95%

DRAM utilisation efficiency

Capabilities

What Makes the A100 80GB Different

Third-Gen Tensor Cores

Up to 312 TFLOPS of deep-learning performance and 20× the Tensor throughput of the previous Volta generation for training and inference.

Multi-Instance GPU (MIG)

Partition a single card into as many as seven fully isolated 10GB instances, each with its own memory, cache, and compute - right-sized acceleration at scale.

Structural Sparsity

Tensor Cores exploit sparsity in AI models to deliver up to 2× higher performance, most notably for inference but also during training.

Next-Gen NVLink

Connect two GPUs over an NVLink bridge at 600 GB/s - double the previous generation's throughput - for workloads that span multiple cards.

80GB HBM2e Memory

The world's fastest GPU memory - over 2 TB/s of bandwidth at 95% DRAM utilisation efficiency, and 1.7× the bandwidth of the previous generation.

Every Math Precision

One accelerator for every job - FP64 and TF32 for HPC and training, BF16/FP16 for deep learning, and INT8 for high-throughput inference.

Specifications

Technical Specifications

Compute & Tensor Cores

FP649.7 TFLOPS
FP64 Tensor Core19.5 TFLOPS
FP3219.5 TFLOPS
TF32 Tensor Core156 TFLOPS 312 TFLOPS with sparsity
BFLOAT16 Tensor Core312 TFLOPS 624 TFLOPS with sparsity
FP16 Tensor Core312 TFLOPS 624 TFLOPS with sparsity
INT8 Tensor Core624 TOPS 1,248 TOPS with sparsity

Memory, Platform & Form Factor

GPU memory80GB HBM2e
Memory bandwidth1,935 GB/s
Max thermal design power300W
Multi-Instance GPUUp to 7 @ 10GB
InterconnectNVLink 600 GB/s PCIe Gen4 64 GB/s
Form factorPCIe
GPU architectureNVIDIA Ampere

Specifications per the NVIDIA A100 Tensor Core GPU datasheet (r4). Peak rates marked “with sparsity” require structural-sparsity-enabled models.

Performance

Built for the Largest Workloads

Each figure compares the A100 against a different reference point, as published by NVIDIA - note the baseline beneath each number.

2 TB/s

Memory bandwidth

Over 2 TB/s of HBM2e bandwidth: the fastest GPU memory available.

249×

AI inference throughput

BERT-Large inference vs a CPU-only server (INT8 with sparsity).

20×

Higher performance

vs the prior NVIDIA Volta generation, across training and inference.

95%

DRAM utilisation

HBM2e keeps the memory system working at near-peak efficiency.

Where It Fits

The Right Card for the Job

Ideal workloads

What the A100 80GB runs best

The largest AI models - keep models with billions of parameters resident in 80GB of memory.
Massive datasets - train and analyse data that won't fit on smaller accelerators.
Large-scale training - run full batch sizes without compromising on memory.
High-throughput inference - INT8 with structural sparsity at production scale.
Memory-bound HPC - simulations and analytics that demand maximum bandwidth.

Why run it on mCloud

Australian, on-demand, supported

Tier IV data centre - Australia's first, with redundant power and high availability.
Fast underlying platform - NVMe storage and 100Gbps networking feed the GPU.
24/7 Australian support - local cloud specialists for deployment and troubleshooting.
Pay for what you use - scale instances up or down to match demand.
OpenStack & IaC ready - automate provisioning with the API and Terraform.

Deploy the A100 80GB today

Spin up flagship GPU compute on our Tier IV Australian cloud, or talk to our specialists about sizing the right configuration for your workload.

NVIDIA A100 80GB

The performance flagship for the largest AI models and most massive datasets - 80GB of HBM2e and the world's fastest GPU memory bandwidth, available on demand from our Tier IV Australian cloud.