Deploy the world's fastest GPUs in seconds. NVIDIA H200, Blackwell B300, and GB300 NVL72 AI data center racks, with no contracts and no commitments. Pay only for what you use.
From cutting-edge Blackwell Ultra to battle-tested Hopper, every GPU delivers bare-metal performance, billed per hour.
The NVIDIA GB300 NVL72 isn't a single GPU; it's an entire AI factory in a rack. 72 Blackwell Ultra GPUs fully connected via NVLink 5.0, delivering 1.4 ExaFLOPS of FP4 AI compute and up to 21 TB of unified HBM3e memory. Built for organizations training frontier models and running hyperscale inference.
All-to-all NVLink 5.0 connected. One unified memory domain across the full rack.
Whether you need one rack or a full AI supercluster, SScoreCompute delivers the infrastructure to match.
The GB300 NVL72 isn't just compute — it's a complete AI infrastructure platform.
All 72 GPUs share a single 21 TB HBM3e memory pool over NVLink 5.0. No CPU bottlenecks. Models too large for a single GPU run natively across the full rack.
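A back-of-the-envelope sizing sketch shows why that pool matters (the model size and FP8 precision below are illustrative assumptions, not a benchmark):

```python
# Back-of-the-envelope check: do a large model's weights fit in one rack's
# unified memory pool? All figures below are illustrative assumptions.
PARAMS = 700e9            # 700B-parameter model (example size)
BYTES_PER_PARAM = 1       # FP8 weights: 1 byte per parameter
GPU_HBM_GB = 288          # HBM3e per Blackwell Ultra GPU
GPUS_PER_RACK = 72

weights_gb = PARAMS * BYTES_PER_PARAM / 1e9    # 700 GB of weights
rack_pool_gb = GPU_HBM_GB * GPUS_PER_RACK      # ~20,700 GB unified pool

print(f"weights:   {weights_gb:.0f} GB")       # far beyond any single GPU
print(f"rack pool: {rack_pool_gb:.0f} GB")     # comfortable within the rack
print(f"fits in rack: {weights_gb < rack_pool_gb}")  # True
```

Weights are only part of the footprint; KV cache and activations consume the rest, which is where the rack-scale headroom pays off.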
The GB300 NVL72 packs more AI compute into a single liquid-cooled rack footprint than entire data centers delivered just five years ago.
1.8 TB/s of bidirectional NVLink bandwidth per GPU keeps inter-GPU communication overhead minimal, so 700B+ parameter models train without gradient synchronization becoming the bottleneck.
Scale beyond one rack seamlessly. Our NVL72 clusters are interconnected with 800 Gbps InfiniBand for multi-rack training jobs spanning thousands of GPUs.
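On the software side, nothing exotic is required: multi-GPU and multi-rack jobs run as standard NCCL-backed PyTorch, with NCCL routing collectives over NVLink inside a rack and InfiniBand between racks. A minimal data-parallel sketch (the framework, model, and launch flags are our illustrative choices, not an SScoreCompute requirement):

```python
# Minimal NCCL data-parallel sketch. Launch with torchrun, e.g.:
#   torchrun --nnodes=<num_nodes> --nproc_per_node=<gpus_per_node> train.py
# NCCL carries gradient all-reduces over NVLink intra-rack, InfiniBand inter-rack.
import os

import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

dist.init_process_group(backend="nccl")      # reads env vars set by torchrun
local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)

model = torch.nn.Linear(4096, 4096).cuda()   # stand-in for a real model
model = DDP(model, device_ids=[local_rank])  # gradients sync via NCCL

opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
x = torch.randn(32, 4096, device="cuda")
loss = model(x).square().mean()              # dummy objective
loss.backward()                              # triggers the all-reduce
opt.step()
dist.destroy_process_group()
```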
Deploy NVL72 racks in our Tier-IV data centers or bring them to your facility. Dedicated racks with reserved capacity and 24/7 infrastructure support.
99.9% uptime SLA with dedicated support, proactive monitoring, priority incident response, and quarterly infrastructure reviews for rack-scale deployments.
From single GPUs to full NVL72 racks — configure your workload and see real-time pricing. No surprises.
Everything you need to run serious workloads — nothing you don't.
Go from zero to a running GPU cluster in under 60 seconds. No approval queues, no manual steps.
Dedicated physical nodes — no noisy neighbors, no virtualization overhead. Pure GPU throughput.
GPU utilization, temperature, memory bandwidth, and cost metrics streamed live to your dashboard.
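These are the same counters NVML exposes on any bare-metal node; for reference, a minimal sketch with the nvidia-ml-py bindings (our illustration of what the dashboard streams, not SScoreCompute's monitoring agent):

```python
# Read the raw GPU counters locally with NVML (pip install nvidia-ml-py).
from pynvml import (NVML_TEMPERATURE_GPU, nvmlDeviceGetCount,
                    nvmlDeviceGetHandleByIndex, nvmlDeviceGetMemoryInfo,
                    nvmlDeviceGetTemperature, nvmlDeviceGetUtilizationRates,
                    nvmlInit, nvmlShutdown)

nvmlInit()
for i in range(nvmlDeviceGetCount()):
    h = nvmlDeviceGetHandleByIndex(i)
    util = nvmlDeviceGetUtilizationRates(h)  # .gpu / .memory, percent busy
    temp = nvmlDeviceGetTemperature(h, NVML_TEMPERATURE_GPU)
    mem = nvmlDeviceGetMemoryInfo(h)         # .used / .total, in bytes
    # util.memory is memory-controller utilization, a proxy for bandwidth use
    print(f"GPU{i}: util={util.gpu}% memctl={util.memory}% temp={temp}C "
          f"mem={mem.used / 2**30:.1f}/{mem.total / 2**30:.1f} GiB")
nvmlShutdown()
```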
Deploy in North America, Europe, and Asia-Pacific. Low-latency interconnects between nodes in every region.
Use on-demand instances for critical jobs and spot instances for fault-tolerant training runs at up to 70% lower cost.
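The arithmetic at that discount (the hourly rate below is a made-up placeholder, not a quote; the live calculator is the source of truth):

```python
# Illustrative spot-vs-on-demand comparison. The $/GPU-hour rate is a
# hypothetical placeholder, not SScoreCompute pricing.
ON_DEMAND_RATE = 3.00    # $/GPU-hour (assumed for illustration)
SPOT_DISCOUNT = 0.70     # "up to 70% cost reduction"
GPUS, HOURS = 64, 100    # example training run

on_demand = ON_DEMAND_RATE * GPUS * HOURS
spot = on_demand * (1 - SPOT_DISCOUNT)
print(f"on-demand: ${on_demand:,.0f}")  # $19,200
print(f"spot:      ${spot:,.0f}")       # $5,760
```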
Terraform provider, Python SDK, REST API. Integrate GPU provisioning into any workflow or CI/CD pipeline.
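As a flavor of what that integration looks like, here is a hedged REST sketch; the base URL, token variable, and request fields are hypothetical placeholders, so treat the published API reference as the source of truth:

```python
# Hypothetical REST provisioning sketch. The endpoint URL, auth scheme, and
# request fields are illustrative placeholders, not the documented API.
import os

import requests

API = "https://api.sscorecompute.example/v1"  # placeholder base URL
headers = {"Authorization": f"Bearer {os.environ['SSCORE_API_TOKEN']}"}

resp = requests.post(
    f"{API}/instances",
    headers=headers,
    json={
        "gpu_type": "H200",      # e.g. "B300", or "GB300-NVL72" for racks
        "count": 8,
        "billing": "on-demand",  # or "spot"
        "region": "us-east",
    },
    timeout=30,
)
resp.raise_for_status()
print(resp.json())               # e.g. instance ID, SSH endpoint, hourly rate
```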
From single-GPU fine-tuning to frontier model training on NVL72 racks — if it needs GPUs, it runs on SScoreCompute.