NVIDIA  H200 Tensor Core GPU Server

Next-Gen AI & HPC Acceleration with 141 GB HBM3e and 4.8 TB/s Bandwidth — Built for Massive Models, Speed & Scalability

Bare Metal Server

NVIDIA H200 Tensor Core GPU Server Price

?
Use tabs to select server type. Filter by location, CPU, and RAM. Click any row to see pricing by location.
USD
EUR
GPU
Server
RAM
Storage
Network
Locations
Price
H200
141 GB • 16896 CUDA
Frontier AI
VRAM: 141 GB
CUDA Cores: 16,896
Tensor Cores: 528
TFLOPS: 51
H200
2x AMD EPYC 9224
48C / 96T • 2.5GHz
GPU Specs141 GB • 16,896 CUDA • 528 Tensor • 51 TFLOPSFrontier AI
CPU48C / 96T2.5GHz
RAM128 GB
Storage2x960GB SSD
Network1Gbps100 TB
LocationMontrealCanada
128 GB
2x960GB SSD
1Gbps
100 TB
Canada
Montreal
$2,571.60/mo
Deploy Now →
H200
141 GB • 16896 CUDA
Frontier AI
VRAM: 141 GB
CUDA Cores: 16,896
Tensor Cores: 528
TFLOPS: 51
H200
2x AMD EPYC 9334
64C / 128T • 2.7GHz
GPU Specs141 GB • 16,896 CUDA • 528 Tensor • 51 TFLOPSFrontier AI
CPU64C / 128T2.7GHz
RAM128 GB
Storage2x960GB SSD
Network1Gbps100 TB
LocationMontrealCanada
128 GB
2x960GB SSD
1Gbps
100 TB
Canada
Montreal
$2,652.36/mo
Deploy Now →
H200
141 GB • 16896 CUDA
Frontier AI
VRAM: 141 GB
CUDA Cores: 16,896
Tensor Cores: 528
TFLOPS: 51
H200
2x AMD EPYC 9224
48C / 96T • 2.5GHz
GPU Specs141 GB • 16,896 CUDA • 528 Tensor • 51 TFLOPSFrontier AI
CPU48C / 96T2.5GHz
RAM128 GB
Storage2x960GB SSD
Network1Gbps30 TB
LocationLondonUnited Kingdom
128 GB
2x960GB SSD
1Gbps
30 TB
United Kingdom
London
$2,900.40/mo
Deploy Now →
2x H200
141 GB/GPU • 16896 CUDA
Frontier AI
VRAM: 141 GB/GPU
CUDA Cores: 16,896
Tensor Cores: 528
TFLOPS: 51
2x H200
2x AMD EPYC 9334
64C / 128T • 2.7GHz
GPU Specs141 GB/GPU • 16,896 CUDA • 528 Tensor • 51 TFLOPSFrontier AI
CPU64C / 128T2.7GHz
RAM128 GB
Storage2x960GB SSD
Network1Gbps100 TB
LocationMontrealCanada
128 GB
2x960GB SSD
1Gbps
100 TB
Canada
Montreal
$4,685.18/mo
Deploy Now →
All Bare Metal Plans Include
Setup Cost: Free
Provisioning: Instant/Automated
Access: KVM / IPMI / Root
Protection: DDoS Shield Included
Uptime: 99.9% SLA Guaranteed
Memory: Up to 2TB RAM
Storage: Enterprise NVMe/SSD
Support: 24/7/365 Human Engineers

Don't see what you're looking for?

🚀 Key Specifications

  • Architecture: NVIDIA Hopper (GH100, 5 nm), successor to H100.
  • Memory: 141 GB HBM3e, 4.8 TB/s bandwidth.
  • Compute (SXM / NVL):
    • FP64: 34 TFLOPS
    • TF32 Tensor Core: 989 TFLOPS
    • FP16/BF16 Tensor Core: 1,979 TFLOPS
    • FP8/INT8 Tensor Core: 3,958 TFLOPS
  • Form Factors & TDP:
      • SXM: 700 W, NVLink 900 GB/s
      • NVL (PCIe): up to 600 W, PCIe Gen 5 ×16
  • Interconnects: NVLink for multi-GPU scaling (SXM), PCIe 5.0 (128 GB/s) on NVL.
  • Multi-Instance GPU: Supports 7 MIGs (~18 GB each)
NVIDIA H200 Tensor Core GPU

⚡ Why Choose H200?

Generational leap

Generational leap over H100: double memory capacity, 2.4× memory bandwidth, ideal for trillion‑parameter LLMs.

Transformer & DPX engines

Transformer & DPX engines for 40× faster dynamic programming and new precision formats.

Petaflop-scale clusters

Petaflop-scale clusters with NVLink/NVSwitch, perfect for large HPC or AI workloads.

Enterprise-ready

Enterprise-ready: secure boot, firmware integrity, included NVIDIA AI Enterprise stack, and 5-year enterprise support.

Seamless integration

Compatible with modern AI/data center stacks, CUDA 12, TensorRT, DLSS, vGPU setups, Kubernetes, and video pipelines.

🎯 Ideal Use Cases

Large-Scale LLM Training & Inference

Optimized for 100B+ parameter models and long-context generation.

Scientific Computing & Simulations

Perfect for CFD, molecular dynamics, and engineering modeling, up to 1.9× faster than A100.

Real-Time Data Analytics & RAG Systems

High throughput support for retrieval-augmented generation and vision/speech AI .

Massive Multi-GPU Supercomputing

Build scalable clusters with NVLink 900 GB/s and NVSwitch for near-linear scaling .

Secure & Multi-Tenant Cloud AI Services

MIG partitions provide hardware isolation for private inference or VDI workloads.

  • 0.00

CPU

1vCore

    • RAM
    • 1GB
    • Storage
    • 20GB
    • Traffic
    • 500GB
    • Location/Setup
    • NL

  • 0.00

CPU

2vCore

    • RAM
    • 2GB
    • Storage
    • 40GB
    • Traffic
    • 500GB
    • Location/Setup
    • NL

  • 0.00

CPU

4vCore

    • RAM
    • 4GB
    • Storage
    • 80GB
    • Traffic
    • 500GB
    • Location/Setup
    • NL

  • 0.00

CPU

8vCore

    • RAM
    • 8GB
    • Storage
    • 160GB
    • Traffic
    • 1000GB
    • Location/Setup
    • NL

  • 0.00

CPU

16vCores

    • RAM
    • 16GB
    • Storage
    • 320GB
    • Traffic
    • 1000GB
    • Location/Setup
    • NL
hosting advice logo

4.8

4.7

4.9

hostadvice logo

4.9

Deep Dive & FAQs

- SXM offers top-tier performance and memory bandwidth for large AI/HPC clusters.
- NVL (PCIe) easier to integrate, lower TDP, still DL4‑based, and uses PCIe 5.0 support.

Yes, our data centers are built for high-density, high-power GPU racks.

Absolutely, with NVLink 4.0 and NVSwitch support, you can build pods up to 8 GPUs deep.

Yes, MIG partitions and enterprise firmware enable secure multi-tenant deployment.

Comes with NVIDIA AI Enterprise, NIM microservices, TensorRT, CUDA 11/12, PyTorch, TensorFlow, HPC libraries.

Not sure exactly what you need?
No problem! Our talented engineers are here to help!

We will consult, architect, migrate, manage and do whatever it takes to help your business grow and succeed.

Get in touch today!

Get in touch today!