Compare XPUs

Select up to 5 XPUs to compare side-by-side

Select XPUs to Compare

Clear all (6)Jump to results

Filter by Vendor

Showing 128 XPUs • 6 selected

Alibaba

Hanguang 800

AMD

MI100

23.1 TFLOPs

AMD

MI210

181 TFLOPs

AMD

MI250X

383 TFLOPs

AMD

MI300X

1,307 TFLOPs

AMD

MI325X

1,400 TFLOPs

AMD

MI350X

2,100 TFLOPs

AMD

MI355X

5,300 TFLOPs

AMD

Radeon Pro V520

23.04 TFLOPs

AWS

Inferentia2

190 TFLOPs

AWS

Trainium

190 TFLOPs

AWS

Trainium2

680 TFLOPs

Baidu

Kunlun II

Biren Technology

BR100

Cambricon

MLU370

256 TFLOPs

Cerebras

WSE-3

Enflame Technology

CloudBlazer T20

FuriosaAI

RNGD (Renegade)

256 TFLOPs

FuriosaAI

Warboy

Google

TPU v4

275 TFLOPs

Google

TPU v5e

197 TFLOPs

Google

TPU v5p

459 TFLOPs

Graphcore

Bow IPU

Graphcore

IPU-M2000

Groq

LPU Inference Engine

Huawei

Ascend 910B

Iluvatar CoreX

BI-V150

300 TFLOPs

Intel

Data Center GPU Max 1100

177 TFLOPs

Intel

Data Center GPU Max 1550

419 TFLOPs

Intel Habana

Gaudi 2

432 TFLOPs

Intel Habana

Gaudi 3

1,835 TFLOPs

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Compute Performance (BF16)

Memory Capacity

Power Consumption

Power Efficiency

Specifications

Specification	NVIDIA H100 SXM	NVIDIA RTX 6000 Ada Generation	NVIDIA GeForce GT 710	NVIDIA P100	NVIDIA A100 SXM	NVIDIA T4
Architecture	Hopper	Ada Lovelace	Kepler	Pascal	Ampere	Turing
Form Factor	SXM	PCIe	PCIe	SXM	SXM	PCIe
VRAM	80 GB	48 GB	2 GB	16 GB	80 GB	16 GB
Memory Bandwidth	3,350 GB/s	960 GB/s	14.4 GB/s	732 GB/s	2,039 GB/s	320 GB/s
TFLOPs (FP32)	67	91.1	0.366	9.3	19.5	8.1
TFLOPs (FP16)	1,979	—	—	—	312	—
TFLOPs	1,979	182.5	0.366	9.3	312	65
TFLOPs (FP8)	3,958	—	—	—	—	—
TDP	700 W	300 W	19 W	300 W	400 W	70 W
Launch Date	Sep 2022	Sep 2022	Mar 2014	Apr 2016	May 2020	Sep 2018

Efficiency Metrics

Metric	H100 SXM	RTX 6000 Ada Generation	GeForce GT 710	P100	A100 SXM	T4
TFLOPs per Watt (FP32-eq)	1.41	0.30	0.02	0.03	0.39	0.46
Memory Bandwidth per GB	41.9 GB/s	20.0 GB/s	7.2 GB/s	45.8 GB/s	25.5 GB/s	20.0 GB/s

Performance Equivalence

How many units of each GPU are needed to match the performance of the others?

To match 1x NVIDIA H100 SXM

NVIDIA RTX 6000 Ada Generation

Compute (FP32-eq)

10.84x

Need 10.84x RTX 6000 Ada Generation

FP32 Compute

0.74x

RTX 6000 Ada Generation is 1.36x faster

VRAM

1.67x

Need 1.67x RTX 6000 Ada Generation

Memory Bandwidth

3.49x

Need 3.49x RTX 6000 Ada Generation

NVIDIA GeForce GT 710

Compute (FP32-eq)

2703.55x

Need 2703.55x GeForce GT 710

FP32 Compute

183.06x

Need 183.06x GeForce GT 710

VRAM

40.00x

Need 40.00x GeForce GT 710

Memory Bandwidth

232.64x

Need 232.64x GeForce GT 710

NVIDIA P100

Compute (FP32-eq)

106.40x

Need 106.40x P100

FP32 Compute

7.20x

Need 7.20x P100

VRAM

5.00x

Need 5.00x P100

Memory Bandwidth

4.58x

Need 4.58x P100

NVIDIA A100 SXM

Compute (FP32-eq)

6.34x

Need 6.34x A100 SXM

FP32 Compute

3.44x

Need 3.44x A100 SXM

VRAM

1.00x

A100 SXM has 1.00x more

Memory Bandwidth

1.64x

Need 1.64x A100 SXM

NVIDIA T4

Compute (FP32-eq)

30.45x

Need 30.45x T4

FP32 Compute

8.27x

Need 8.27x T4

VRAM

5.00x

Need 5.00x T4

Memory Bandwidth

10.47x

Need 10.47x T4

To match 1x NVIDIA RTX 6000 Ada Generation

NVIDIA H100 SXM

Compute (FP32-eq)

0.09x

H100 SXM is 10.84x faster

FP32 Compute

1.36x

Need 1.36x H100 SXM

VRAM

0.60x

H100 SXM has 1.67x more

Memory Bandwidth

0.29x

H100 SXM has 3.49x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

249.32x

Need 249.32x GeForce GT 710

FP32 Compute

248.91x

Need 248.91x GeForce GT 710

VRAM

24.00x

Need 24.00x GeForce GT 710

Memory Bandwidth

66.67x

Need 66.67x GeForce GT 710

NVIDIA P100

Compute (FP32-eq)

9.81x

Need 9.81x P100

FP32 Compute

9.80x

Need 9.80x P100

VRAM

3.00x

Need 3.00x P100

Memory Bandwidth

1.31x

Need 1.31x P100

NVIDIA A100 SXM

Compute (FP32-eq)

0.58x

A100 SXM is 1.71x faster

FP32 Compute

4.67x

Need 4.67x A100 SXM

VRAM

0.60x

A100 SXM has 1.67x more

Memory Bandwidth

0.47x

A100 SXM has 2.12x more

NVIDIA T4

Compute (FP32-eq)

2.81x

Need 2.81x T4

FP32 Compute

11.25x

Need 11.25x T4

VRAM

3.00x

Need 3.00x T4

Memory Bandwidth

3.00x

Need 3.00x T4

To match 1x NVIDIA GeForce GT 710

NVIDIA H100 SXM

Compute (FP32-eq)

0.00x

H100 SXM is 2703.55x faster

FP32 Compute

0.01x

H100 SXM is 183.06x faster

VRAM

0.03x

H100 SXM has 40.00x more

Memory Bandwidth

0.00x

H100 SXM has 232.64x more

NVIDIA RTX 6000 Ada Generation

Compute (FP32-eq)

0.00x

RTX 6000 Ada Generation is 249.32x faster

FP32 Compute

0.00x

RTX 6000 Ada Generation is 248.91x faster

VRAM

0.04x

RTX 6000 Ada Generation has 24.00x more

Memory Bandwidth

0.02x

RTX 6000 Ada Generation has 66.67x more

NVIDIA P100

Compute (FP32-eq)

0.04x

P100 is 25.41x faster

FP32 Compute

0.04x

P100 is 25.41x faster

VRAM

0.13x

P100 has 8.00x more

Memory Bandwidth

0.02x

P100 has 50.83x more

NVIDIA A100 SXM

Compute (FP32-eq)

0.00x

A100 SXM is 426.23x faster

FP32 Compute

0.02x

A100 SXM is 53.28x faster

VRAM

0.03x

A100 SXM has 40.00x more

Memory Bandwidth

0.01x

A100 SXM has 141.60x more

NVIDIA T4

Compute (FP32-eq)

0.01x

T4 is 88.80x faster

FP32 Compute

0.05x

T4 is 22.13x faster

VRAM

0.13x

T4 has 8.00x more

Memory Bandwidth

0.04x

T4 has 22.22x more

To match 1x NVIDIA P100

NVIDIA H100 SXM

Compute (FP32-eq)

0.01x

H100 SXM is 106.40x faster

FP32 Compute

0.14x

H100 SXM is 7.20x faster

VRAM

0.20x

H100 SXM has 5.00x more

Memory Bandwidth

0.22x

H100 SXM has 4.58x more

NVIDIA RTX 6000 Ada Generation

Compute (FP32-eq)

0.10x

RTX 6000 Ada Generation is 9.81x faster

FP32 Compute

0.10x

RTX 6000 Ada Generation is 9.80x faster

VRAM

0.33x

RTX 6000 Ada Generation has 3.00x more

Memory Bandwidth

0.76x

RTX 6000 Ada Generation has 1.31x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

25.41x

Need 25.41x GeForce GT 710

FP32 Compute

25.41x

Need 25.41x GeForce GT 710

VRAM

8.00x

Need 8.00x GeForce GT 710

Memory Bandwidth

50.83x

Need 50.83x GeForce GT 710

NVIDIA A100 SXM

Compute (FP32-eq)

0.06x

A100 SXM is 16.77x faster

FP32 Compute

0.48x

A100 SXM is 2.10x faster

VRAM

0.20x

A100 SXM has 5.00x more

Memory Bandwidth

0.36x

A100 SXM has 2.79x more

NVIDIA T4

Compute (FP32-eq)

0.29x

T4 is 3.49x faster

FP32 Compute

1.15x

Need 1.15x T4

VRAM

1.00x

T4 has 1.00x more

Memory Bandwidth

2.29x

Need 2.29x T4

To match 1x NVIDIA A100 SXM

NVIDIA H100 SXM

Compute (FP32-eq)

0.16x

H100 SXM is 6.34x faster

FP32 Compute

0.29x

H100 SXM is 3.44x faster

VRAM

1.00x

H100 SXM has 1.00x more

Memory Bandwidth

0.61x

H100 SXM has 1.64x more

NVIDIA RTX 6000 Ada Generation

Compute (FP32-eq)

1.71x

Need 1.71x RTX 6000 Ada Generation

FP32 Compute

0.21x

RTX 6000 Ada Generation is 4.67x faster

VRAM

1.67x

Need 1.67x RTX 6000 Ada Generation

Memory Bandwidth

2.12x

Need 2.12x RTX 6000 Ada Generation

NVIDIA GeForce GT 710

Compute (FP32-eq)

426.23x

Need 426.23x GeForce GT 710

FP32 Compute

53.28x

Need 53.28x GeForce GT 710

VRAM

40.00x

Need 40.00x GeForce GT 710

Memory Bandwidth

141.60x

Need 141.60x GeForce GT 710

NVIDIA P100

Compute (FP32-eq)

16.77x

Need 16.77x P100

FP32 Compute

2.10x

Need 2.10x P100

VRAM

5.00x

Need 5.00x P100

Memory Bandwidth

2.79x

Need 2.79x P100

NVIDIA T4

Compute (FP32-eq)

4.80x

Need 4.80x T4

FP32 Compute

2.41x

Need 2.41x T4

VRAM

5.00x

Need 5.00x T4

Memory Bandwidth

6.37x

Need 6.37x T4

To match 1x NVIDIA T4

NVIDIA H100 SXM

Compute (FP32-eq)

0.03x

H100 SXM is 30.45x faster

FP32 Compute

0.12x

H100 SXM is 8.27x faster

VRAM

0.20x

H100 SXM has 5.00x more

Memory Bandwidth

0.10x

H100 SXM has 10.47x more

NVIDIA RTX 6000 Ada Generation

Compute (FP32-eq)

0.36x

RTX 6000 Ada Generation is 2.81x faster

FP32 Compute

0.09x

RTX 6000 Ada Generation is 11.25x faster

VRAM

0.33x

RTX 6000 Ada Generation has 3.00x more

Memory Bandwidth

0.33x

RTX 6000 Ada Generation has 3.00x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

88.80x

Need 88.80x GeForce GT 710

FP32 Compute

22.13x

Need 22.13x GeForce GT 710

VRAM

8.00x

Need 8.00x GeForce GT 710

Memory Bandwidth

22.22x

Need 22.22x GeForce GT 710

NVIDIA P100

Compute (FP32-eq)

3.49x

Need 3.49x P100

FP32 Compute

0.87x

P100 is 1.15x faster

VRAM

1.00x

P100 has 1.00x more

Memory Bandwidth

0.44x

P100 has 2.29x more

NVIDIA A100 SXM

Compute (FP32-eq)

0.21x

A100 SXM is 4.80x faster

FP32 Compute

0.42x

A100 SXM is 2.41x faster

VRAM

0.20x

A100 SXM has 5.00x more

Memory Bandwidth

0.16x

A100 SXM has 6.37x more

Pricing

Price Type	H100 SXM	RTX 6000 Ada Generation	GeForce GT 710	P100	A100 SXM	T4
CAPEX (Street Price)	$30,000	—	—	—	$15,000	—
OPEX (per hour)	$3.50/hr	$0.33/hr	$0.07/hr	$0.28/hr	$4.05/hr	$0.27/hr
Price per TFLOPs (FP32-eq)	$30	—	—	—	$96	—