Compare XPUs

Select up to 5 XPUs to compare side-by-side

Select XPUs to Compare

Clear all (6)Jump to results

Filter by Vendor

Showing 128 XPUs • 6 selected

Alibaba

Hanguang 800

AMD

MI100

23.1 TFLOPs

AMD

MI210

181 TFLOPs

AMD

MI250X

383 TFLOPs

AMD

MI300X

1,307 TFLOPs

AMD

MI325X

1,400 TFLOPs

AMD

MI350X

2,100 TFLOPs

AMD

MI355X

5,300 TFLOPs

AMD

Radeon Pro V520

23.04 TFLOPs

AWS

Inferentia2

190 TFLOPs

AWS

Trainium

190 TFLOPs

AWS

Trainium2

680 TFLOPs

Baidu

Kunlun II

Biren Technology

BR100

Cambricon

MLU370

256 TFLOPs

Cerebras

WSE-3

Enflame Technology

CloudBlazer T20

FuriosaAI

RNGD (Renegade)

256 TFLOPs

FuriosaAI

Warboy

Google

TPU v4

275 TFLOPs

Google

TPU v5e

197 TFLOPs

Google

TPU v5p

459 TFLOPs

Graphcore

Bow IPU

Graphcore

IPU-M2000

Groq

LPU Inference Engine

Huawei

Ascend 910B

Iluvatar CoreX

BI-V150

300 TFLOPs

Intel

Data Center GPU Max 1100

177 TFLOPs

Intel

Data Center GPU Max 1550

419 TFLOPs

Intel Habana

Gaudi 2

432 TFLOPs

Intel Habana

Gaudi 3

1,835 TFLOPs

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Compute Performance (BF16)

Memory Capacity

Power Consumption

Power Efficiency

Specifications

Specification	Intel Data Center GPU Max 1100	AMD MI250X	NVIDIA B100	NVIDIA GeForce GT 710	NVIDIA RTX 5000	NVIDIA Quadro M4000
Architecture	Ponte Vecchio	CDNA 2	Blackwell	Kepler	Turing	Maxwell
Form Factor	OAM	OAM	SXM	PCIe	PCIe	PCIe
VRAM	48 GB	128 GB	192 GB	2 GB	16 GB	8 GB
Memory Bandwidth	1,229 GB/s	3,277 GB/s	8,000 GB/s	14.4 GB/s	448 GB/s	192 GB/s
TFLOPs (FP32)	22	47.9	45	0.366	11.2	2.57
TFLOPs (FP16)	177	383	—	—	—	—
TFLOPs	177	383	1,800	0.366	89.2	2.57
TFLOPs (FP8)	—	—	—	—	—	—
TDP	300 W	560 W	700 W	19 W	265 W	120 W
Launch Date	Jan 2023	Nov 2021	Mar 2024	Mar 2014	Aug 2018	Jun 2015

Efficiency Metrics

Metric	Data Center GPU Max 1100	MI250X	B100	GeForce GT 710	RTX 5000	Quadro M4000
TFLOPs per Watt (FP32-eq)	0.29	0.34	1.29	0.02	0.17	0.02
Memory Bandwidth per GB	25.6 GB/s	25.6 GB/s	41.7 GB/s	7.2 GB/s	28.0 GB/s	24.0 GB/s

Performance Equivalence

How many units of each GPU are needed to match the performance of the others?

To match 1x Intel Data Center GPU Max 1100

AMD MI250X

Compute (FP32-eq)

0.46x

MI250X is 2.16x faster

FP32 Compute

0.46x

MI250X is 2.18x faster

VRAM

0.38x

MI250X has 2.67x more

Memory Bandwidth

0.38x

MI250X has 2.67x more

NVIDIA B100

Compute (FP32-eq)

0.10x

B100 is 10.17x faster

FP32 Compute

0.49x

B100 is 2.05x faster

VRAM

0.25x

B100 has 4.00x more

Memory Bandwidth

0.15x

B100 has 6.51x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

241.80x

Need 241.80x GeForce GT 710

FP32 Compute

60.11x

Need 60.11x GeForce GT 710

VRAM

24.00x

Need 24.00x GeForce GT 710

Memory Bandwidth

85.35x

Need 85.35x GeForce GT 710

NVIDIA RTX 5000

Compute (FP32-eq)

1.98x

Need 1.98x RTX 5000

FP32 Compute

1.96x

Need 1.96x RTX 5000

VRAM

3.00x

Need 3.00x RTX 5000

Memory Bandwidth

2.74x

Need 2.74x RTX 5000

NVIDIA Quadro M4000

Compute (FP32-eq)

34.44x

Need 34.44x Quadro M4000

FP32 Compute

8.56x

Need 8.56x Quadro M4000

VRAM

6.00x

Need 6.00x Quadro M4000

Memory Bandwidth

6.40x

Need 6.40x Quadro M4000

To match 1x AMD MI250X

Intel Data Center GPU Max 1100

Compute (FP32-eq)

2.16x

Need 2.16x Data Center GPU Max 1100

FP32 Compute

2.18x

Need 2.18x Data Center GPU Max 1100

VRAM

2.67x

Need 2.67x Data Center GPU Max 1100

Memory Bandwidth

2.67x

Need 2.67x Data Center GPU Max 1100

NVIDIA B100

Compute (FP32-eq)

0.21x

B100 is 4.70x faster

FP32 Compute

1.06x

Need 1.06x B100

VRAM

0.67x

B100 has 1.50x more

Memory Bandwidth

0.41x

B100 has 2.44x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

523.22x

Need 523.22x GeForce GT 710

FP32 Compute

130.87x

Need 130.87x GeForce GT 710

VRAM

64.00x

Need 64.00x GeForce GT 710

Memory Bandwidth

227.57x

Need 227.57x GeForce GT 710

NVIDIA RTX 5000

Compute (FP32-eq)

4.29x

Need 4.29x RTX 5000

FP32 Compute

4.28x

Need 4.28x RTX 5000

VRAM

8.00x

Need 8.00x RTX 5000

Memory Bandwidth

7.31x

Need 7.31x RTX 5000

NVIDIA Quadro M4000

Compute (FP32-eq)

74.51x

Need 74.51x Quadro M4000

FP32 Compute

18.64x

Need 18.64x Quadro M4000

VRAM

16.00x

Need 16.00x Quadro M4000

Memory Bandwidth

17.07x

Need 17.07x Quadro M4000

To match 1x NVIDIA B100

Intel Data Center GPU Max 1100

Compute (FP32-eq)

10.17x

Need 10.17x Data Center GPU Max 1100

FP32 Compute

2.05x

Need 2.05x Data Center GPU Max 1100

VRAM

4.00x

Need 4.00x Data Center GPU Max 1100

Memory Bandwidth

6.51x

Need 6.51x Data Center GPU Max 1100

AMD MI250X

Compute (FP32-eq)

4.70x

Need 4.70x MI250X

FP32 Compute

0.94x

MI250X is 1.06x faster

VRAM

1.50x

Need 1.50x MI250X

Memory Bandwidth

2.44x

Need 2.44x MI250X

NVIDIA GeForce GT 710

Compute (FP32-eq)

2459.02x

Need 2459.02x GeForce GT 710

FP32 Compute

122.95x

Need 122.95x GeForce GT 710

VRAM

96.00x

Need 96.00x GeForce GT 710

Memory Bandwidth

555.56x

Need 555.56x GeForce GT 710

NVIDIA RTX 5000

Compute (FP32-eq)

20.18x

Need 20.18x RTX 5000

FP32 Compute

4.02x

Need 4.02x RTX 5000

VRAM

12.00x

Need 12.00x RTX 5000

Memory Bandwidth

17.86x

Need 17.86x RTX 5000

NVIDIA Quadro M4000

Compute (FP32-eq)

350.19x

Need 350.19x Quadro M4000

FP32 Compute

17.51x

Need 17.51x Quadro M4000

VRAM

24.00x

Need 24.00x Quadro M4000

Memory Bandwidth

41.67x

Need 41.67x Quadro M4000

To match 1x NVIDIA GeForce GT 710

Intel Data Center GPU Max 1100

Compute (FP32-eq)

0.00x

Data Center GPU Max 1100 is 241.80x faster

FP32 Compute

0.02x

Data Center GPU Max 1100 is 60.11x faster

VRAM

0.04x

Data Center GPU Max 1100 has 24.00x more

Memory Bandwidth

0.01x

Data Center GPU Max 1100 has 85.35x more

AMD MI250X

Compute (FP32-eq)

0.00x

MI250X is 523.22x faster

FP32 Compute

0.01x

MI250X is 130.87x faster

VRAM

0.02x

MI250X has 64.00x more

Memory Bandwidth

0.00x

MI250X has 227.57x more

NVIDIA B100

Compute (FP32-eq)

0.00x

B100 is 2459.02x faster

FP32 Compute

0.01x

B100 is 122.95x faster

VRAM

0.01x

B100 has 96.00x more

Memory Bandwidth

0.00x

B100 has 555.56x more

NVIDIA RTX 5000

Compute (FP32-eq)

0.01x

RTX 5000 is 121.86x faster

FP32 Compute

0.03x

RTX 5000 is 30.60x faster

VRAM

0.13x

RTX 5000 has 8.00x more

Memory Bandwidth

0.03x

RTX 5000 has 31.11x more

NVIDIA Quadro M4000

Compute (FP32-eq)

0.14x

Quadro M4000 is 7.02x faster

FP32 Compute

0.14x

Quadro M4000 is 7.02x faster

VRAM

0.25x

Quadro M4000 has 4.00x more

Memory Bandwidth

0.07x

Quadro M4000 has 13.33x more

To match 1x NVIDIA RTX 5000

Intel Data Center GPU Max 1100

Compute (FP32-eq)

0.50x

Data Center GPU Max 1100 is 1.98x faster

FP32 Compute

0.51x

Data Center GPU Max 1100 is 1.96x faster

VRAM

0.33x

Data Center GPU Max 1100 has 3.00x more

Memory Bandwidth

0.36x

Data Center GPU Max 1100 has 2.74x more

AMD MI250X

Compute (FP32-eq)

0.23x

MI250X is 4.29x faster

FP32 Compute

0.23x

MI250X is 4.28x faster

VRAM

0.13x

MI250X has 8.00x more

Memory Bandwidth

0.14x

MI250X has 7.31x more

NVIDIA B100

Compute (FP32-eq)

0.05x

B100 is 20.18x faster

FP32 Compute

0.25x

B100 is 4.02x faster

VRAM

0.08x

B100 has 12.00x more

Memory Bandwidth

0.06x

B100 has 17.86x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

121.86x

Need 121.86x GeForce GT 710

FP32 Compute

30.60x

Need 30.60x GeForce GT 710

VRAM

8.00x

Need 8.00x GeForce GT 710

Memory Bandwidth

31.11x

Need 31.11x GeForce GT 710

NVIDIA Quadro M4000

Compute (FP32-eq)

17.35x

Need 17.35x Quadro M4000

FP32 Compute

4.36x

Need 4.36x Quadro M4000

VRAM

2.00x

Need 2.00x Quadro M4000

Memory Bandwidth

2.33x

Need 2.33x Quadro M4000

To match 1x NVIDIA Quadro M4000

Intel Data Center GPU Max 1100

Compute (FP32-eq)

0.03x

Data Center GPU Max 1100 is 34.44x faster

FP32 Compute

0.12x

Data Center GPU Max 1100 is 8.56x faster

VRAM

0.17x

Data Center GPU Max 1100 has 6.00x more

Memory Bandwidth

0.16x

Data Center GPU Max 1100 has 6.40x more

AMD MI250X

Compute (FP32-eq)

0.01x

MI250X is 74.51x faster

FP32 Compute

0.05x

MI250X is 18.64x faster

VRAM

0.06x

MI250X has 16.00x more

Memory Bandwidth

0.06x

MI250X has 17.07x more

NVIDIA B100

Compute (FP32-eq)

0.00x

B100 is 350.19x faster

FP32 Compute

0.06x

B100 is 17.51x faster

VRAM

0.04x

B100 has 24.00x more

Memory Bandwidth

0.02x

B100 has 41.67x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

7.02x

Need 7.02x GeForce GT 710

FP32 Compute

7.02x

Need 7.02x GeForce GT 710

VRAM

4.00x

Need 4.00x GeForce GT 710

Memory Bandwidth

13.33x

Need 13.33x GeForce GT 710

NVIDIA RTX 5000

Compute (FP32-eq)

0.06x

RTX 5000 is 17.35x faster

FP32 Compute

0.23x

RTX 5000 is 4.36x faster

VRAM

0.50x

RTX 5000 has 2.00x more

Memory Bandwidth

0.43x

RTX 5000 has 2.33x more

Pricing

Price Type	Data Center GPU Max 1100	MI250X	B100	GeForce GT 710	RTX 5000	Quadro M4000
CAPEX (Street Price)	$5,000	$12,000	—	—	—	—
OPEX (per hour)	—	$2.00/hr	—	$0.07/hr	$0.82/hr	$0.45/hr
Price per TFLOPs (FP32-eq)	$56	$63	—	—	—	—