Compare XPUs

Select up to 5 XPUs to compare side-by-side

Select XPUs to Compare

Clear all (6)Jump to results

Filter by Vendor

Showing 128 XPUs • 6 selected

Alibaba

Hanguang 800

AMD

MI100

23.1 TFLOPs

AMD

MI210

181 TFLOPs

AMD

MI250X

383 TFLOPs

AMD

MI300X

1,307 TFLOPs

AMD

MI325X

1,400 TFLOPs

AMD

MI350X

2,100 TFLOPs

AMD

MI355X

5,300 TFLOPs

AMD

Radeon Pro V520

23.04 TFLOPs

AWS

Inferentia2

190 TFLOPs

AWS

Trainium

190 TFLOPs

AWS

Trainium2

680 TFLOPs

Baidu

Kunlun II

Biren Technology

BR100

Cambricon

MLU370

256 TFLOPs

Cerebras

WSE-3

Enflame Technology

CloudBlazer T20

FuriosaAI

RNGD (Renegade)

256 TFLOPs

FuriosaAI

Warboy

Google

TPU v4

275 TFLOPs

Google

TPU v5e

197 TFLOPs

Google

TPU v5p

459 TFLOPs

Graphcore

Bow IPU

Graphcore

IPU-M2000

Groq

LPU Inference Engine

Huawei

Ascend 910B

Iluvatar CoreX

BI-V150

300 TFLOPs

Intel

Data Center GPU Max 1100

177 TFLOPs

Intel

Data Center GPU Max 1550

419 TFLOPs

Intel Habana

Gaudi 2

432 TFLOPs

Intel Habana

Gaudi 3

1,835 TFLOPs

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Compute Performance (BF16)

Memory Capacity

Power Consumption

Power Efficiency

Specifications

Specification	Intel Habana Gaudi 3	Intel Data Center GPU Max 1100	AMD MI250X	NVIDIA Quadro P600	NVIDIA GeForce GT 710	NVIDIA RTX 4000
Architecture	Gaudi Gen3	Ponte Vecchio	CDNA 2	Pascal	Kepler	Turing
Form Factor	OAM	OAM	OAM	PCIe	PCIe	PCIe
VRAM	128 GB	48 GB	128 GB	2 GB	2 GB	8 GB
Memory Bandwidth	3,700 GB/s	1,229 GB/s	3,277 GB/s	64 GB/s	14.4 GB/s	416 GB/s
TFLOPs (FP32)	—	22	47.9	1.117	0.366	7.1
TFLOPs (FP16)	—	177	383	—	—	—
TFLOPs	1,835	177	383	1.117	0.366	57.6
TFLOPs (FP8)	3,670	—	—	—	—	—
TDP	900 W	300 W	560 W	40 W	19 W	160 W
Launch Date	Apr 2024	Jan 2023	Nov 2021	Feb 2017	Mar 2014	Nov 2018

Efficiency Metrics

Metric	Gaudi 3	Data Center GPU Max 1100	MI250X	Quadro P600	GeForce GT 710	RTX 4000
TFLOPs per Watt (FP32-eq)	1.02	0.29	0.34	0.03	0.02	0.18
Memory Bandwidth per GB	28.9 GB/s	25.6 GB/s	25.6 GB/s	32.0 GB/s	7.2 GB/s	52.0 GB/s

Performance Equivalence

How many units of each GPU are needed to match the performance of the others?

To match 1x Intel Habana Gaudi 3

Intel Data Center GPU Max 1100

Compute (FP32-eq)

10.37x

Need 10.37x Data Center GPU Max 1100

VRAM

2.67x

Need 2.67x Data Center GPU Max 1100

Memory Bandwidth

3.01x

Need 3.01x Data Center GPU Max 1100

AMD MI250X

Compute (FP32-eq)

4.79x

Need 4.79x MI250X

VRAM

1.00x

MI250X has 1.00x more

Memory Bandwidth

1.13x

Need 1.13x MI250X

NVIDIA Quadro P600

Compute (FP32-eq)

821.40x

Need 821.40x Quadro P600

VRAM

64.00x

Need 64.00x Quadro P600

Memory Bandwidth

57.81x

Need 57.81x Quadro P600

NVIDIA GeForce GT 710

Compute (FP32-eq)

2506.83x

Need 2506.83x GeForce GT 710

VRAM

64.00x

Need 64.00x GeForce GT 710

Memory Bandwidth

256.94x

Need 256.94x GeForce GT 710

NVIDIA RTX 4000

Compute (FP32-eq)

31.86x

Need 31.86x RTX 4000

VRAM

16.00x

Need 16.00x RTX 4000

Memory Bandwidth

8.89x

Need 8.89x RTX 4000

To match 1x Intel Data Center GPU Max 1100

Intel Habana Gaudi 3

Compute (FP32-eq)

0.10x

Gaudi 3 is 10.37x faster

VRAM

0.38x

Gaudi 3 has 2.67x more

Memory Bandwidth

0.33x

Gaudi 3 has 3.01x more

AMD MI250X

Compute (FP32-eq)

0.46x

MI250X is 2.16x faster

FP32 Compute

0.46x

MI250X is 2.18x faster

VRAM

0.38x

MI250X has 2.67x more

Memory Bandwidth

0.38x

MI250X has 2.67x more

NVIDIA Quadro P600

Compute (FP32-eq)

79.23x

Need 79.23x Quadro P600

FP32 Compute

19.70x

Need 19.70x Quadro P600

VRAM

24.00x

Need 24.00x Quadro P600

Memory Bandwidth

19.20x

Need 19.20x Quadro P600

NVIDIA GeForce GT 710

Compute (FP32-eq)

241.80x

Need 241.80x GeForce GT 710

FP32 Compute

60.11x

Need 60.11x GeForce GT 710

VRAM

24.00x

Need 24.00x GeForce GT 710

Memory Bandwidth

85.35x

Need 85.35x GeForce GT 710

NVIDIA RTX 4000

Compute (FP32-eq)

3.07x

Need 3.07x RTX 4000

FP32 Compute

3.10x

Need 3.10x RTX 4000

VRAM

6.00x

Need 6.00x RTX 4000

Memory Bandwidth

2.95x

Need 2.95x RTX 4000

To match 1x AMD MI250X

Intel Habana Gaudi 3

Compute (FP32-eq)

0.21x

Gaudi 3 is 4.79x faster

VRAM

1.00x

Gaudi 3 has 1.00x more

Memory Bandwidth

0.89x

Gaudi 3 has 1.13x more

Intel Data Center GPU Max 1100

Compute (FP32-eq)

2.16x

Need 2.16x Data Center GPU Max 1100

FP32 Compute

2.18x

Need 2.18x Data Center GPU Max 1100

VRAM

2.67x

Need 2.67x Data Center GPU Max 1100

Memory Bandwidth

2.67x

Need 2.67x Data Center GPU Max 1100

NVIDIA Quadro P600

Compute (FP32-eq)

171.44x

Need 171.44x Quadro P600

FP32 Compute

42.88x

Need 42.88x Quadro P600

VRAM

64.00x

Need 64.00x Quadro P600

Memory Bandwidth

51.20x

Need 51.20x Quadro P600

NVIDIA GeForce GT 710

Compute (FP32-eq)

523.22x

Need 523.22x GeForce GT 710

FP32 Compute

130.87x

Need 130.87x GeForce GT 710

VRAM

64.00x

Need 64.00x GeForce GT 710

Memory Bandwidth

227.57x

Need 227.57x GeForce GT 710

NVIDIA RTX 4000

Compute (FP32-eq)

6.65x

Need 6.65x RTX 4000

FP32 Compute

6.75x

Need 6.75x RTX 4000

VRAM

16.00x

Need 16.00x RTX 4000

Memory Bandwidth

7.88x

Need 7.88x RTX 4000

To match 1x NVIDIA Quadro P600

Intel Habana Gaudi 3

Compute (FP32-eq)

0.00x

Gaudi 3 is 821.40x faster

VRAM

0.02x

Gaudi 3 has 64.00x more

Memory Bandwidth

0.02x

Gaudi 3 has 57.81x more

Intel Data Center GPU Max 1100

Compute (FP32-eq)

0.01x

Data Center GPU Max 1100 is 79.23x faster

FP32 Compute

0.05x

Data Center GPU Max 1100 is 19.70x faster

VRAM

0.04x

Data Center GPU Max 1100 has 24.00x more

Memory Bandwidth

0.05x

Data Center GPU Max 1100 has 19.20x more

AMD MI250X

Compute (FP32-eq)

0.01x

MI250X is 171.44x faster

FP32 Compute

0.02x

MI250X is 42.88x faster

VRAM

0.02x

MI250X has 64.00x more

Memory Bandwidth

0.02x

MI250X has 51.20x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

3.05x

Need 3.05x GeForce GT 710

FP32 Compute

3.05x

Need 3.05x GeForce GT 710

VRAM

1.00x

GeForce GT 710 has 1.00x more

Memory Bandwidth

4.44x

Need 4.44x GeForce GT 710

NVIDIA RTX 4000

Compute (FP32-eq)

0.04x

RTX 4000 is 25.78x faster

FP32 Compute

0.16x

RTX 4000 is 6.36x faster

VRAM

0.25x

RTX 4000 has 4.00x more

Memory Bandwidth

0.15x

RTX 4000 has 6.50x more

To match 1x NVIDIA GeForce GT 710

Intel Habana Gaudi 3

Compute (FP32-eq)

0.00x

Gaudi 3 is 2506.83x faster

VRAM

0.02x

Gaudi 3 has 64.00x more

Memory Bandwidth

0.00x

Gaudi 3 has 256.94x more

Intel Data Center GPU Max 1100

Compute (FP32-eq)

0.00x

Data Center GPU Max 1100 is 241.80x faster

FP32 Compute

0.02x

Data Center GPU Max 1100 is 60.11x faster

VRAM

0.04x

Data Center GPU Max 1100 has 24.00x more

Memory Bandwidth

0.01x

Data Center GPU Max 1100 has 85.35x more

AMD MI250X

Compute (FP32-eq)

0.00x

MI250X is 523.22x faster

FP32 Compute

0.01x

MI250X is 130.87x faster

VRAM

0.02x

MI250X has 64.00x more

Memory Bandwidth

0.00x

MI250X has 227.57x more

NVIDIA Quadro P600

Compute (FP32-eq)

0.33x

Quadro P600 is 3.05x faster

FP32 Compute

0.33x

Quadro P600 is 3.05x faster

VRAM

1.00x

Quadro P600 has 1.00x more

Memory Bandwidth

0.23x

Quadro P600 has 4.44x more

NVIDIA RTX 4000

Compute (FP32-eq)

0.01x

RTX 4000 is 78.69x faster

FP32 Compute

0.05x

RTX 4000 is 19.40x faster

VRAM

0.25x

RTX 4000 has 4.00x more

Memory Bandwidth

0.03x

RTX 4000 has 28.89x more

To match 1x NVIDIA RTX 4000

Intel Habana Gaudi 3

Compute (FP32-eq)

0.03x

Gaudi 3 is 31.86x faster

VRAM

0.06x

Gaudi 3 has 16.00x more

Memory Bandwidth

0.11x

Gaudi 3 has 8.89x more

Intel Data Center GPU Max 1100

Compute (FP32-eq)

0.33x

Data Center GPU Max 1100 is 3.07x faster

FP32 Compute

0.32x

Data Center GPU Max 1100 is 3.10x faster

VRAM

0.17x

Data Center GPU Max 1100 has 6.00x more

Memory Bandwidth

0.34x

Data Center GPU Max 1100 has 2.95x more

AMD MI250X

Compute (FP32-eq)

0.15x

MI250X is 6.65x faster

FP32 Compute

0.15x

MI250X is 6.75x faster

VRAM

0.06x

MI250X has 16.00x more

Memory Bandwidth

0.13x

MI250X has 7.88x more

NVIDIA Quadro P600

Compute (FP32-eq)

25.78x

Need 25.78x Quadro P600

FP32 Compute

6.36x

Need 6.36x Quadro P600

VRAM

4.00x

Need 4.00x Quadro P600

Memory Bandwidth

6.50x

Need 6.50x Quadro P600

NVIDIA GeForce GT 710

Compute (FP32-eq)

78.69x

Need 78.69x GeForce GT 710

FP32 Compute

19.40x

Need 19.40x GeForce GT 710

VRAM

4.00x

Need 4.00x GeForce GT 710

Memory Bandwidth

28.89x

Need 28.89x GeForce GT 710

Pricing

Price Type	Gaudi 3	Data Center GPU Max 1100	MI250X	Quadro P600	GeForce GT 710	RTX 4000
CAPEX (Street Price)	$15,000	$5,000	$12,000	—	—	—
OPEX (per hour)	$1.20/hr	—	$2.00/hr	$0.05/hr	$0.07/hr	$0.34/hr
Price per TFLOPs (FP32-eq)	$16	$56	$63	—	—	—