Compare XPUs

Select up to 5 XPUs to compare side-by-side

Select XPUs to Compare

Showing 128 XPUs • 7 selected

Vendor	Model	TFLOPs
Alibaba	Hanguang 800	—
AMD	MI100	23.1
AMD	MI210	181
AMD	MI250X	383
AMD	MI300X	1,307
AMD	MI325X	1,400
AMD	MI350X	2,100
AMD	MI355X	5,300
AMD	Radeon Pro V520	23.04
AWS	Inferentia2	190
AWS	Trainium	190
AWS	Trainium2	680
Baidu	Kunlun II	—
Biren Technology	BR100	—
Cambricon	MLU370	256
Cerebras	WSE-3	—
Enflame Technology	CloudBlazer T20	—
FuriosaAI	RNGD (Renegade)	256
FuriosaAI	Warboy	—
Google	TPU v4	275
Google	TPU v5e	197
Google	TPU v5p	459
Graphcore	Bow IPU	—
Graphcore	IPU-M2000	—
Groq	LPU Inference Engine	—
Huawei	Ascend 910B	—
Iluvatar CoreX	BI-V150	300
Intel	Data Center GPU Max 1100	177
Intel	Data Center GPU Max 1550	419
Intel Habana	Gaudi 2	432
Intel Habana	Gaudi 3	1,835

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Chart axes: Compute Performance (BF16), Memory Capacity, Power Consumption, Power Efficiency
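
The "100 = best in comparison" normalization can be sketched as below. This is a minimal illustration, not the page's actual implementation: the function name `normalize_to_best` and the handling of lower-is-better metrics (such as power consumption) are assumptions. The VRAM figures are the ones listed in the Specifications table.

```python
def normalize_to_best(values, higher_is_better=True):
    """Scale each value so that the best entry in the comparison scores 100.

    Missing values (None) are passed through unchanged.
    """
    present = [v for v in values.values() if v is not None]
    if not present:
        return dict(values)
    best = max(present) if higher_is_better else min(present)
    scores = {}
    for name, value in values.items():
        if value is None:
            scores[name] = None
        elif higher_is_better:
            scores[name] = 100.0 * value / best
        else:
            # Assumption: for lower-is-better metrics the smallest value scores 100.
            scores[name] = 100.0 * best / value
    return scores


# VRAM in GB, as listed in the Specifications table.
vram_gb = {
    "WSE-3": 44,
    "Kunlun II": 32,
    "Data Center GPU Max 1100": 48,
    "A10G": 24,
    "GeForce GT 710": 2,
    "GeForce RTX 3060": 12,
    "V100S": 32,
}

for name, score in normalize_to_best(vram_gb).items():
    print(f"{name}: {score:.1f}")
```

With these inputs the Data Center GPU Max 1100 (48 GB) is the reference at 100, and every other device is expressed as a percentage of it.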

Specifications

Specification	Cerebras WSE-3	Baidu Kunlun II	Intel Data Center GPU Max 1100	NVIDIA A10G	NVIDIA GeForce GT 710	NVIDIA GeForce RTX 3060	NVIDIA V100S
Architecture	Wafer Scale Engine	Kunlun Core	Ponte Vecchio	Ampere	Kepler	Ampere	Volta
Form Factor	—	—	OAM	PCIe	PCIe	PCIe	PCIe
VRAM	44 GB	32 GB	48 GB	24 GB	2 GB	12 GB	32 GB
Memory Bandwidth	—	—	1,229 GB/s	600 GB/s	14.4 GB/s	360 GB/s	1,134 GB/s
TFLOPs (FP32)	—	—	22	35.2	0.366	13	16.4
TFLOPs (FP16)	—	256	177	—	—	—	—
TFLOPs	—	—	177	125	0.366	25.4	16.4
TFLOPs (FP8)	—	—	—	—	—	—	—
TDP	23,000 W	200 W	300 W	300 W	19 W	170 W	250 W
Launch Date	Mar 2024	Aug 2021	Jan 2023	Jul 2021	Mar 2014	Feb 2021	Nov 2019

Efficiency Metrics

Metric	WSE-3	Kunlun II	Data Center GPU Max 1100	A10G	GeForce GT 710	GeForce RTX 3060	V100S
TFLOPs per Watt (FP32-eq)	—	—	0.29	0.21	0.02	0.07	0.07
Memory Bandwidth per GB	—	—	25.6 GB/s	25.0 GB/s	7.2 GB/s	30.0 GB/s	35.4 GB/s
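
The two efficiency metrics can be reproduced from the Specifications table with the sketch below. The page does not define "FP32-eq". The rule used here (halve the unlabeled "TFLOPs" figure when it exceeds the FP32 figure, otherwise use the FP32 figure) is an assumption inferred from the published numbers, and the `Xpu` class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Xpu:
    name: str
    headline_tflops: float  # unlabeled "TFLOPs" row in the Specifications table
    fp32_tflops: float      # "TFLOPs (FP32)" row
    tdp_w: float
    vram_gb: float
    bandwidth_gbs: float


def fp32_equivalent(x: Xpu) -> float:
    # Assumption (inferred, not stated by the page): a headline figure above
    # the FP32 figure is a reduced-precision number and counts at half weight.
    if x.headline_tflops > x.fp32_tflops:
        return x.headline_tflops / 2
    return x.fp32_tflops


def tflops_per_watt(x: Xpu) -> float:
    return fp32_equivalent(x) / x.tdp_w


def bandwidth_per_gb(x: Xpu) -> float:
    return x.bandwidth_gbs / x.vram_gb


# Only devices with complete rows in the Specifications table are included.
xpus = [
    Xpu("Data Center GPU Max 1100", 177, 22, 300, 48, 1229),
    Xpu("A10G", 125, 35.2, 300, 24, 600),
    Xpu("GeForce GT 710", 0.366, 0.366, 19, 2, 14.4),
    Xpu("GeForce RTX 3060", 25.4, 13, 170, 12, 360),
    Xpu("V100S", 16.4, 16.4, 250, 32, 1134),
]

for x in xpus:
    print(f"{x.name}: {tflops_per_watt(x):.3f} TFLOPs/W, "
          f"{bandwidth_per_gb(x):.1f} GB/s per GB")
```

The printed values agree with the table to its two-decimal precision. For example, the Data Center GPU Max 1100 works out to 88.5 FP32-eq TFLOPs over 300 W.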

Performance Equivalence

How many units of each GPU are needed to match the performance of the others?

To match 1x Cerebras WSE-3

Device	Metric	Units needed	Comparison
Baidu Kunlun II	VRAM	1.38x	Need 1.38x Kunlun II
Intel Data Center GPU Max 1100	VRAM	0.92x	Data Center GPU Max 1100 has 1.09x more
NVIDIA A10G	VRAM	1.83x	Need 1.83x A10G
NVIDIA GeForce GT 710	VRAM	22.00x	Need 22.00x GeForce GT 710
NVIDIA GeForce RTX 3060	VRAM	3.67x	Need 3.67x GeForce RTX 3060
NVIDIA V100S	VRAM	1.38x	Need 1.38x V100S

To match 1x Baidu Kunlun II

Device	Metric	Units needed	Comparison
Cerebras WSE-3	VRAM	0.73x	WSE-3 has 1.38x more
Intel Data Center GPU Max 1100	VRAM	0.67x	Data Center GPU Max 1100 has 1.50x more
NVIDIA A10G	VRAM	1.33x	Need 1.33x A10G
NVIDIA GeForce GT 710	VRAM	16.00x	Need 16.00x GeForce GT 710
NVIDIA GeForce RTX 3060	VRAM	2.67x	Need 2.67x GeForce RTX 3060
NVIDIA V100S	VRAM	1.00x	V100S has the same amount

To match 1x Intel Data Center GPU Max 1100

Device	Metric	Units needed	Comparison
Cerebras WSE-3	VRAM	1.09x	Need 1.09x WSE-3
Baidu Kunlun II	VRAM	1.50x	Need 1.50x Kunlun II
NVIDIA A10G	Compute (FP32-eq)	1.42x	Need 1.42x A10G
NVIDIA A10G	FP32 Compute	0.63x	A10G is 1.60x faster
NVIDIA A10G	VRAM	2.00x	Need 2.00x A10G
NVIDIA A10G	Memory Bandwidth	2.05x	Need 2.05x A10G
NVIDIA GeForce GT 710	Compute (FP32-eq)	241.80x	Need 241.80x GeForce GT 710
NVIDIA GeForce GT 710	FP32 Compute	60.11x	Need 60.11x GeForce GT 710
NVIDIA GeForce GT 710	VRAM	24.00x	Need 24.00x GeForce GT 710
NVIDIA GeForce GT 710	Memory Bandwidth	85.35x	Need 85.35x GeForce GT 710
NVIDIA GeForce RTX 3060	Compute (FP32-eq)	6.97x	Need 6.97x GeForce RTX 3060
NVIDIA GeForce RTX 3060	FP32 Compute	1.69x	Need 1.69x GeForce RTX 3060
NVIDIA GeForce RTX 3060	VRAM	4.00x	Need 4.00x GeForce RTX 3060
NVIDIA GeForce RTX 3060	Memory Bandwidth	3.41x	Need 3.41x GeForce RTX 3060
NVIDIA V100S	Compute (FP32-eq)	5.40x	Need 5.40x V100S
NVIDIA V100S	FP32 Compute	1.34x	Need 1.34x V100S
NVIDIA V100S	VRAM	1.50x	Need 1.50x V100S
NVIDIA V100S	Memory Bandwidth	1.08x	Need 1.08x V100S

To match 1x NVIDIA A10G

Device	Metric	Units needed	Comparison
Cerebras WSE-3	VRAM	0.55x	WSE-3 has 1.83x more
Baidu Kunlun II	VRAM	0.75x	Kunlun II has 1.33x more
Intel Data Center GPU Max 1100	Compute (FP32-eq)	0.71x	Data Center GPU Max 1100 is 1.42x faster
Intel Data Center GPU Max 1100	FP32 Compute	1.60x	Need 1.60x Data Center GPU Max 1100
Intel Data Center GPU Max 1100	VRAM	0.50x	Data Center GPU Max 1100 has 2.00x more
Intel Data Center GPU Max 1100	Memory Bandwidth	0.49x	Data Center GPU Max 1100 has 2.05x more
NVIDIA GeForce GT 710	Compute (FP32-eq)	170.77x	Need 170.77x GeForce GT 710
NVIDIA GeForce GT 710	FP32 Compute	96.17x	Need 96.17x GeForce GT 710
NVIDIA GeForce GT 710	VRAM	12.00x	Need 12.00x GeForce GT 710
NVIDIA GeForce GT 710	Memory Bandwidth	41.67x	Need 41.67x GeForce GT 710
NVIDIA GeForce RTX 3060	Compute (FP32-eq)	4.92x	Need 4.92x GeForce RTX 3060
NVIDIA GeForce RTX 3060	FP32 Compute	2.71x	Need 2.71x GeForce RTX 3060
NVIDIA GeForce RTX 3060	VRAM	2.00x	Need 2.00x GeForce RTX 3060
NVIDIA GeForce RTX 3060	Memory Bandwidth	1.67x	Need 1.67x GeForce RTX 3060
NVIDIA V100S	Compute (FP32-eq)	3.81x	Need 3.81x V100S
NVIDIA V100S	FP32 Compute	2.15x	Need 2.15x V100S
NVIDIA V100S	VRAM	0.75x	V100S has 1.33x more
NVIDIA V100S	Memory Bandwidth	0.53x	V100S has 1.89x more

To match 1x NVIDIA GeForce GT 710

Device	Metric	Units needed	Comparison
Cerebras WSE-3	VRAM	0.05x	WSE-3 has 22.00x more
Baidu Kunlun II	VRAM	0.06x	Kunlun II has 16.00x more
Intel Data Center GPU Max 1100	Compute (FP32-eq)	<0.01x	Data Center GPU Max 1100 is 241.80x faster
Intel Data Center GPU Max 1100	FP32 Compute	0.02x	Data Center GPU Max 1100 is 60.11x faster
Intel Data Center GPU Max 1100	VRAM	0.04x	Data Center GPU Max 1100 has 24.00x more
Intel Data Center GPU Max 1100	Memory Bandwidth	0.01x	Data Center GPU Max 1100 has 85.35x more
NVIDIA A10G	Compute (FP32-eq)	0.01x	A10G is 170.77x faster
NVIDIA A10G	FP32 Compute	0.01x	A10G is 96.17x faster
NVIDIA A10G	VRAM	0.08x	A10G has 12.00x more
NVIDIA A10G	Memory Bandwidth	0.02x	A10G has 41.67x more
NVIDIA GeForce RTX 3060	Compute (FP32-eq)	0.03x	GeForce RTX 3060 is 34.70x faster
NVIDIA GeForce RTX 3060	FP32 Compute	0.03x	GeForce RTX 3060 is 35.52x faster
NVIDIA GeForce RTX 3060	VRAM	0.17x	GeForce RTX 3060 has 6.00x more
NVIDIA GeForce RTX 3060	Memory Bandwidth	0.04x	GeForce RTX 3060 has 25.00x more
NVIDIA V100S	Compute (FP32-eq)	0.02x	V100S is 44.81x faster
NVIDIA V100S	FP32 Compute	0.02x	V100S is 44.81x faster
NVIDIA V100S	VRAM	0.06x	V100S has 16.00x more
NVIDIA V100S	Memory Bandwidth	0.01x	V100S has 78.75x more

To match 1x NVIDIA GeForce RTX 3060

Device	Metric	Units needed	Comparison
Cerebras WSE-3	VRAM	0.27x	WSE-3 has 3.67x more
Baidu Kunlun II	VRAM	0.38x	Kunlun II has 2.67x more
Intel Data Center GPU Max 1100	Compute (FP32-eq)	0.14x	Data Center GPU Max 1100 is 6.97x faster
Intel Data Center GPU Max 1100	FP32 Compute	0.59x	Data Center GPU Max 1100 is 1.69x faster
Intel Data Center GPU Max 1100	VRAM	0.25x	Data Center GPU Max 1100 has 4.00x more
Intel Data Center GPU Max 1100	Memory Bandwidth	0.29x	Data Center GPU Max 1100 has 3.41x more
NVIDIA A10G	Compute (FP32-eq)	0.20x	A10G is 4.92x faster
NVIDIA A10G	FP32 Compute	0.37x	A10G is 2.71x faster
NVIDIA A10G	VRAM	0.50x	A10G has 2.00x more
NVIDIA A10G	Memory Bandwidth	0.60x	A10G has 1.67x more
NVIDIA GeForce GT 710	Compute (FP32-eq)	34.70x	Need 34.70x GeForce GT 710
NVIDIA GeForce GT 710	FP32 Compute	35.52x	Need 35.52x GeForce GT 710
NVIDIA GeForce GT 710	VRAM	6.00x	Need 6.00x GeForce GT 710
NVIDIA GeForce GT 710	Memory Bandwidth	25.00x	Need 25.00x GeForce GT 710
NVIDIA V100S	Compute (FP32-eq)	0.77x	V100S is 1.29x faster
NVIDIA V100S	FP32 Compute	0.79x	V100S is 1.26x faster
NVIDIA V100S	VRAM	0.38x	V100S has 2.67x more
NVIDIA V100S	Memory Bandwidth	0.32x	V100S has 3.15x more

To match 1x NVIDIA V100S

Device	Metric	Units needed	Comparison
Cerebras WSE-3	VRAM	0.73x	WSE-3 has 1.38x more
Baidu Kunlun II	VRAM	1.00x	Kunlun II has the same amount
Intel Data Center GPU Max 1100	Compute (FP32-eq)	0.19x	Data Center GPU Max 1100 is 5.40x faster
Intel Data Center GPU Max 1100	FP32 Compute	0.75x	Data Center GPU Max 1100 is 1.34x faster
Intel Data Center GPU Max 1100	VRAM	0.67x	Data Center GPU Max 1100 has 1.50x more
Intel Data Center GPU Max 1100	Memory Bandwidth	0.92x	Data Center GPU Max 1100 has 1.08x more
NVIDIA A10G	Compute (FP32-eq)	0.26x	A10G is 3.81x faster
NVIDIA A10G	FP32 Compute	0.47x	A10G is 2.15x faster
NVIDIA A10G	VRAM	1.33x	Need 1.33x A10G
NVIDIA A10G	Memory Bandwidth	1.89x	Need 1.89x A10G
NVIDIA GeForce GT 710	Compute (FP32-eq)	44.81x	Need 44.81x GeForce GT 710
NVIDIA GeForce GT 710	FP32 Compute	44.81x	Need 44.81x GeForce GT 710
NVIDIA GeForce GT 710	VRAM	16.00x	Need 16.00x GeForce GT 710
NVIDIA GeForce GT 710	Memory Bandwidth	78.75x	Need 78.75x GeForce GT 710
NVIDIA GeForce RTX 3060	Compute (FP32-eq)	1.29x	Need 1.29x GeForce RTX 3060
NVIDIA GeForce RTX 3060	FP32 Compute	1.26x	Need 1.26x GeForce RTX 3060
NVIDIA GeForce RTX 3060	VRAM	2.67x	Need 2.67x GeForce RTX 3060
NVIDIA GeForce RTX 3060	Memory Bandwidth	3.15x	Need 3.15x GeForce RTX 3060
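
The equivalence figures above appear to be simple ratios: the target device's value divided by the candidate's value, with the reciprocal reported when the candidate is ahead. A minimal sketch under that assumption follows. The function name `equivalence`, the `faster` flag, and the wording for an exact tie are hypothetical and not taken from the page.

```python
import math


def equivalence(target_value, candidate_name, candidate_value, faster=False):
    """Return (units_needed, comparison) for matching one target device.

    units_needed is target_value / candidate_value. When it is below 1, the
    candidate already exceeds the target, so the reciprocal is reported.
    """
    ratio = target_value / candidate_value
    if math.isclose(ratio, 1.0):
        # Hypothetical wording for an exact tie.
        note = f"{candidate_name} matches"
    elif ratio > 1.0:
        note = f"Need {ratio:.2f}x {candidate_name}"
    else:
        advantage = 1.0 / ratio
        if faster:
            note = f"{candidate_name} is {advantage:.2f}x faster"
        else:
            note = f"{candidate_name} has {advantage:.2f}x more"
    return f"{ratio:.2f}x", note


# VRAM needed to match 1x Cerebras WSE-3 (44 GB).
print(equivalence(44, "A10G", 24))
print(equivalence(44, "Data Center GPU Max 1100", 48))
```

For the 44 GB WSE-3, a 24 GB A10G gives 44 / 24, or about 1.83 units, while the 48 GB Data Center GPU Max 1100 gives 44 / 48, or about 0.92, reported as having 1.09x more.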

Pricing

Price Type	WSE-3	Kunlun II	Data Center GPU Max 1100	A10G	GeForce GT 710	GeForce RTX 3060	V100S
CAPEX (Street Price)	—	—	$5,000	—	—	—	—
OPEX (per hour)	—	—	—	$1.01/hr	$0.07/hr	$0.05/hr	$0.88/hr
Price per TFLOPs (FP32-eq)	—	—	$56	—	—	—	—
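
The price-per-TFLOP figure is consistent with dividing the street price by the FP32-equivalent throughput. The sketch below assumes that the FP32-eq value for the Data Center GPU Max 1100 is half of its 177 TFLOPs figure, which is an inference from the published numbers rather than something the page states. The function name is hypothetical.

```python
def price_per_tflop(street_price_usd, fp32_eq_tflops):
    """CAPEX divided by FP32-equivalent TFLOPs."""
    if fp32_eq_tflops <= 0:
        raise ValueError("throughput must be positive")
    return street_price_usd / fp32_eq_tflops


# Assumption: FP32-eq is half of the 177 TFLOPs figure listed for this device.
fp32_eq = 177 / 2
print(f"${price_per_tflop(5000, fp32_eq):.0f} per TFLOP (FP32-eq)")
```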