Compare XPUs

Select up to 5 XPUs to compare side-by-side

Select XPUs to Compare

Showing 128 XPUs • 8 selected

Vendor	XPU	TFLOPs
Alibaba	Hanguang 800	—
AMD	MI100	23.1
AMD	MI210	181
AMD	MI250X	383
AMD	MI300X	1,307
AMD	MI325X	1,400
AMD	MI350X	2,100
AMD	MI355X	5,300
AMD	Radeon Pro V520	23.04
AWS	Inferentia2	190
AWS	Trainium	190
AWS	Trainium2	680
Baidu	Kunlun II	—
Biren Technology	BR100	—
Cambricon	MLU370	256
Cerebras	WSE-3	—
Enflame Technology	CloudBlazer T20	—
FuriosaAI	RNGD (Renegade)	256
FuriosaAI	Warboy	—
Google	TPU v4	275
Google	TPU v5e	197
Google	TPU v5p	459
Graphcore	Bow IPU	—
Graphcore	IPU-M2000	—
Groq	LPU Inference Engine	—
Huawei	Ascend 910B	—
Iluvatar CoreX	BI-V150	300
Intel	Data Center GPU Max 1100	177
Intel	Data Center GPU Max 1550	419
Intel Habana	Gaudi 2	432
Intel Habana	Gaudi 3	1,835

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Chart metrics shown: Compute Performance (BF16), Memory Capacity, Power Consumption, Power Efficiency
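The "normalized to 100 = best in comparison" scaling can be illustrated with a short sketch. It assumes that "best" means the largest value among the selected XPUs, which may not hold for every metric (for Power Consumption, lower could be treated as better). The function name `normalize_to_best` is hypothetical; the VRAM figures are the ones listed in the Specifications table on this page.

```python
from typing import Dict, Optional


def normalize_to_best(values: Dict[str, Optional[float]]) -> Dict[str, Optional[float]]:
    """Scale values so the largest present value maps to 100.

    Assumption: "best" means "largest". Missing values (None) stay None.
    """
    present = [v for v in values.values() if v is not None]
    best = max(present, default=0)
    if best <= 0:
        # Nothing usable to normalize against.
        return {name: None for name in values}
    return {
        name: None if v is None else round(100 * v / best, 1)
        for name, v in values.items()
    }


# VRAM in GB, copied from the Specifications table.
vram_gb = {
    "WSE-3": 44,
    "SN40L": 640,
    "RTX 4500 Ada Generation": 24,
    "GeForce RTX 5060 Ti": 16,
    "GeForce GT 730": 2,
    "GeForce RTX 3090": 24,
    "V100S": 32,
    "P100": 16,
}

scores = normalize_to_best(vram_gb)
for name, score in scores.items():
    print(f"{name}: {score}")
```

Under this assumption the SN40L scores 100 on memory capacity and every other device is expressed as a percentage of its 640 GB.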

Specifications

Specification	Cerebras WSE-3	SambaNova SN40L	NVIDIA RTX 4500 Ada Generation	NVIDIA GeForce RTX 5060 Ti	NVIDIA GeForce GT 730	NVIDIA GeForce RTX 3090	NVIDIA V100S	NVIDIA P100
Architecture	Wafer Scale Engine	Reconfigurable Dataflow Unit (RDU)	Ada Lovelace	Blackwell	Kepler	Ampere	Volta	Pascal
Form Factor	—	—	PCIe	PCIe	PCIe	PCIe	PCIe	SXM
VRAM	44 GB	640 GB	24 GB	16 GB	2 GB	24 GB	32 GB	16 GB
Memory Bandwidth	—	—	576 GB/s	544 GB/s	28.5 GB/s	936 GB/s	1,134 GB/s	732 GB/s
TFLOPs (FP32)	—	—	48.5	25	0.693	35.6	16.4	9.3
TFLOPs (FP16)	—	—	—	—	—	—	—	—
TFLOPs	—	—	97	88	0.693	71	16.4	9.3
TFLOPs (FP8)	—	—	—	—	—	—	—	—
TDP	23,000 W	700 W	210 W	220 W	49 W	350 W	250 W	300 W
Launch Date	Mar 2024	Jan 2024	Mar 2023	Mar 2025	Jun 2014	Sep 2020	Nov 2019	Apr 2016

Efficiency Metrics

Metric	WSE-3	SN40L	RTX 4500 Ada Generation	GeForce RTX 5060 Ti	GeForce GT 730	GeForce RTX 3090	V100S	P100
TFLOPs per Watt (FP32-eq)	—	—	0.23	0.20	0.01	0.10	0.07	0.03
Memory Bandwidth per GB	—	—	24.0 GB/s	34.0 GB/s	14.3 GB/s	39.0 GB/s	35.4 GB/s	45.8 GB/s
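The two efficiency rows look like simple quotients of Specifications values, and the sketch below reproduces several of them. Two assumptions are made: the page rounds half-up for display (so 14.25 shows as 14.3), and the FP32 row is used as a stand-in for the "FP32-eq" figure, whose exact definition the page does not give. That stand-in matches the displayed value for some devices (for example the RTX 4500 Ada Generation and V100S) but not all. The helper names are hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_half_up(value: float, places: str) -> float:
    """Round to the precision given by `places` (e.g. "0.1"), rounding halves up."""
    return float(Decimal(str(value)).quantize(Decimal(places), rounding=ROUND_HALF_UP))


def bandwidth_per_gb(bandwidth_gb_s: float, vram_gb: float) -> float:
    """Memory bandwidth (GB/s) available per GB of VRAM, to one decimal place."""
    return round_half_up(bandwidth_gb_s / vram_gb, "0.1")


def tflops_per_watt(tflops: float, tdp_w: float) -> float:
    """Compute throughput per watt of TDP, to two decimal places."""
    return round_half_up(tflops / tdp_w, "0.01")


# Inputs copied from the Specifications table.
print(bandwidth_per_gb(576, 24))    # RTX 4500 Ada Generation
print(bandwidth_per_gb(28.5, 2))    # GeForce GT 730
print(tflops_per_watt(48.5, 210))   # RTX 4500 Ada Generation, FP32 row as stand-in
print(tflops_per_watt(16.4, 250))   # V100S, FP32 row as stand-in
```

Devices with no bandwidth or compute figure in the table (WSE-3, SN40L) have no quotient to compute, which is consistent with the missing entries in those columns.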

Performance Equivalence

How many units of each XPU are needed to match one unit of each of the others, metric by metric?
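The ratios in this section appear consistent with dividing the target device's value by the candidate's value for a single metric. The sketch below applies that reading to the VRAM figures from the Specifications table. The function names and the "matches 1:1" wording for equal values are hypothetical; the page uses "has ... more" for capacity metrics and "is ... faster" for compute, and only the capacity wording is modeled here.

```python
def units_to_match(target: float, candidate: float) -> float:
    """Units of the candidate device needed to equal one target device on one metric."""
    if candidate <= 0:
        raise ValueError("candidate value must be positive")
    return target / candidate


def describe(target: float, candidate: float, candidate_name: str) -> str:
    """Render the ratio in the page's capacity-style wording."""
    ratio = units_to_match(target, candidate)
    if ratio > 1:
        return f"Need {ratio:.2f}x {candidate_name}"
    if ratio == 1:
        return f"{candidate_name} matches 1:1"
    # The candidate exceeds the target, so report the inverse ratio.
    return f"{candidate_name} has {candidate / target:.2f}x more"


# VRAM in GB: matching 1x Cerebras WSE-3 (44 GB).
print(describe(44, 24, "RTX 4500 Ada Generation"))
print(describe(44, 640, "SN40L"))
print(describe(44, 2, "GeForce GT 730"))
```

Each metric is compared independently, so a device can need fewer units on one metric and more on another, as the VRAM and Memory Bandwidth entries for the same pair show.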

To match 1x Cerebras WSE-3

SambaNova SN40L

VRAM

0.07x

SN40L has 14.55x more

NVIDIA RTX 4500 Ada Generation

VRAM

1.83x

Need 1.83x RTX 4500 Ada Generation

NVIDIA GeForce RTX 5060 Ti

VRAM

2.75x

Need 2.75x GeForce RTX 5060 Ti

NVIDIA GeForce GT 730

VRAM

22.00x

Need 22.00x GeForce GT 730

NVIDIA GeForce RTX 3090

VRAM

1.83x

Need 1.83x GeForce RTX 3090

NVIDIA V100S

VRAM

1.38x

Need 1.38x V100S

NVIDIA P100

VRAM

2.75x

Need 2.75x P100

To match 1x SambaNova SN40L

Cerebras WSE-3

VRAM

14.55x

Need 14.55x WSE-3

NVIDIA RTX 4500 Ada Generation

VRAM

26.67x

Need 26.67x RTX 4500 Ada Generation

NVIDIA GeForce RTX 5060 Ti

VRAM

40.00x

Need 40.00x GeForce RTX 5060 Ti

NVIDIA GeForce GT 730

VRAM

320.00x

Need 320.00x GeForce GT 730

NVIDIA GeForce RTX 3090

VRAM

26.67x

Need 26.67x GeForce RTX 3090

NVIDIA V100S

VRAM

20.00x

Need 20.00x V100S

NVIDIA P100

VRAM

40.00x

Need 40.00x P100

To match 1x NVIDIA RTX 4500 Ada Generation

Cerebras WSE-3

VRAM

0.55x

WSE-3 has 1.83x more

SambaNova SN40L

VRAM

0.04x

SN40L has 26.67x more

NVIDIA GeForce RTX 5060 Ti

Compute (FP32-eq)

1.10x

Need 1.10x GeForce RTX 5060 Ti

FP32 Compute

1.94x

Need 1.94x GeForce RTX 5060 Ti

VRAM

1.50x

Need 1.50x GeForce RTX 5060 Ti

Memory Bandwidth

1.06x

Need 1.06x GeForce RTX 5060 Ti

NVIDIA GeForce GT 730

Compute (FP32-eq)

69.99x

Need 69.99x GeForce GT 730

FP32 Compute

69.99x

Need 69.99x GeForce GT 730

VRAM

12.00x

Need 12.00x GeForce GT 730

Memory Bandwidth

20.21x

Need 20.21x GeForce GT 730

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

1.37x

Need 1.37x GeForce RTX 3090

FP32 Compute

1.36x

Need 1.36x GeForce RTX 3090

VRAM

1.00x

GeForce RTX 3090 has the same amount

Memory Bandwidth

0.62x

GeForce RTX 3090 has 1.63x more

NVIDIA V100S

Compute (FP32-eq)

2.96x

Need 2.96x V100S

FP32 Compute

2.96x

Need 2.96x V100S

VRAM

0.75x

V100S has 1.33x more

Memory Bandwidth

0.51x

V100S has 1.97x more

NVIDIA P100

Compute (FP32-eq)

5.22x

Need 5.22x P100

FP32 Compute

5.22x

Need 5.22x P100

VRAM

1.50x

Need 1.50x P100

Memory Bandwidth

0.79x

P100 has 1.27x more

To match 1x NVIDIA GeForce RTX 5060 Ti

Cerebras WSE-3

VRAM

0.36x

WSE-3 has 2.75x more

SambaNova SN40L

VRAM

0.03x

SN40L has 40.00x more

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

0.91x

RTX 4500 Ada Generation is 1.10x faster

FP32 Compute

0.52x

RTX 4500 Ada Generation is 1.94x faster

VRAM

0.67x

RTX 4500 Ada Generation has 1.50x more

Memory Bandwidth

0.94x

RTX 4500 Ada Generation has 1.06x more

NVIDIA GeForce GT 730

Compute (FP32-eq)

63.49x

Need 63.49x GeForce GT 730

FP32 Compute

36.08x

Need 36.08x GeForce GT 730

VRAM

8.00x

Need 8.00x GeForce GT 730

Memory Bandwidth

19.09x

Need 19.09x GeForce GT 730

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

1.24x

Need 1.24x GeForce RTX 3090

FP32 Compute

0.70x

GeForce RTX 3090 is 1.42x faster

VRAM

0.67x

GeForce RTX 3090 has 1.50x more

Memory Bandwidth

0.58x

GeForce RTX 3090 has 1.72x more

NVIDIA V100S

Compute (FP32-eq)

2.68x

Need 2.68x V100S

FP32 Compute

1.52x

Need 1.52x V100S

VRAM

0.50x

V100S has 2.00x more

Memory Bandwidth

0.48x

V100S has 2.08x more

NVIDIA P100

Compute (FP32-eq)

4.73x

Need 4.73x P100

FP32 Compute

2.69x

Need 2.69x P100

VRAM

1.00x

P100 has the same amount

Memory Bandwidth

0.74x

P100 has 1.35x more

To match 1x NVIDIA GeForce GT 730

Cerebras WSE-3

VRAM

0.05x

WSE-3 has 22.00x more

SambaNova SN40L

VRAM

<0.01x

SN40L has 320.00x more

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

0.01x

RTX 4500 Ada Generation is 69.99x faster

FP32 Compute

0.01x

RTX 4500 Ada Generation is 69.99x faster

VRAM

0.08x

RTX 4500 Ada Generation has 12.00x more

Memory Bandwidth

0.05x

RTX 4500 Ada Generation has 20.21x more

NVIDIA GeForce RTX 5060 Ti

Compute (FP32-eq)

0.02x

GeForce RTX 5060 Ti is 63.49x faster

FP32 Compute

0.03x

GeForce RTX 5060 Ti is 36.08x faster

VRAM

0.13x

GeForce RTX 5060 Ti has 8.00x more

Memory Bandwidth

0.05x

GeForce RTX 5060 Ti has 19.09x more

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

0.02x

GeForce RTX 3090 is 51.23x faster

FP32 Compute

0.02x

GeForce RTX 3090 is 51.37x faster

VRAM

0.08x

GeForce RTX 3090 has 12.00x more

Memory Bandwidth

0.03x

GeForce RTX 3090 has 32.84x more

NVIDIA V100S

Compute (FP32-eq)

0.04x

V100S is 23.67x faster

FP32 Compute

0.04x

V100S is 23.67x faster

VRAM

0.06x

V100S has 16.00x more

Memory Bandwidth

0.03x

V100S has 39.79x more

NVIDIA P100

Compute (FP32-eq)

0.07x

P100 is 13.42x faster

FP32 Compute

0.07x

P100 is 13.42x faster

VRAM

0.13x

P100 has 8.00x more

Memory Bandwidth

0.04x

P100 has 25.68x more

To match 1x NVIDIA GeForce RTX 3090

Cerebras WSE-3

VRAM

0.55x

WSE-3 has 1.83x more

SambaNova SN40L

VRAM

0.04x

SN40L has 26.67x more

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

0.73x

RTX 4500 Ada Generation is 1.37x faster

FP32 Compute

0.73x

RTX 4500 Ada Generation is 1.36x faster

VRAM

1.00x

RTX 4500 Ada Generation has the same amount

Memory Bandwidth

1.63x

Need 1.63x RTX 4500 Ada Generation

NVIDIA GeForce RTX 5060 Ti

Compute (FP32-eq)

0.81x

GeForce RTX 5060 Ti is 1.24x faster

FP32 Compute

1.42x

Need 1.42x GeForce RTX 5060 Ti

VRAM

1.50x

Need 1.50x GeForce RTX 5060 Ti

Memory Bandwidth

1.72x

Need 1.72x GeForce RTX 5060 Ti

NVIDIA GeForce GT 730

Compute (FP32-eq)

51.23x

Need 51.23x GeForce GT 730

FP32 Compute

51.37x

Need 51.37x GeForce GT 730

VRAM

12.00x

Need 12.00x GeForce GT 730

Memory Bandwidth

32.84x

Need 32.84x GeForce GT 730

NVIDIA V100S

Compute (FP32-eq)

2.16x

Need 2.16x V100S

FP32 Compute

2.17x

Need 2.17x V100S

VRAM

0.75x

V100S has 1.33x more

Memory Bandwidth

0.83x

V100S has 1.21x more

NVIDIA P100

Compute (FP32-eq)

3.82x

Need 3.82x P100

FP32 Compute

3.83x

Need 3.83x P100

VRAM

1.50x

Need 1.50x P100

Memory Bandwidth

1.28x

Need 1.28x P100

To match 1x NVIDIA V100S

Cerebras WSE-3

VRAM

0.73x

WSE-3 has 1.38x more

SambaNova SN40L

VRAM

0.05x

SN40L has 20.00x more

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

0.34x

RTX 4500 Ada Generation is 2.96x faster

FP32 Compute

0.34x

RTX 4500 Ada Generation is 2.96x faster

VRAM

1.33x

Need 1.33x RTX 4500 Ada Generation

Memory Bandwidth

1.97x

Need 1.97x RTX 4500 Ada Generation

NVIDIA GeForce RTX 5060 Ti

Compute (FP32-eq)

0.37x

GeForce RTX 5060 Ti is 2.68x faster

FP32 Compute

0.66x

GeForce RTX 5060 Ti is 1.52x faster

VRAM

2.00x

Need 2.00x GeForce RTX 5060 Ti

Memory Bandwidth

2.08x

Need 2.08x GeForce RTX 5060 Ti

NVIDIA GeForce GT 730

Compute (FP32-eq)

23.67x

Need 23.67x GeForce GT 730

FP32 Compute

23.67x

Need 23.67x GeForce GT 730

VRAM

16.00x

Need 16.00x GeForce GT 730

Memory Bandwidth

39.79x

Need 39.79x GeForce GT 730

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

0.46x

GeForce RTX 3090 is 2.16x faster

FP32 Compute

0.46x

GeForce RTX 3090 is 2.17x faster

VRAM

1.33x

Need 1.33x GeForce RTX 3090

Memory Bandwidth

1.21x

Need 1.21x GeForce RTX 3090

NVIDIA P100

Compute (FP32-eq)

1.76x

Need 1.76x P100

FP32 Compute

1.76x

Need 1.76x P100

VRAM

2.00x

Need 2.00x P100

Memory Bandwidth

1.55x

Need 1.55x P100

To match 1x NVIDIA P100

Cerebras WSE-3

VRAM

0.36x

WSE-3 has 2.75x more

SambaNova SN40L

VRAM

0.03x

SN40L has 40.00x more

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

0.19x

RTX 4500 Ada Generation is 5.22x faster

FP32 Compute

0.19x

RTX 4500 Ada Generation is 5.22x faster

VRAM

0.67x

RTX 4500 Ada Generation has 1.50x more

Memory Bandwidth

1.27x

Need 1.27x RTX 4500 Ada Generation

NVIDIA GeForce RTX 5060 Ti

Compute (FP32-eq)

0.21x

GeForce RTX 5060 Ti is 4.73x faster

FP32 Compute

0.37x

GeForce RTX 5060 Ti is 2.69x faster

VRAM

1.00x

GeForce RTX 5060 Ti has the same amount

Memory Bandwidth

1.35x

Need 1.35x GeForce RTX 5060 Ti

NVIDIA GeForce GT 730

Compute (FP32-eq)

13.42x

Need 13.42x GeForce GT 730

FP32 Compute

13.42x

Need 13.42x GeForce GT 730

VRAM

8.00x

Need 8.00x GeForce GT 730

Memory Bandwidth

25.68x

Need 25.68x GeForce GT 730

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

0.26x

GeForce RTX 3090 is 3.82x faster

FP32 Compute

0.26x

GeForce RTX 3090 is 3.83x faster

VRAM

0.67x

GeForce RTX 3090 has 1.50x more

Memory Bandwidth

0.78x

GeForce RTX 3090 has 1.28x more

NVIDIA V100S

Compute (FP32-eq)

0.57x

V100S is 1.76x faster

FP32 Compute

0.57x

V100S is 1.76x faster

VRAM

0.50x

V100S has 2.00x more

Memory Bandwidth

0.65x

V100S has 1.55x more

Pricing

Price Type	WSE-3	SN40L	RTX 4500 Ada Generation	GeForce RTX 5060 Ti	GeForce GT 730	GeForce RTX 3090	V100S	P100
CAPEX (Street Price)	—	—	—	—	—	—	—	—
OPEX (per hour)	—	—	—	$0.09/hr	$0.04/hr	$0.11/hr	$0.88/hr	$0.28/hr
Price per TFLOPs (FP32-eq)	—	—	—	—	—	—	—	—