Compare XPUs

Select up to 5 XPUs to compare side-by-side

Select XPUs to Compare

Showing 128 XPUs • 6 selected

Vendor	XPU	TFLOPs
Alibaba	Hanguang 800	—
AMD	MI100	23.1
AMD	MI210	181
AMD	MI250X	383
AMD	MI300X	1,307
AMD	MI325X	1,400
AMD	MI350X	2,100
AMD	MI355X	5,300
AMD	Radeon Pro V520	23.04
AWS	Inferentia2	190
AWS	Trainium	190
AWS	Trainium2	680
Baidu	Kunlun II	—
Biren Technology	BR100	—
Cambricon	MLU370	256
Cerebras	WSE-3	—
Enflame Technology	CloudBlazer T20	—
FuriosaAI	RNGD (Renegade)	256
FuriosaAI	Warboy	—
Google	TPU v4	275
Google	TPU v5e	197
Google	TPU v5p	459
Graphcore	Bow IPU	—
Graphcore	IPU-M2000	—
Groq	LPU Inference Engine	—
Huawei	Ascend 910B	—
Iluvatar CoreX	BI-V150	300
Intel	Data Center GPU Max 1100	177
Intel	Data Center GPU Max 1550	419
Intel Habana	Gaudi 2	432
Intel Habana	Gaudi 3	1,835

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Metrics shown: Compute Performance (BF16), Memory Capacity, Power Consumption, Power Efficiency
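The "normalized to 100 = best in comparison" scaling can be sketched as follows. This is a minimal illustration, not the page's actual code: the function name `normalize_to_best` and the `lower_is_better` flag are hypothetical, and whether the page treats lower Power Consumption as "best" is an assumption.

```python
def normalize_to_best(values, lower_is_better=False):
    """Scale each value so that the best entry in the comparison scores 100."""
    if lower_is_better:
        # Assumption: for metrics such as TDP, the smallest value is "best".
        best = min(values.values())
        return {name: 100.0 * best / v for name, v in values.items()}
    best = max(values.values())
    return {name: 100.0 * v / best for name, v in values.items()}


# VRAM (GB) and TDP (W) as listed in the Specifications table.
vram_gb = {
    "RTX 4500 Ada Generation": 24,
    "GeForce RTX 3090": 24,
    "GeForce RTX 3090 Ti": 24,
    "L40": 48,
    "P100": 16,
    "GH200": 96,
}
tdp_w = {
    "RTX 4500 Ada Generation": 210,
    "GeForce RTX 3090": 350,
    "GeForce RTX 3090 Ti": 450,
    "L40": 300,
    "P100": 300,
    "GH200": 1000,
}

memory_scores = normalize_to_best(vram_gb)
power_scores = normalize_to_best(tdp_w, lower_is_better=True)
```

With this scaling the GH200 defines 100 for Memory Capacity, and every other card is expressed as a percentage of it.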

Specifications

Specification	NVIDIA RTX 4500 Ada Generation	NVIDIA GeForce RTX 3090	NVIDIA GeForce RTX 3090 Ti	NVIDIA L40	NVIDIA P100	NVIDIA GH200
Architecture	Ada Lovelace	Ampere	Ampere	Ada Lovelace	Pascal	Hopper
Form Factor	PCIe	PCIe	PCIe	PCIe	SXM	SXM
VRAM	24 GB	24 GB	24 GB	48 GB	16 GB	96 GB
Memory Bandwidth	576 GB/s	936 GB/s	1,008 GB/s	864 GB/s	732 GB/s	4,000 GB/s
TFLOPs (FP32)	48.5	35.6	40	45	9.3	67
TFLOPs (FP16)	—	—	—	—	—	—
TFLOPs	97	71	80	181.05	9.3	989
TFLOPs (FP8)	—	—	—	—	—	—
TDP	210 W	350 W	450 W	300 W	300 W	1000 W
Launch Date	Mar 2023	Sep 2020	Mar 2022	Oct 2022	Apr 2016	May 2023

Efficiency Metrics

Metric	RTX 4500 Ada Generation	GeForce RTX 3090	GeForce RTX 3090 Ti	L40	P100	GH200
TFLOPs per Watt (FP32-eq)	0.23	0.10	0.09	0.30	0.03	0.49
Memory Bandwidth per GB of VRAM	24.0 GB/s	39.0 GB/s	42.0 GB/s	18.0 GB/s	45.8 GB/s	41.7 GB/s
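Both efficiency rows are simple ratios of values in the Specifications table. The sketch below shows the arithmetic. The page does not state how "FP32-eq" is derived; the `fp32_eq` values here are an assumption (half of the unlabeled TFLOPs row, except for the P100, where the FP32 figure is used) chosen because they are consistent with the published ratios. The helper names are hypothetical.

```python
# VRAM, memory bandwidth, and TDP copied from the Specifications table.
specs = {
    "RTX 4500 Ada Generation": {"vram_gb": 24, "bw_gbps": 576, "tdp_w": 210},
    "GeForce RTX 3090": {"vram_gb": 24, "bw_gbps": 936, "tdp_w": 350},
    "GeForce RTX 3090 Ti": {"vram_gb": 24, "bw_gbps": 1008, "tdp_w": 450},
    "L40": {"vram_gb": 48, "bw_gbps": 864, "tdp_w": 300},
    "P100": {"vram_gb": 16, "bw_gbps": 732, "tdp_w": 300},
    "GH200": {"vram_gb": 96, "bw_gbps": 4000, "tdp_w": 1000},
}

# ASSUMPTION: FP32-equivalent TFLOPs inferred from the table, not stated by the page.
fp32_eq = {
    "RTX 4500 Ada Generation": 48.5,
    "GeForce RTX 3090": 35.5,
    "GeForce RTX 3090 Ti": 40.0,
    "L40": 90.525,
    "P100": 9.3,
    "GH200": 494.5,
}


def tflops_per_watt(tflops, tdp_w):
    """Compute throughput per watt of rated board power."""
    return tflops / tdp_w


def bandwidth_per_gb(bw_gbps, vram_gb):
    """Memory bandwidth (GB/s) available per GB of VRAM."""
    return bw_gbps / vram_gb


for name, s in specs.items():
    eff = tflops_per_watt(fp32_eq[name], s["tdp_w"])
    bw = bandwidth_per_gb(s["bw_gbps"], s["vram_gb"])
    print(f"{name}: {eff:.2f} TFLOPs/W, {bw:.1f} GB/s per GB")
```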

Performance Equivalence

How many units of each GPU are needed to match the performance of the others?
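Each entry in this section appears to be the ratio of the target GPU's value to the candidate GPU's value: a ratio above 1 reads as "Need N× candidate", and a ratio below 1 is inverted to express how far ahead the candidate is. The sketch below reproduces that logic under this assumption; `units_needed` and `describe` are hypothetical names, and the wording for an exact tie is a choice made for this example.

```python
def units_needed(target_value, candidate_value):
    """How many candidate units are needed to equal one target unit."""
    return target_value / candidate_value


def describe(candidate, ratio, verb="is", comparative="faster"):
    """Turn a ratio into the kind of label shown in the comparison."""
    if round(ratio, 2) == 1.00:
        # Tie handling is an assumption of this sketch.
        return f"{candidate} is equivalent"
    if ratio > 1:
        return f"Need {ratio:.2f}x {candidate}"
    # Candidate is ahead: report the inverse ratio.
    return f"{candidate} {verb} {1 / ratio:.2f}x {comparative}"


# FP32 compute: match 1x RTX 4500 Ada Generation (48.5 TFLOPs) with RTX 3090s (35.6 TFLOPs).
fp32_label = describe("GeForce RTX 3090", units_needed(48.5, 35.6))

# VRAM: match 1x RTX 4500 Ada Generation (24 GB) with an L40 (48 GB).
vram_label = describe("L40", units_needed(24, 48), verb="has", comparative="more")

# Memory bandwidth: match 1x RTX 4500 Ada Generation (576 GB/s) with a GH200 (4,000 GB/s).
bw_label = describe("GH200", units_needed(576, 4000), verb="has", comparative="more")
```

These ratios treat each metric in isolation; they say nothing about multi-GPU scaling overhead, which the page does not model either.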

To match 1x NVIDIA RTX 4500 Ada Generation

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

1.37x

Need 1.37x GeForce RTX 3090

FP32 Compute

1.36x

Need 1.36x GeForce RTX 3090

VRAM

1.00x

GeForce RTX 3090 has the same VRAM

Memory Bandwidth

0.62x

GeForce RTX 3090 has 1.63x more

NVIDIA GeForce RTX 3090 Ti

Compute (FP32-eq)

1.21x

Need 1.21x GeForce RTX 3090 Ti

FP32 Compute

1.21x

Need 1.21x GeForce RTX 3090 Ti

VRAM

1.00x

GeForce RTX 3090 Ti has the same VRAM

Memory Bandwidth

0.57x

GeForce RTX 3090 Ti has 1.75x more

NVIDIA L40

Compute (FP32-eq)

0.54x

L40 is 1.87x faster

FP32 Compute

1.08x

Need 1.08x L40

VRAM

0.50x

L40 has 2.00x more

Memory Bandwidth

0.67x

L40 has 1.50x more

NVIDIA P100

Compute (FP32-eq)

5.22x

Need 5.22x P100

FP32 Compute

5.22x

Need 5.22x P100

VRAM

1.50x

Need 1.50x P100

Memory Bandwidth

0.79x

P100 has 1.27x more

NVIDIA GH200

Compute (FP32-eq)

0.10x

GH200 is 10.20x faster

FP32 Compute

0.72x

GH200 is 1.38x faster

VRAM

0.25x

GH200 has 4.00x more

Memory Bandwidth

0.14x

GH200 has 6.94x more

To match 1x NVIDIA GeForce RTX 3090

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

0.73x

RTX 4500 Ada Generation is 1.37x faster

FP32 Compute

0.73x

RTX 4500 Ada Generation is 1.36x faster

VRAM

1.00x

RTX 4500 Ada Generation has the same VRAM

Memory Bandwidth

1.63x

Need 1.63x RTX 4500 Ada Generation

NVIDIA GeForce RTX 3090 Ti

Compute (FP32-eq)

0.89x

GeForce RTX 3090 Ti is 1.13x faster

FP32 Compute

0.89x

GeForce RTX 3090 Ti is 1.12x faster

VRAM

1.00x

GeForce RTX 3090 Ti has the same VRAM

Memory Bandwidth

0.93x

GeForce RTX 3090 Ti has 1.08x more

NVIDIA L40

Compute (FP32-eq)

0.39x

L40 is 2.55x faster

FP32 Compute

0.79x

L40 is 1.26x faster

VRAM

0.50x

L40 has 2.00x more

Memory Bandwidth

1.08x

Need 1.08x L40

NVIDIA P100

Compute (FP32-eq)

3.82x

Need 3.82x P100

FP32 Compute

3.83x

Need 3.83x P100

VRAM

1.50x

Need 1.50x P100

Memory Bandwidth

1.28x

Need 1.28x P100

NVIDIA GH200

Compute (FP32-eq)

0.07x

GH200 is 13.93x faster

FP32 Compute

0.53x

GH200 is 1.88x faster

VRAM

0.25x

GH200 has 4.00x more

Memory Bandwidth

0.23x

GH200 has 4.27x more

To match 1x NVIDIA GeForce RTX 3090 Ti

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

0.82x

RTX 4500 Ada Generation is 1.21x faster

FP32 Compute

0.82x

RTX 4500 Ada Generation is 1.21x faster

VRAM

1.00x

RTX 4500 Ada Generation has the same VRAM

Memory Bandwidth

1.75x

Need 1.75x RTX 4500 Ada Generation

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

1.13x

Need 1.13x GeForce RTX 3090

FP32 Compute

1.12x

Need 1.12x GeForce RTX 3090

VRAM

1.00x

GeForce RTX 3090 has the same VRAM

Memory Bandwidth

1.08x

Need 1.08x GeForce RTX 3090

NVIDIA L40

Compute (FP32-eq)

0.44x

L40 is 2.26x faster

FP32 Compute

0.89x

L40 is 1.13x faster

VRAM

0.50x

L40 has 2.00x more

Memory Bandwidth

1.17x

Need 1.17x L40

NVIDIA P100

Compute (FP32-eq)

4.30x

Need 4.30x P100

FP32 Compute

4.30x

Need 4.30x P100

VRAM

1.50x

Need 1.50x P100

Memory Bandwidth

1.38x

Need 1.38x P100

NVIDIA GH200

Compute (FP32-eq)

0.08x

GH200 is 12.36x faster

FP32 Compute

0.60x

GH200 is 1.68x faster

VRAM

0.25x

GH200 has 4.00x more

Memory Bandwidth

0.25x

GH200 has 3.97x more

To match 1x NVIDIA L40

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

1.87x

Need 1.87x RTX 4500 Ada Generation

FP32 Compute

0.93x

RTX 4500 Ada Generation is 1.08x faster

VRAM

2.00x

Need 2.00x RTX 4500 Ada Generation

Memory Bandwidth

1.50x

Need 1.50x RTX 4500 Ada Generation

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

2.55x

Need 2.55x GeForce RTX 3090

FP32 Compute

1.26x

Need 1.26x GeForce RTX 3090

VRAM

2.00x

Need 2.00x GeForce RTX 3090

Memory Bandwidth

0.92x

GeForce RTX 3090 has 1.08x more

NVIDIA GeForce RTX 3090 Ti

Compute (FP32-eq)

2.26x

Need 2.26x GeForce RTX 3090 Ti

FP32 Compute

1.13x

Need 1.13x GeForce RTX 3090 Ti

VRAM

2.00x

Need 2.00x GeForce RTX 3090 Ti

Memory Bandwidth

0.86x

GeForce RTX 3090 Ti has 1.17x more

NVIDIA P100

Compute (FP32-eq)

9.73x

Need 9.73x P100

FP32 Compute

4.84x

Need 4.84x P100

VRAM

3.00x

Need 3.00x P100

Memory Bandwidth

1.18x

Need 1.18x P100

NVIDIA GH200

Compute (FP32-eq)

0.18x

GH200 is 5.46x faster

FP32 Compute

0.67x

GH200 is 1.49x faster

VRAM

0.50x

GH200 has 2.00x more

Memory Bandwidth

0.22x

GH200 has 4.63x more

To match 1x NVIDIA P100

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

0.19x

RTX 4500 Ada Generation is 5.22x faster

FP32 Compute

0.19x

RTX 4500 Ada Generation is 5.22x faster

VRAM

0.67x

RTX 4500 Ada Generation has 1.50x more

Memory Bandwidth

1.27x

Need 1.27x RTX 4500 Ada Generation

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

0.26x

GeForce RTX 3090 is 3.82x faster

FP32 Compute

0.26x

GeForce RTX 3090 is 3.83x faster

VRAM

0.67x

GeForce RTX 3090 has 1.50x more

Memory Bandwidth

0.78x

GeForce RTX 3090 has 1.28x more

NVIDIA GeForce RTX 3090 Ti

Compute (FP32-eq)

0.23x

GeForce RTX 3090 Ti is 4.30x faster

FP32 Compute

0.23x

GeForce RTX 3090 Ti is 4.30x faster

VRAM

0.67x

GeForce RTX 3090 Ti has 1.50x more

Memory Bandwidth

0.73x

GeForce RTX 3090 Ti has 1.38x more

NVIDIA L40

Compute (FP32-eq)

0.10x

L40 is 9.73x faster

FP32 Compute

0.21x

L40 is 4.84x faster

VRAM

0.33x

L40 has 3.00x more

Memory Bandwidth

0.85x

L40 has 1.18x more

NVIDIA GH200

Compute (FP32-eq)

0.02x

GH200 is 53.17x faster

FP32 Compute

0.14x

GH200 is 7.20x faster

VRAM

0.17x

GH200 has 6.00x more

Memory Bandwidth

0.18x

GH200 has 5.46x more

To match 1x NVIDIA GH200

NVIDIA RTX 4500 Ada Generation

Compute (FP32-eq)

10.20x

Need 10.20x RTX 4500 Ada Generation

FP32 Compute

1.38x

Need 1.38x RTX 4500 Ada Generation

VRAM

4.00x

Need 4.00x RTX 4500 Ada Generation

Memory Bandwidth

6.94x

Need 6.94x RTX 4500 Ada Generation

NVIDIA GeForce RTX 3090

Compute (FP32-eq)

13.93x

Need 13.93x GeForce RTX 3090

FP32 Compute

1.88x

Need 1.88x GeForce RTX 3090

VRAM

4.00x

Need 4.00x GeForce RTX 3090

Memory Bandwidth

4.27x

Need 4.27x GeForce RTX 3090

NVIDIA GeForce RTX 3090 Ti

Compute (FP32-eq)

12.36x

Need 12.36x GeForce RTX 3090 Ti

FP32 Compute

1.68x

Need 1.68x GeForce RTX 3090 Ti

VRAM

4.00x

Need 4.00x GeForce RTX 3090 Ti

Memory Bandwidth

3.97x

Need 3.97x GeForce RTX 3090 Ti

NVIDIA L40

Compute (FP32-eq)

5.46x

Need 5.46x L40

FP32 Compute

1.49x

Need 1.49x L40

VRAM

2.00x

Need 2.00x L40

Memory Bandwidth

4.63x

Need 4.63x L40

NVIDIA P100

Compute (FP32-eq)

53.17x

Need 53.17x P100

FP32 Compute

7.20x

Need 7.20x P100

VRAM

6.00x

Need 6.00x P100

Memory Bandwidth

5.46x

Need 5.46x P100

Pricing

Price Type	RTX 4500 Ada Generation	GeForce RTX 3090	GeForce RTX 3090 Ti	L40	P100	GH200
CAPEX (Street Price)	—	—	—	—	—	—
OPEX (per hour)	—	$0.11/hr	$0.12/hr	$0.69/hr	$0.28/hr	$1.49/hr
Price per TFLOPs (FP32-eq)	—	—	—	—	—	—
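The "Price per TFLOPs" row is empty, presumably because no street price is listed. As a rough alternative, an hourly rental cost per TFLOP can be derived from the OPEX row. This is a different metric from the one the page intends, and both the function name `cost_per_tflop_hour` and the choice of the FP32 row as the denominator are assumptions of this sketch.

```python
def cost_per_tflop_hour(opex_per_hour, tflops):
    """Hourly rental cost divided by rated TFLOPs; None if no price is listed."""
    if opex_per_hour is None:
        return None
    return opex_per_hour / tflops


# OPEX (per hour) from the Pricing table; None marks a missing price.
opex = {
    "RTX 4500 Ada Generation": None,
    "GeForce RTX 3090": 0.11,
    "GeForce RTX 3090 Ti": 0.12,
    "L40": 0.69,
    "P100": 0.28,
    "GH200": 1.49,
}

# TFLOPs (FP32) from the Specifications table.
fp32_tflops = {
    "RTX 4500 Ada Generation": 48.5,
    "GeForce RTX 3090": 35.6,
    "GeForce RTX 3090 Ti": 40,
    "L40": 45,
    "P100": 9.3,
    "GH200": 67,
}

hourly_cost = {
    name: cost_per_tflop_hour(opex[name], fp32_tflops[name]) for name in opex
}
```

Rated TFLOPs are peak figures, so a ranking built this way is only a first-order guide to value.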