Compare XPUs

Select up to 5 XPUs to compare side-by-side

Showing 128 XPUs • 5 selected

Vendor	XPU	TFLOPs
Alibaba	Hanguang 800	—
AMD	MI100	23.1
AMD	MI210	181
AMD	MI250X	383
AMD	MI300X	1,307
AMD	MI325X	1,400
AMD	MI350X	2,100
AMD	MI355X	5,300
AMD	Radeon Pro V520	23.04
AWS	Inferentia2	190
AWS	Trainium	190
AWS	Trainium2	680
Baidu	Kunlun II	—
Biren Technology	BR100	—
Cambricon	MLU370	256
Cerebras	WSE-3	—
Enflame Technology	CloudBlazer T20	—
FuriosaAI	RNGD (Renegade)	256
FuriosaAI	Warboy	—
Google	TPU v4	275
Google	TPU v5e	197
Google	TPU v5p	459
Graphcore	Bow IPU	—
Graphcore	IPU-M2000	—
Groq	LPU Inference Engine	—
Huawei	Ascend 910B	—
Iluvatar CoreX	BI-V150	300
Intel	Data Center GPU Max 1100	177
Intel	Data Center GPU Max 1550	419
Intel Habana	Gaudi 2	432
Intel Habana	Gaudi 3	1,835

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Chart axes: Compute Performance (BF16), Memory Capacity, Power Consumption, Power Efficiency
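The "normalized to 100 = best in comparison" scaling can be sketched as follows. This is a minimal illustration, not the page's actual implementation: the function name `normalize_to_best` is hypothetical, and treating a lower value as better for Power Consumption is an assumption. The inputs are the VRAM and TDP figures from the Specifications table.

```python
def normalize_to_best(values, higher_is_better=True):
    """Scale a metric so that the best entry in the comparison scores 100."""
    if higher_is_better:
        best = max(values.values())
        return {name: 100.0 * v / best for name, v in values.items()}
    # Assumption: for "lower is better" metrics the smallest value scores 100.
    best = min(values.values())
    return {name: 100.0 * best / v for name, v in values.items()}


# VRAM (GB) and TDP (W) as listed in the Specifications table.
vram_gb = {
    "L4": 24,
    "GeForce RTX 5080": 16,
    "Quadro K620": 2,
    "Quadro P4000": 8,
    "GeForce RTX 3090": 24,
}
tdp_w = {
    "L4": 72,
    "GeForce RTX 5080": 360,
    "Quadro K620": 45,
    "Quadro P4000": 105,
    "GeForce RTX 3090": 350,
}

vram_score = normalize_to_best(vram_gb)
power_score = normalize_to_best(tdp_w, higher_is_better=False)

for name in vram_gb:
    print(f"{name}: VRAM {vram_score[name]:.1f}, power {power_score[name]:.1f}")
```

Under these assumptions the two 24 GB cards tie at 100 for Memory Capacity, and the 45 W Quadro K620 scores 100 for Power Consumption.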

Specifications

Specification	NVIDIA L4	NVIDIA GeForce RTX 5080	NVIDIA Quadro K620	NVIDIA Quadro P4000	NVIDIA GeForce RTX 3090
Architecture	Ada Lovelace	Blackwell	Kepler	Pascal	Ampere
Form Factor	PCIe	PCIe	PCIe	PCIe	PCIe
VRAM	24 GB	16 GB	2 GB	8 GB	24 GB
Memory Bandwidth	300 GB/s	960 GB/s	29 GB/s	243 GB/s	936 GB/s
TFLOPs (FP32)	30.3	56	0.768	5.304	35.6
TFLOPs (FP16)	242	—	—	—	—
TFLOPs	121	171	0.768	5.304	71
TFLOPs (FP8)	485	—	—	—	—
TDP	72 W	360 W	45 W	105 W	350 W
Launch Date	Mar 2023	Jan 2025	Jul 2014	Oct 2016	Sep 2020

Efficiency Metrics

Metric	L4	GeForce RTX 5080	Quadro K620	Quadro P4000	GeForce RTX 3090
TFLOPs per Watt (FP32-eq)	0.84	0.24	0.02	0.05	0.10
Memory Bandwidth per GB of VRAM	12.5 GB/s	60.0 GB/s	14.5 GB/s	30.4 GB/s	39.0 GB/s
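Both efficiency rows look like plain ratios of values in the Specifications table, and the sketch below reproduces them on that assumption. The helper names are hypothetical. The page does not define how the FP32-equivalent figure is derived for tensor-core cards, so the per-watt example is limited to the two cards whose only listed throughput is FP32.

```python
# Values copied from the Specifications table: (VRAM in GB, bandwidth in GB/s, TDP in W).
SPECS = {
    "L4": (24, 300, 72),
    "GeForce RTX 5080": (16, 960, 360),
    "Quadro K620": (2, 29, 45),
    "Quadro P4000": (8, 243, 105),
    "GeForce RTX 3090": (24, 936, 350),
}

# FP32 TFLOPs for the two cards with no other precision listed.
FP32_TFLOPS = {"Quadro K620": 0.768, "Quadro P4000": 5.304}


def bandwidth_per_gb(bandwidth_gb_s, vram_gb):
    """Memory bandwidth available per GB of VRAM."""
    return bandwidth_gb_s / vram_gb


def tflops_per_watt(tflops, tdp_w):
    """Throughput per watt of TDP."""
    return tflops / tdp_w


bw_per_gb = {
    name: round(bandwidth_per_gb(bw, vram), 1)
    for name, (vram, bw, _tdp) in SPECS.items()
}
fp32_per_watt = {
    name: round(tflops_per_watt(tf, SPECS[name][2]), 2)
    for name, tf in FP32_TFLOPS.items()
}

print(bw_per_gb)
print(fp32_per_watt)
```

The rounded results agree with the table: 12.5, 60.0, 14.5, 30.4 and 39.0 GB/s per GB, and 0.02 and 0.05 TFLOPs per watt for the two Quadro cards.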

Performance Equivalence

How many units of each GPU are needed to match the performance of the others?

To match 1x NVIDIA L4

GPU	Metric	Units needed	Comparison
NVIDIA GeForce RTX 5080	Compute (FP32-eq)	0.71x	GeForce RTX 5080 is 1.41x faster
NVIDIA GeForce RTX 5080	FP32 Compute	0.54x	GeForce RTX 5080 is 1.85x faster
NVIDIA GeForce RTX 5080	VRAM	1.50x	Need 1.50x GeForce RTX 5080
NVIDIA GeForce RTX 5080	Memory Bandwidth	0.31x	GeForce RTX 5080 has 3.20x more
NVIDIA Quadro K620	Compute (FP32-eq)	78.78x	Need 78.78x Quadro K620
NVIDIA Quadro K620	FP32 Compute	39.45x	Need 39.45x Quadro K620
NVIDIA Quadro K620	VRAM	12.00x	Need 12.00x Quadro K620
NVIDIA Quadro K620	Memory Bandwidth	10.34x	Need 10.34x Quadro K620
NVIDIA Quadro P4000	Compute (FP32-eq)	11.41x	Need 11.41x Quadro P4000
NVIDIA Quadro P4000	FP32 Compute	5.71x	Need 5.71x Quadro P4000
NVIDIA Quadro P4000	VRAM	3.00x	Need 3.00x Quadro P4000
NVIDIA Quadro P4000	Memory Bandwidth	1.23x	Need 1.23x Quadro P4000
NVIDIA GeForce RTX 3090	Compute (FP32-eq)	1.70x	Need 1.70x GeForce RTX 3090
NVIDIA GeForce RTX 3090	FP32 Compute	0.85x	GeForce RTX 3090 is 1.17x faster
NVIDIA GeForce RTX 3090	VRAM	1.00x	GeForce RTX 3090 has the same VRAM
NVIDIA GeForce RTX 3090	Memory Bandwidth	0.32x	GeForce RTX 3090 has 3.12x more

To match 1x NVIDIA GeForce RTX 5080

GPU	Metric	Units needed	Comparison
NVIDIA L4	Compute (FP32-eq)	1.41x	Need 1.41x L4
NVIDIA L4	FP32 Compute	1.85x	Need 1.85x L4
NVIDIA L4	VRAM	0.67x	L4 has 1.50x more
NVIDIA L4	Memory Bandwidth	3.20x	Need 3.20x L4
NVIDIA Quadro K620	Compute (FP32-eq)	111.33x	Need 111.33x Quadro K620
NVIDIA Quadro K620	FP32 Compute	72.92x	Need 72.92x Quadro K620
NVIDIA Quadro K620	VRAM	8.00x	Need 8.00x Quadro K620
NVIDIA Quadro K620	Memory Bandwidth	33.10x	Need 33.10x Quadro K620
NVIDIA Quadro P4000	Compute (FP32-eq)	16.12x	Need 16.12x Quadro P4000
NVIDIA Quadro P4000	FP32 Compute	10.56x	Need 10.56x Quadro P4000
NVIDIA Quadro P4000	VRAM	2.00x	Need 2.00x Quadro P4000
NVIDIA Quadro P4000	Memory Bandwidth	3.95x	Need 3.95x Quadro P4000
NVIDIA GeForce RTX 3090	Compute (FP32-eq)	2.41x	Need 2.41x GeForce RTX 3090
NVIDIA GeForce RTX 3090	FP32 Compute	1.57x	Need 1.57x GeForce RTX 3090
NVIDIA GeForce RTX 3090	VRAM	0.67x	GeForce RTX 3090 has 1.50x more
NVIDIA GeForce RTX 3090	Memory Bandwidth	1.03x	Need 1.03x GeForce RTX 3090

To match 1x NVIDIA Quadro K620

GPU	Metric	Units needed	Comparison
NVIDIA L4	Compute (FP32-eq)	0.01x	L4 is 78.78x faster
NVIDIA L4	FP32 Compute	0.03x	L4 is 39.45x faster
NVIDIA L4	VRAM	0.08x	L4 has 12.00x more
NVIDIA L4	Memory Bandwidth	0.10x	L4 has 10.34x more
NVIDIA GeForce RTX 5080	Compute (FP32-eq)	0.01x	GeForce RTX 5080 is 111.33x faster
NVIDIA GeForce RTX 5080	FP32 Compute	0.01x	GeForce RTX 5080 is 72.92x faster
NVIDIA GeForce RTX 5080	VRAM	0.13x	GeForce RTX 5080 has 8.00x more
NVIDIA GeForce RTX 5080	Memory Bandwidth	0.03x	GeForce RTX 5080 has 33.10x more
NVIDIA Quadro P4000	Compute (FP32-eq)	0.14x	Quadro P4000 is 6.91x faster
NVIDIA Quadro P4000	FP32 Compute	0.14x	Quadro P4000 is 6.91x faster
NVIDIA Quadro P4000	VRAM	0.25x	Quadro P4000 has 4.00x more
NVIDIA Quadro P4000	Memory Bandwidth	0.12x	Quadro P4000 has 8.38x more
NVIDIA GeForce RTX 3090	Compute (FP32-eq)	0.02x	GeForce RTX 3090 is 46.22x faster
NVIDIA GeForce RTX 3090	FP32 Compute	0.02x	GeForce RTX 3090 is 46.35x faster
NVIDIA GeForce RTX 3090	VRAM	0.08x	GeForce RTX 3090 has 12.00x more
NVIDIA GeForce RTX 3090	Memory Bandwidth	0.03x	GeForce RTX 3090 has 32.28x more

To match 1x NVIDIA Quadro P4000

GPU	Metric	Units needed	Comparison
NVIDIA L4	Compute (FP32-eq)	0.09x	L4 is 11.41x faster
NVIDIA L4	FP32 Compute	0.18x	L4 is 5.71x faster
NVIDIA L4	VRAM	0.33x	L4 has 3.00x more
NVIDIA L4	Memory Bandwidth	0.81x	L4 has 1.23x more
NVIDIA GeForce RTX 5080	Compute (FP32-eq)	0.06x	GeForce RTX 5080 is 16.12x faster
NVIDIA GeForce RTX 5080	FP32 Compute	0.09x	GeForce RTX 5080 is 10.56x faster
NVIDIA GeForce RTX 5080	VRAM	0.50x	GeForce RTX 5080 has 2.00x more
NVIDIA GeForce RTX 5080	Memory Bandwidth	0.25x	GeForce RTX 5080 has 3.95x more
NVIDIA Quadro K620	Compute (FP32-eq)	6.91x	Need 6.91x Quadro K620
NVIDIA Quadro K620	FP32 Compute	6.91x	Need 6.91x Quadro K620
NVIDIA Quadro K620	VRAM	4.00x	Need 4.00x Quadro K620
NVIDIA Quadro K620	Memory Bandwidth	8.38x	Need 8.38x Quadro K620
NVIDIA GeForce RTX 3090	Compute (FP32-eq)	0.15x	GeForce RTX 3090 is 6.69x faster
NVIDIA GeForce RTX 3090	FP32 Compute	0.15x	GeForce RTX 3090 is 6.71x faster
NVIDIA GeForce RTX 3090	VRAM	0.33x	GeForce RTX 3090 has 3.00x more
NVIDIA GeForce RTX 3090	Memory Bandwidth	0.26x	GeForce RTX 3090 has 3.85x more

To match 1x NVIDIA GeForce RTX 3090

GPU	Metric	Units needed	Comparison
NVIDIA L4	Compute (FP32-eq)	0.59x	L4 is 1.70x faster
NVIDIA L4	FP32 Compute	1.17x	Need 1.17x L4
NVIDIA L4	VRAM	1.00x	L4 has the same VRAM
NVIDIA L4	Memory Bandwidth	3.12x	Need 3.12x L4
NVIDIA GeForce RTX 5080	Compute (FP32-eq)	0.42x	GeForce RTX 5080 is 2.41x faster
NVIDIA GeForce RTX 5080	FP32 Compute	0.64x	GeForce RTX 5080 is 1.57x faster
NVIDIA GeForce RTX 5080	VRAM	1.50x	Need 1.50x GeForce RTX 5080
NVIDIA GeForce RTX 5080	Memory Bandwidth	0.97x	GeForce RTX 5080 has 1.03x more
NVIDIA Quadro K620	Compute (FP32-eq)	46.22x	Need 46.22x Quadro K620
NVIDIA Quadro K620	FP32 Compute	46.35x	Need 46.35x Quadro K620
NVIDIA Quadro K620	VRAM	12.00x	Need 12.00x Quadro K620
NVIDIA Quadro K620	Memory Bandwidth	32.28x	Need 32.28x Quadro K620
NVIDIA Quadro P4000	Compute (FP32-eq)	6.69x	Need 6.69x Quadro P4000
NVIDIA Quadro P4000	FP32 Compute	6.71x	Need 6.71x Quadro P4000
NVIDIA Quadro P4000	VRAM	3.00x	Need 3.00x Quadro P4000
NVIDIA Quadro P4000	Memory Bandwidth	3.85x	Need 3.85x Quadro P4000
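The "units needed" figures in this section appear to be the target GPU's value divided by the candidate GPU's value for each metric. The sketch below reproduces a few of them under that assumption. The function names are hypothetical, and the wording of the returned strings only imitates the page's labels. Inputs come from the Specifications table.

```python
import math

# Values copied from the Specifications table.
FP32_TFLOPS = {"L4": 30.3, "GeForce RTX 5080": 56, "Quadro K620": 0.768}
BANDWIDTH_GB_S = {"L4": 300, "GeForce RTX 5080": 960, "Quadro K620": 29}
VRAM_GB = {"L4": 24, "GeForce RTX 3090": 24}


def units_needed(target, candidate, metric):
    """Fractional number of candidate GPUs needed to match one target GPU."""
    return metric[target] / metric[candidate]


def describe(target, candidate, metric):
    """Summarize the comparison in words, treating a ratio of 1 as a tie."""
    ratio = units_needed(target, candidate, metric)
    if math.isclose(ratio, 1.0):
        return f"{candidate} matches {target}"
    if ratio > 1.0:
        return f"Need {ratio:.2f}x {candidate}"
    return f"{candidate} has {1.0 / ratio:.2f}x more"


fp32_ratio = units_needed("L4", "Quadro K620", FP32_TFLOPS)
print(f"{fp32_ratio:.2f}x")       # fractional units
print(math.ceil(fp32_ratio))      # whole cards you would actually have to buy
print(describe("L4", "GeForce RTX 5080", BANDWIDTH_GB_S))
print(describe("L4", "GeForce RTX 3090", VRAM_GB))
```

A ratio below 1 means a single candidate card already exceeds the target on that metric. The ratios are computed from peak specifications only, so they say nothing about multi-GPU scaling overhead.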

Pricing

Price Type	L4	GeForce RTX 5080	Quadro K620	Quadro P4000	GeForce RTX 3090
CAPEX (Street Price)	$4,000	—	—	—	—
OPEX (per hour)	$0.80/hr	$0.16/hr	$0.05/hr	$0.51/hr	$0.11/hr
Price per TFLOPs (FP32-eq)	$66	—	—	—	—
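The price-per-TFLOPs row can be read as street price divided by FP32-equivalent throughput. The page does not state the FP32-equivalent value for the L4. The 60.5 TFLOPs used below is an assumption, back-computed from the efficiency and equivalence figures on this page, and the function name is hypothetical.

```python
def price_per_tflops(street_price_usd, tflops):
    """Capital cost per TFLOP of peak throughput."""
    return street_price_usd / tflops


L4_STREET_PRICE_USD = 4000   # from the Pricing table
L4_FP32_EQ_TFLOPS = 60.5     # assumption: inferred, not listed on the page

l4_price_per_tflops = price_per_tflops(L4_STREET_PRICE_USD, L4_FP32_EQ_TFLOPS)
print(f"${l4_price_per_tflops:.0f} per TFLOPs (FP32-eq)")
```

With that assumed throughput the result rounds to the $66 shown in the table. The other four cards have no street price listed, so the same calculation cannot be made for them.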