Compare XPUs

Select up to 5 XPUs to compare side-by-side

Select XPUs to Compare

Vendor	XPU	TFLOPs
Alibaba	Hanguang 800	—
AMD	MI100	23.1
AMD	MI210	181
AMD	MI250X	383
AMD	MI300X	1,307
AMD	MI325X	1,400
AMD	MI350X	2,100
AMD	MI355X	5,300
AMD	Radeon Pro V520	23.04
AWS	Inferentia2	190
AWS	Trainium	190
AWS	Trainium2	680
Baidu	Kunlun II	—
Biren Technology	BR100	—
Cambricon	MLU370	256
Cerebras	WSE-3	—
Enflame Technology	CloudBlazer T20	—
FuriosaAI	RNGD (Renegade)	256
FuriosaAI	Warboy	—
Google	TPU v4	275
Google	TPU v5e	197
Google	TPU v5p	459
Graphcore	Bow IPU	—
Graphcore	IPU-M2000	—
Groq	LPU Inference Engine	—
Huawei	Ascend 910B	—
Iluvatar CoreX	BI-V150	300
Intel	Data Center GPU Max 1100	177
Intel	Data Center GPU Max 1550	419
Intel Habana	Gaudi 2	432
Intel Habana	Gaudi 3	1,835

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Chart metrics: Compute Performance (BF16), Memory Capacity, Power Consumption, Power Efficiency
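The "normalized to 100 = best in comparison" scaling can be sketched as below. This is a minimal illustration, not the site's actual code: the function name `normalize_to_best` and the `higher_is_better` flag are hypothetical, and treating the lowest value as best for power draw is an assumption. The input values are the VRAM figures from the Specifications table.

```python
from typing import Optional


def normalize_to_best(
    values: dict[str, Optional[float]],
    higher_is_better: bool = True,
) -> dict[str, float]:
    """Scale each value so the best entry in the comparison scores 100."""
    # Skip XPUs with no published figure (shown as "—" in the tables).
    present = {name: v for name, v in values.items() if v is not None and v > 0}
    if not present:
        return {}
    if higher_is_better:
        best = max(present.values())
        return {name: round(100 * v / best, 1) for name, v in present.items()}
    # Assumption: for metrics such as power draw, the lowest value is "best".
    best = min(present.values())
    return {name: round(100 * best / v, 1) for name, v in present.items()}


# VRAM figures (GB) taken from the Specifications table.
vram_gb = {
    "Groq LPU Inference Engine": 230,
    "Meta MTIA v1": 128,
    "Biren Technology BR100": 64,
    "AMD MI325X": 256,
    "NVIDIA GeForce RTX 2070": 8,
    "NVIDIA GeForce RTX 3090": 24,
}
memory_scores = normalize_to_best(vram_gb)
print(memory_scores)
```

With these inputs the MI325X, which has the most memory, scores 100 and every other XPU is expressed as a percentage of it.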

Specifications

Specification	Groq LPU Inference Engine	Meta MTIA v1	Biren Technology BR100	AMD MI325X	NVIDIA GeForce RTX 2070	NVIDIA GeForce RTX 3090
Architecture	TSP (Tensor Streaming Processor)	Meta Training & Inference Accelerator	Biren GPU	CDNA 3.5	Turing	Ampere
Form Factor	—	—	—	OAM	PCIe	PCIe
VRAM	230 GB	128 GB	64 GB	256 GB	8 GB	24 GB
Memory Bandwidth	—	—	—	6,000 GB/s	448 GB/s	936 GB/s
TFLOPs (FP32)	—	—	—	207	7.5	35.6
TFLOPs (FP16)	—	—	1,000	1,400	—	—
TFLOPs	—	—	—	1,400	7.5	71
TFLOPs (FP8)	—	—	—	2,800	—	—
TDP	300 W	400 W	550 W	800 W	175 W	350 W
Launch Date	Feb 2024	May 2023	Aug 2022	Oct 2024	Oct 2018	Sep 2020

Efficiency Metrics

Metric	LPU Inference Engine	MTIA v1	BR100	MI325X	GeForce RTX 2070	GeForce RTX 3090
TFLOPs per Watt (FP32-eq)	—	—	—	0.88	0.04	0.10
Memory Bandwidth per GB of VRAM	—	—	—	23.4 GB/s	56.0 GB/s	39.0 GB/s
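The two efficiency rows appear to be simple quotients of the specification values. The sketch below reproduces them under stated assumptions: the function names are hypothetical, and the FP32-equivalent figures (700, 7.5, and 35.5 TFLOPs) are inferred from the published ratios rather than stated anywhere on the page.

```python
def tflops_per_watt(tflops: float, tdp_watts: float) -> float:
    """Compute throughput per watt of TDP."""
    return tflops / tdp_watts


def bandwidth_per_gb(bandwidth_gb_s: float, vram_gb: float) -> float:
    """Memory bandwidth (GB/s) available per GB of memory capacity."""
    return bandwidth_gb_s / vram_gb


# (FP32-eq TFLOPs [inferred, an assumption], TDP in W, bandwidth in GB/s, VRAM in GB)
specs = {
    "MI325X": (700.0, 800, 6000, 256),
    "GeForce RTX 2070": (7.5, 175, 448, 8),
    "GeForce RTX 3090": (35.5, 350, 936, 24),
}

efficiency = {
    name: (
        round(tflops_per_watt(tflops, tdp), 2),
        round(bandwidth_per_gb(bandwidth, vram), 1),
    )
    for name, (tflops, tdp, bandwidth, vram) in specs.items()
}
for name, (per_watt, per_gb) in efficiency.items():
    print(f"{name}: {per_watt} TFLOPs/W, {per_gb} GB/s per GB")
```

The XPUs without published bandwidth or compute figures are left out, which matches the "—" cells in the table.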

Performance Equivalence

How many units of each XPU are needed to match the others on each available metric?

To match 1x Groq LPU Inference Engine

XPU	Metric	Units needed	Result
Meta MTIA v1	VRAM	1.80x	Need 1.80x MTIA v1
Biren Technology BR100	VRAM	3.59x	Need 3.59x BR100
AMD MI325X	VRAM	0.90x	MI325X has 1.11x more
NVIDIA GeForce RTX 2070	VRAM	28.75x	Need 28.75x GeForce RTX 2070
NVIDIA GeForce RTX 3090	VRAM	9.58x	Need 9.58x GeForce RTX 3090

To match 1x Meta MTIA v1

XPU	Metric	Units needed	Result
Groq LPU Inference Engine	VRAM	0.56x	LPU Inference Engine has 1.80x more
Biren Technology BR100	VRAM	2.00x	Need 2.00x BR100
AMD MI325X	VRAM	0.50x	MI325X has 2.00x more
NVIDIA GeForce RTX 2070	VRAM	16.00x	Need 16.00x GeForce RTX 2070
NVIDIA GeForce RTX 3090	VRAM	5.33x	Need 5.33x GeForce RTX 3090

To match 1x Biren Technology BR100

XPU	Metric	Units needed	Result
Groq LPU Inference Engine	VRAM	0.28x	LPU Inference Engine has 3.59x more
Meta MTIA v1	VRAM	0.50x	MTIA v1 has 2.00x more
AMD MI325X	VRAM	0.25x	MI325X has 4.00x more
NVIDIA GeForce RTX 2070	VRAM	8.00x	Need 8.00x GeForce RTX 2070
NVIDIA GeForce RTX 3090	VRAM	2.67x	Need 2.67x GeForce RTX 3090

To match 1x AMD MI325X

XPU	Metric	Units needed	Result
Groq LPU Inference Engine	VRAM	1.11x	Need 1.11x LPU Inference Engine
Meta MTIA v1	VRAM	2.00x	Need 2.00x MTIA v1
Biren Technology BR100	VRAM	4.00x	Need 4.00x BR100
NVIDIA GeForce RTX 2070	Compute (FP32-eq)	93.33x	Need 93.33x GeForce RTX 2070
NVIDIA GeForce RTX 2070	FP32 Compute	27.60x	Need 27.60x GeForce RTX 2070
NVIDIA GeForce RTX 2070	VRAM	32.00x	Need 32.00x GeForce RTX 2070
NVIDIA GeForce RTX 2070	Memory Bandwidth	13.39x	Need 13.39x GeForce RTX 2070
NVIDIA GeForce RTX 3090	Compute (FP32-eq)	19.72x	Need 19.72x GeForce RTX 3090
NVIDIA GeForce RTX 3090	FP32 Compute	5.81x	Need 5.81x GeForce RTX 3090
NVIDIA GeForce RTX 3090	VRAM	10.67x	Need 10.67x GeForce RTX 3090
NVIDIA GeForce RTX 3090	Memory Bandwidth	6.41x	Need 6.41x GeForce RTX 3090

To match 1x NVIDIA GeForce RTX 2070

XPU	Metric	Units needed	Result
Groq LPU Inference Engine	VRAM	0.03x	LPU Inference Engine has 28.75x more
Meta MTIA v1	VRAM	0.06x	MTIA v1 has 16.00x more
Biren Technology BR100	VRAM	0.13x	BR100 has 8.00x more
AMD MI325X	Compute (FP32-eq)	0.01x	MI325X is 93.33x faster
AMD MI325X	FP32 Compute	0.04x	MI325X is 27.60x faster
AMD MI325X	VRAM	0.03x	MI325X has 32.00x more
AMD MI325X	Memory Bandwidth	0.07x	MI325X has 13.39x more
NVIDIA GeForce RTX 3090	Compute (FP32-eq)	0.21x	GeForce RTX 3090 is 4.73x faster
NVIDIA GeForce RTX 3090	FP32 Compute	0.21x	GeForce RTX 3090 is 4.75x faster
NVIDIA GeForce RTX 3090	VRAM	0.33x	GeForce RTX 3090 has 3.00x more
NVIDIA GeForce RTX 3090	Memory Bandwidth	0.48x	GeForce RTX 3090 has 2.09x more

To match 1x NVIDIA GeForce RTX 3090

XPU	Metric	Units needed	Result
Groq LPU Inference Engine	VRAM	0.10x	LPU Inference Engine has 9.58x more
Meta MTIA v1	VRAM	0.19x	MTIA v1 has 5.33x more
Biren Technology BR100	VRAM	0.38x	BR100 has 2.67x more
AMD MI325X	Compute (FP32-eq)	0.05x	MI325X is 19.72x faster
AMD MI325X	FP32 Compute	0.17x	MI325X is 5.81x faster
AMD MI325X	VRAM	0.09x	MI325X has 10.67x more
AMD MI325X	Memory Bandwidth	0.16x	MI325X has 6.41x more
NVIDIA GeForce RTX 2070	Compute (FP32-eq)	4.73x	Need 4.73x GeForce RTX 2070
NVIDIA GeForce RTX 2070	FP32 Compute	4.75x	Need 4.75x GeForce RTX 2070
NVIDIA GeForce RTX 2070	VRAM	3.00x	Need 3.00x GeForce RTX 2070
NVIDIA GeForce RTX 2070	Memory Bandwidth	2.09x	Need 2.09x GeForce RTX 2070
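The equivalence figures above are consistent with dividing the target XPU's value by the candidate's value for the same metric. The following is a minimal sketch of that calculation, assuming this is how the ratios are derived; `units_needed` and `describe` are hypothetical names, and the wording logic only covers the capacity-style phrasing ("has Nx more").

```python
from typing import Optional


def units_needed(target: Optional[float], candidate: Optional[float]) -> Optional[float]:
    """How many candidate units it takes to match one target unit on a metric."""
    # A missing figure on either side means no comparison can be made.
    if target is None or candidate is None or candidate <= 0:
        return None
    return target / candidate


def describe(ratio: float, candidate_name: str) -> str:
    """Render the ratio the way the comparison cards phrase it."""
    if ratio >= 1:
        return f"Need {ratio:.2f}x {candidate_name}"
    # Below 1x, the candidate already exceeds the target; report by how much.
    return f"{candidate_name} has {1 / ratio:.2f}x more"


# VRAM in GB from the Specifications table: Groq LPU 230, MTIA v1 128, MI325X 256.
mtia_ratio = units_needed(230, 128)
mi325x_ratio = units_needed(230, 256)
print(describe(mtia_ratio, "MTIA v1"))
print(describe(mi325x_ratio, "MI325X"))
```

Returning `None` for a missing figure mirrors the page's behavior of listing only VRAM for XPUs that have no published compute or bandwidth numbers. For compute metrics the page phrases the sub-1x case as "is Nx faster" instead.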

Pricing

Price Type	LPU Inference Engine	MTIA v1	BR100	MI325X	GeForce RTX 2070	GeForce RTX 3090
CAPEX (Street Price)	—	—	—	$45,000	—	—
OPEX (per hour)	—	—	—	$36.92/hr	$0.04/hr	$0.11/hr
Price per TFLOPs (FP32-eq)	—	—	—	$64	—	—
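The price-per-TFLOPs row can be reproduced from the CAPEX cell once the price string is parsed. This is a sketch under assumptions: `parse_usd` and `price_per_tflops` are hypothetical helpers, and the 700 TFLOPs FP32-equivalent figure for the MI325X is inferred from the published $64 value, not stated on the page.

```python
from typing import Optional


def parse_usd(cell: str) -> Optional[float]:
    """Parse a price cell such as "$45,000" or "$36.92/hr"; "—" means no data."""
    text = cell.strip()
    if text in ("", "—"):
        return None
    # Drop the currency symbol, a trailing "/hr", and thousands separators.
    text = text.removeprefix("$").removesuffix("/hr").replace(",", "")
    return float(text)


def price_per_tflops(price_usd: Optional[float], tflops: Optional[float]) -> Optional[float]:
    """Street price divided by compute throughput, or None if either is missing."""
    if price_usd is None or tflops is None or tflops <= 0:
        return None
    return price_usd / tflops


mi325x_price = parse_usd("$45,000")
mi325x_cost = price_per_tflops(mi325x_price, 700.0)  # 700 is an inferred figure
print(f"${mi325x_cost:.0f} per TFLOPs")
```

XPUs without a street price yield `None`, which corresponds to the "—" cells in the Pricing table.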