Compare XPUs

Select up to 5 XPUs to compare side-by-side

Select XPUs to Compare

Showing 128 XPUs • 5 selected

Vendor	XPU	TFLOPs
Alibaba	Hanguang 800	—
AMD	MI100	23.1
AMD	MI210	181
AMD	MI250X	383
AMD	MI300X	1,307
AMD	MI325X	1,400
AMD	MI350X	2,100
AMD	MI355X	5,300
AMD	Radeon Pro V520	23.04
AWS	Inferentia2	190
AWS	Trainium	190
AWS	Trainium2	680
Baidu	Kunlun II	—
Biren Technology	BR100	—
Cambricon	MLU370	256
Cerebras	WSE-3	—
Enflame Technology	CloudBlazer T20	—
FuriosaAI	RNGD (Renegade)	256
FuriosaAI	Warboy	—
Google	TPU v4	275
Google	TPU v5e	197
Google	TPU v5p	459
Graphcore	Bow IPU	—
Graphcore	IPU-M2000	—
Groq	LPU Inference Engine	—
Huawei	Ascend 910B	—
Iluvatar CoreX	BI-V150	300
Intel	Data Center GPU Max 1100	177
Intel	Data Center GPU Max 1550	419
Intel Habana	Gaudi 2	432
Intel Habana	Gaudi 3	1,835

Multi-Metric Comparison

Relative performance across 5 key metrics (normalized to 100 = best in comparison)

Compute Performance (BF16)

Memory Capacity

Power Consumption

Power Efficiency
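
The normalization described above can be illustrated with a minimal sketch. It assumes the simplest reading of "normalized to 100 = best": each value is divided by the largest value in the comparison and multiplied by 100. The function name `normalize_to_best` is hypothetical, and the page does not say how lower-is-better metrics such as power consumption are handled, so that case is left out. The VRAM figures are the ones listed in the Specifications table.

```python
def normalize_to_best(values):
    """Scale each value so that the largest one in the comparison maps to 100."""
    best = max(values.values())
    return {name: 100.0 * value / best for name, value in values.items()}


# VRAM in GB, taken from the Specifications table.
vram_gb = {
    "AWS Inferentia2": 32,
    "AMD MI300X": 192,
    "NVIDIA GeForce GT 710": 2,
    "NVIDIA GeForce GT 730": 2,
    "NVIDIA GeForce GTX 1070": 8,
}

scores = normalize_to_best(vram_gb)
for name, score in scores.items():
    print(f"{name}: {score:.1f}")
```

Under this assumption the MI300X scores 100 on memory capacity and every other device is expressed as a percentage of it.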

Specifications

Specification	AWS Inferentia2	AMD MI300X	NVIDIA GeForce GT 710	NVIDIA GeForce GT 730	NVIDIA GeForce GTX 1070
Architecture	Inferentia Gen2	CDNA 3	Kepler	Kepler	Pascal
Form Factor	—	OAM	PCIe	PCIe	PCIe
VRAM	32 GB	192 GB	2 GB	2 GB	8 GB
Memory Bandwidth	—	5,300 GB/s	14.4 GB/s	28.5 GB/s	256 GB/s
TFLOPs (FP32)	—	163.4	0.366	0.693	6.463
TFLOPs (FP16)	—	1,307	—	—	—
TFLOPs	190	1,307	0.366	0.693	6.463
TFLOPs (FP8)	—	2,614	—	—	—
TDP	150 W	750 W	19 W	49 W	150 W
Launch Date	Nov 2022	Dec 2023	Mar 2014	Jun 2014	Jun 2016

Efficiency Metrics

Metric	Inferentia2	MI300X	GeForce GT 710	GeForce GT 730	GeForce GTX 1070
TFLOPs per Watt (FP32-eq)	0.63	0.87	0.02	0.01	0.04
Memory Bandwidth per GB of VRAM	—	27.6 GB/s	7.2 GB/s	14.3 GB/s	32.0 GB/s
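
The TFLOPs per Watt row appears to follow from the Specifications table under one assumption that the page does not state: the "FP32-eq" figure is half of the headline TFLOPs value when that value is a 16-bit number, and the plain FP32 value otherwise. The sketch below applies that assumed rule. The function names, the field layout, and the `is_16bit` flags are illustrative, not taken from the site.

```python
def fp32_equivalent(headline_tflops, is_16bit):
    """Assumed rule: halve a 16-bit headline figure, keep an FP32 figure as is."""
    return headline_tflops / 2 if is_16bit else headline_tflops


def tflops_per_watt(headline_tflops, is_16bit, tdp_watts):
    """FP32-equivalent throughput divided by TDP."""
    return fp32_equivalent(headline_tflops, is_16bit) / tdp_watts


# (headline TFLOPs, headline is a 16-bit figure?, TDP in watts)
# Values from the Specifications table; the 16-bit flags are an assumption.
specs = {
    "Inferentia2": (190.0, True, 150),
    "MI300X": (1307.0, True, 750),
    "GeForce GT 710": (0.366, False, 19),
    "GeForce GT 730": (0.693, False, 49),
    "GeForce GTX 1070": (6.463, False, 150),
}

per_watt = {
    name: round(tflops_per_watt(tflops, is_16bit, tdp), 2)
    for name, (tflops, is_16bit, tdp) in specs.items()
}
print(per_watt)
```

With these inputs the rounded results agree with the TFLOPs per Watt row, which suggests the halving rule is at least consistent with the published numbers.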

Performance Equivalence

How many units of each XPU are needed to match one unit of each of the others?
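
Each equivalence figure in this section is a plain ratio: the target device's value for a metric divided by the candidate device's value. A minimal sketch, using the MI300X and GeForce GT 710 rows of the Specifications table (the helper name `units_needed` and the dictionary keys are hypothetical):

```python
import math


def units_needed(target_value, candidate_value):
    """How many candidate devices it takes to equal one target device."""
    return target_value / candidate_value


# Values from the Specifications table.
mi300x = {"fp32_tflops": 163.4, "vram_gb": 192, "bandwidth_gbs": 5300}
gt710 = {"fp32_tflops": 0.366, "vram_gb": 2, "bandwidth_gbs": 14.4}

# Fractional ratios, as the page reports them.
ratios = {
    metric: round(units_needed(mi300x[metric], gt710[metric]), 2)
    for metric in mi300x
}

# Whole devices, since a fraction of a card cannot be installed.
whole_units = {
    metric: math.ceil(units_needed(mi300x[metric], gt710[metric]))
    for metric in mi300x
}

print(ratios)
print(whole_units)
```

A ratio below 1 means the candidate already exceeds the target on that metric, which is why the page switches its wording from "Need Nx" to "is Nx faster" or "has Nx more" in those cases. The ratios are a paper comparison of peak figures only and say nothing about interconnect or software overhead when many devices are combined.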

To match 1x AWS Inferentia2

AMD MI300X

Compute (FP32-eq)

0.15x

MI300X is 6.88x faster

VRAM

0.17x

MI300X has 6.00x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

259.56x

Need 259.56x GeForce GT 710

VRAM

16.00x

Need 16.00x GeForce GT 710

NVIDIA GeForce GT 730

Compute (FP32-eq)

137.09x

Need 137.09x GeForce GT 730

VRAM

16.00x

Need 16.00x GeForce GT 730

NVIDIA GeForce GTX 1070

Compute (FP32-eq)

14.70x

Need 14.70x GeForce GTX 1070

VRAM

4.00x

Need 4.00x GeForce GTX 1070

To match 1x AMD MI300X

AWS Inferentia2

Compute (FP32-eq)

6.88x

Need 6.88x Inferentia2

VRAM

6.00x

Need 6.00x Inferentia2

NVIDIA GeForce GT 710

Compute (FP32-eq)

1785.52x

Need 1785.52x GeForce GT 710

FP32 Compute

446.45x

Need 446.45x GeForce GT 710

VRAM

96.00x

Need 96.00x GeForce GT 710

Memory Bandwidth

368.06x

Need 368.06x GeForce GT 710

NVIDIA GeForce GT 730

Compute (FP32-eq)

943.00x

Need 943.00x GeForce GT 730

FP32 Compute

235.79x

Need 235.79x GeForce GT 730

VRAM

96.00x

Need 96.00x GeForce GT 730

Memory Bandwidth

185.96x

Need 185.96x GeForce GT 730

NVIDIA GeForce GTX 1070

Compute (FP32-eq)

101.11x

Need 101.11x GeForce GTX 1070

FP32 Compute

25.28x

Need 25.28x GeForce GTX 1070

VRAM

24.00x

Need 24.00x GeForce GTX 1070

Memory Bandwidth

20.70x

Need 20.70x GeForce GTX 1070

To match 1x NVIDIA GeForce GT 710

AWS Inferentia2

Compute (FP32-eq)

<0.01x

Inferentia2 is 259.56x faster

VRAM

0.06x

Inferentia2 has 16.00x more

AMD MI300X

Compute (FP32-eq)

<0.01x

MI300X is 1785.52x faster

FP32 Compute

<0.01x

MI300X is 446.45x faster

VRAM

0.01x

MI300X has 96.00x more

Memory Bandwidth

<0.01x

MI300X has 368.06x more

NVIDIA GeForce GT 730

Compute (FP32-eq)

0.53x

GeForce GT 730 is 1.89x faster

FP32 Compute

0.53x

GeForce GT 730 is 1.89x faster

VRAM

1.00x

GeForce GT 730 has the same amount (2 GB)

Memory Bandwidth

0.51x

GeForce GT 730 has 1.98x more

NVIDIA GeForce GTX 1070

Compute (FP32-eq)

0.06x

GeForce GTX 1070 is 17.66x faster

FP32 Compute

0.06x

GeForce GTX 1070 is 17.66x faster

VRAM

0.25x

GeForce GTX 1070 has 4.00x more

Memory Bandwidth

0.06x

GeForce GTX 1070 has 17.78x more

To match 1x NVIDIA GeForce GT 730

AWS Inferentia2

Compute (FP32-eq)

0.01x

Inferentia2 is 137.09x faster

VRAM

0.06x

Inferentia2 has 16.00x more

AMD MI300X

Compute (FP32-eq)

<0.01x

MI300X is 943.00x faster

FP32 Compute

<0.01x

MI300X is 235.79x faster

VRAM

0.01x

MI300X has 96.00x more

Memory Bandwidth

0.01x

MI300X has 185.96x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

1.89x

Need 1.89x GeForce GT 710

FP32 Compute

1.89x

Need 1.89x GeForce GT 710

VRAM

1.00x

GeForce GT 710 has the same amount (2 GB)

Memory Bandwidth

1.98x

Need 1.98x GeForce GT 710

NVIDIA GeForce GTX 1070

Compute (FP32-eq)

0.11x

GeForce GTX 1070 is 9.33x faster

FP32 Compute

0.11x

GeForce GTX 1070 is 9.33x faster

VRAM

0.25x

GeForce GTX 1070 has 4.00x more

Memory Bandwidth

0.11x

GeForce GTX 1070 has 8.98x more

To match 1x NVIDIA GeForce GTX 1070

AWS Inferentia2

Compute (FP32-eq)

0.07x

Inferentia2 is 14.70x faster

VRAM

0.25x

Inferentia2 has 4.00x more

AMD MI300X

Compute (FP32-eq)

0.01x

MI300X is 101.11x faster

FP32 Compute

0.04x

MI300X is 25.28x faster

VRAM

0.04x

MI300X has 24.00x more

Memory Bandwidth

0.05x

MI300X has 20.70x more

NVIDIA GeForce GT 710

Compute (FP32-eq)

17.66x

Need 17.66x GeForce GT 710

FP32 Compute

17.66x

Need 17.66x GeForce GT 710

VRAM

4.00x

Need 4.00x GeForce GT 710

Memory Bandwidth

17.78x

Need 17.78x GeForce GT 710

NVIDIA GeForce GT 730

Compute (FP32-eq)

9.33x

Need 9.33x GeForce GT 730

FP32 Compute

9.33x

Need 9.33x GeForce GT 730

VRAM

4.00x

Need 4.00x GeForce GT 730

Memory Bandwidth

8.98x

Need 8.98x GeForce GT 730

Pricing

Price Type	Inferentia2	MI300X	GeForce GT 710	GeForce GT 730	GeForce GTX 1070
CAPEX (Street Price)	—	$35,000	—	—	—
OPEX (per hour)	$6.49/hr	$10.40/hr	$0.07/hr	$0.04/hr	$0.04/hr
Price per TFLOPs (FP32-eq)	—	$54	—	—	—
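
The Price per TFLOPs figure can be reproduced as street price divided by FP32-equivalent throughput. The sketch below assumes, as the other derived figures on this page suggest but the page does not state, that the MI300X FP32-eq value is half of its 1,307 FP16 TFLOPs. The function name is hypothetical, and devices without a listed street price return `None` rather than a guessed value.

```python
def price_per_tflop(street_price_usd, fp32_eq_tflops):
    """Street price divided by FP32-equivalent TFLOPs, or None if no price is listed."""
    if street_price_usd is None:
        return None
    return street_price_usd / fp32_eq_tflops


# MI300X: $35,000 street price; FP32-eq assumed to be 1,307 / 2 TFLOPs.
mi300x_price = price_per_tflop(35_000, 1307 / 2)
print(f"MI300X: ${mi300x_price:.0f} per TFLOP")

# GeForce GTX 1070: no street price listed, so no figure is produced.
gtx1070_price = price_per_tflop(None, 6.463)
print(f"GeForce GTX 1070: {gtx1070_price}")
```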