AWS Inferentia2
Inferentia Gen2 Architecture
Active
Launched November 2022
Core Specifications
VendorAWS
ArchitectureInferentia Gen2
Form Factor—
VRAM32 GB
Memory Bandwidth—
TDP150 W
Compute Performance
| Precision | TFLOPs |
|---|---|
| BF16 | 190 |
Pricing
Cloud Rental (OPEX)
| Provider | Instance Type | Price per Hour | Region | As of |
|---|---|---|---|---|
| AWS | inf2.48xlarge (12x Inferentia2) | $6.49/hr | us-east-1 | Oct 2024 |
| AWS | inf2.48xlarge (12x Inferentia2) | $6.49/hr | us-east-1 | Oct 2024 |