AMD MI300X

MI300X is AMD’s first GPU to use CDNA 3.

Specifications

Each MI300X GPU has:¹

It also has a complex memory hierarchy that I don’t yet understand.³

The following are theoretical maximum performance in TFLOPS:¹

Data Type	VFMA	Matrix	Sparse
FP64	81.7	163.4
FP32	164.4	163.4
TF32		653.7	1307.4
FP16		1307.4	2614.9
BF16		1307.4	2614.9
FP8		2614.9	5229.8
INT32
INT8		2614.9	5229.8

A single 8-way OAM MI300X UBB is capable of hosting a copy of Llama 3.1 405B in FP16.⁴

The following platforms support MI300X GPUs:³

The following cloud providers sell MI300X: