~/best-gpu-for/llama-3-3-70b
Meta provider

Best GPU for Llama 3.3 70B

Ranked recommendations from the 42-GPU catalog for running Llama 3.3 70B (70.6B params). Top picks by quality, then by cost.

Top picks.

$ ./rank --by quality
Tightest budget
RTX A6000
48GB · runs at Q3_K_M
30 GB weights · 8.2 GB headroom
Best quality
2× H100 NVL
188GB · runs at FP16/BF16
smallest card supporting the best available quant
Apple Silicon
M2 Max 64
64GB · runs at Q5_K_M
smallest M-series that fits
Datacenter (FP16 ambition)
B200
192GB · runs at FP16/BF16
best-quality datacenter card that fits

Full ranked list.

$ ./vrambudget --rank llama-3-3-70b

Next steps.

$ ./next

Discussion.

$ gh discussion list

// sign in with github to leave a comment. threads live in the repo's discussions tab.