Head-to-head for local LLM inference. The honest comparison: VRAM, bandwidth, compute, and which of the 30 catalog models actually fit on each.
// showing 12 of 30 models; differing fits first
VRAM: M5 Max 128 has 96 GB more.
Bandwidth: RTX 5090 leads by 192%.
Compute: RTX 5090 leads by 1424% in TFLOPS.
Fit: M5 Max 128 fits 27 of the 30 catalog models · RTX 5090 fits 22.
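The percentage leads above are easier to compare as plain multipliers. The sketch below assumes the page computes a lead as (winner / loser - 1) × 100; that convention is a guess, not something the page states. The function names `percent_lead` and `lead_to_ratio` are hypothetical, and the 128 GB and 32 GB capacities are inferred from the product name and the 96 GB gap.

```python
def percent_lead(winner: float, loser: float) -> float:
    """Relative lead of winner over loser, in percent.

    Assumed convention: (winner / loser - 1) * 100.
    """
    if loser <= 0:
        raise ValueError("loser must be positive")
    return (winner / loser - 1.0) * 100.0


def lead_to_ratio(percent: float) -> float:
    """Convert a '+N%' lead back into a plain multiplier."""
    return 1.0 + percent / 100.0


if __name__ == "__main__":
    # Memory gap in GB, using capacities inferred from the page (128 vs 32).
    print(128 - 32)
    # Bandwidth lead of +192% expressed as a multiplier.
    print(round(lead_to_ratio(192), 2))
    # Compute lead of +1424% expressed as a multiplier.
    print(round(lead_to_ratio(1424), 2))
```

Under that assumption, +192% bandwidth is roughly a 2.9x advantage and +1424% TFLOPS is roughly 15x, while the same formula applied to memory would put the M5 Max 128 at +300%.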