OpenAI's August 2025 open-weight release. MoE; ships MXFP4-quantized at ~13GB on consumer cards.
// budgets shown at ctx 8K, concurrency 1, 15% safety headroom. Tune in the calculator →
// sign in with github to leave a comment. threads live in the repo's discussions tab.