OpenAI

gpt-oss 120B

OpenAI's open-weight flagship. Near o4-mini on core reasoning; MXFP4 deployment fits a single 80GB H100.

Parameters
117B
Family
OpenAI
Context
128K tokens
FP16 weights
234GB
// where you can run it
gpt-oss:120bLM StudiovLLMMLXoMLX
// hugging face stats (cached daily)
4.9M downloads · 5K likes · license: apache-2.0 · updated 8 months ago

What you need to run this.

$ ./vrambudget --model gpt-oss-120b --by quant

// budgets shown at ctx 8K, concurrency 1, 15% safety headroom. Tune in the calculator →

Alternatives at this size.

$ grep --params similar catalog.json

Discussion.

$ gh discussion list

// sign in with github to leave a comment. threads live in the repo's discussions tab.