OpenAI

gpt-oss 20B

OpenAI's August 2025 open-weight release. MoE; ships MXFP4-quantized at ~13GB on consumer cards.

Parameters

20.9B

Family

OpenAI

Context

128K tokens

FP16 weights

42GB

// where you can run it

// hugging face stats (cached daily)

7.9M downloads · 5K likes · license: apache-2.0 · updated 8 months ago

What you need to run this.

$ ./vrambudget --model gpt-oss-20b --by quant

// budgets shown at ctx 8K, concurrency 1, 15% safety headroom. Tune in the calculator →

$ grep --params similar catalog.json

$ gh discussion list

// sign in with github to leave a comment. threads live in the repo's discussions tab.