Qwen 2.5 32B

Alibaba's mid-range workhorse. Q4_K_M lands at ~18 GB, making it the canonical 30B-class model that fits on a 24 GB card.

Parameters: 32.5B
Family: Alibaba
Context: 128K tokens
FP16 weights: 65 GB
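
The FP16 figure follows from the parameter count at 2 bytes per weight. Below is a minimal sketch of that arithmetic. The helper names are hypothetical, and the bits-per-weight value for Q4_K_M is only back-computed from the ~18 GB figure quoted on this page, not taken from any quantization spec.

```python
def weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, ignoring file metadata."""
    return n_params * bits_per_weight / 8


def implied_bits_per_weight(size_bytes: float, n_params: float) -> float:
    """Inverse of weight_bytes: average bits per weight implied by a file size."""
    return size_bytes * 8 / n_params


PARAMS = 32.5e9  # parameter count from this page

fp16_gb = weight_bytes(PARAMS, 16) / 1e9
print(f"FP16 weights: {fp16_gb:.0f} GB")  # 65 GB

# Assumption: treat the page's "~18 GB" as exactly 18e9 bytes.
q4_bits = implied_bits_per_weight(18e9, PARAMS)
print(f"Q4_K_M implied average: {q4_bits:.2f} bits/weight")  # 4.43
```

Real quantized files mix several precisions across tensors, so the implied average is a rough sanity check rather than a property of the format.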

// hugging face stats (cached daily)

883K downloads · 350 likes · license: apache-2.0 · updated 1 year ago

What you need to run this.

$ ./vrambudget --model qwen-2-5-32b --by quant

// budgets shown at 8K context, concurrency 1, with 15% safety headroom.
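
One plausible way such a budget could be put together is weights plus KV cache, scaled by the headroom. The sketch below is not the site's actual formula. The function names are hypothetical, headroom is assumed to be a simple multiplier, and the layer count, KV-head count, and head dimension are assumed values for this model that should be checked against its published config.

```python
def kv_cache_bytes(ctx_tokens: int, n_layers: int, n_kv_heads: int,
                   head_dim: int, bytes_per_elem: int = 2,
                   concurrency: int = 1) -> int:
    """KV cache size: one K and one V tensor per layer, per token, per sequence."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem
    return per_token * ctx_tokens * concurrency


def vram_budget_bytes(weights: float, kv_cache: float,
                      headroom: float = 0.15) -> float:
    """Assumption: headroom is applied as a flat multiplier on the total."""
    return (weights + kv_cache) * (1 + headroom)


# Assumed architecture values (verify against the model's config.json).
N_LAYERS, N_KV_HEADS, HEAD_DIM = 64, 8, 128

kv = kv_cache_bytes(ctx_tokens=8192, n_layers=N_LAYERS,
                    n_kv_heads=N_KV_HEADS, head_dim=HEAD_DIM)

# Q4_K_M weights taken as the page's ~18 GB, treated here as 18e9 bytes.
budget = vram_budget_bytes(weights=18e9, kv_cache=kv)

print(f"KV cache at 8K ctx (FP16): {kv / 1e9:.2f} GB")
print(f"Budget with 15% headroom:  {budget / 1e9:.1f} GB")
```

Under these assumptions the total comes in just under 24 GB, which is consistent with the "fits on a 24 GB" framing above. Activation buffers and runtime overhead are not modeled.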
