Alibaba provider

Qwen 2.5 72B

Alibaba's 70B-class flagship. Its memory footprint is comparable to Llama 70B, though the two differ in where they score well on benchmarks.

Parameters: 72.7B
Family: Alibaba
Context: 128K tokens
FP16 weights: 145GB
// where you can run it
qwen2.5:72b · LM Studio · vLLM · MLX · oMLX
// hugging face stats (cached daily)
883K downloads · 940 likes · license: other · updated 1 year ago

What you need to run this.

$ ./vrambudget --model qwen-2-5-72b --by quant

// budgets shown at ctx 8K, concurrency 1, 15% safety headroom.
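
For orientation, here is a rough sketch of how a per-quant budget under those settings could be estimated. It is not the calculator's actual formula. Only the 72.7B parameter count and the 8K / concurrency 1 / 15% settings come from this page. The layer and KV-head shape, the bytes-per-weight table, reading 8K as 8192 tokens, and applying headroom as a flat multiplier are all assumptions made for illustration.

```python
from dataclasses import dataclass

GB = 1e9  # decimal gigabytes: 72.7e9 params * 2 bytes is about 145 GB


@dataclass(frozen=True)
class ModelShape:
    params: float   # total parameter count
    layers: int     # transformer blocks
    kv_heads: int   # key/value heads (grouped-query attention)
    head_dim: int   # dimension per attention head


# params is from this page; layers, kv_heads and head_dim are ASSUMED values
# for Qwen 2.5 72B and should be checked against the model's config file.
QWEN25_72B = ModelShape(params=72.7e9, layers=80, kv_heads=8, head_dim=128)

# ASSUMED idealized bytes per weight. Real quantized files also store
# scales and metadata, so actual sizes run somewhat larger.
BYTES_PER_WEIGHT = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weights_gb(model: ModelShape, quant: str) -> float:
    """Memory for the weights alone at the given precision."""
    return model.params * BYTES_PER_WEIGHT[quant] / GB


def kv_cache_gb(model: ModelShape, ctx_tokens: int, concurrency: int,
                kv_bytes: float = 2.0) -> float:
    """KV cache: one K and one V tensor per layer, per token, per sequence."""
    per_token = 2 * model.layers * model.kv_heads * model.head_dim * kv_bytes
    return per_token * ctx_tokens * concurrency / GB


def budget_gb(model: ModelShape, quant: str, ctx_tokens: int = 8192,
              concurrency: int = 1, headroom: float = 0.15) -> float:
    """Weights plus KV cache, scaled up by the safety headroom (ASSUMED multiplier)."""
    base = weights_gb(model, quant) + kv_cache_gb(model, ctx_tokens, concurrency)
    return base * (1.0 + headroom)


if __name__ == "__main__":
    for quant in BYTES_PER_WEIGHT:
        print(f"{quant}: {budget_gb(QWEN25_72B, quant):.1f} GB")
```

The sketch leaves out activation memory and runtime overhead, so treat its output as a lower bound rather than a figure to size hardware against.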

Alternatives at this size.

$ grep --params similar catalog.json
