Mistral 7B v0.3

The original 7B reference point. Still a reliable baseline, though its 32K-token context is smaller than that of the newer entries.

Parameters: 7.2B
Family: Mistral
Context: 32K tokens
FP16 weights: 14 GB
// where you can run it
mistral:7b · LM Studio · vLLM · MLX · oMLX
// hugging face stats (cached daily)
4.5M downloads · 3K likes · license: apache-2.0 · updated 5 months ago

What you need to run this.

$ ./vrambudget --model mistral-7b --by quant

// budgets shown at ctx 8K, concurrency 1, 15% safety headroom.
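
// a rough sketch of how a per-quant budget like this could be estimated. This is not the vrambudget tool's actual method. The parameter count, 8K context, concurrency 1, and 15% headroom come from this page; the layer count, KV-head count, head dimension, FP16 KV cache, the bytes-per-weight figures, and the choice to apply headroom as a multiplier are all assumptions made for illustration. Real quantized files carry scales and metadata, so expect them to run somewhat larger than these idealized numbers.

```python
# Rough VRAM budget sketch. Values marked "assumed" are NOT from the page.

PARAMS = 7.2e9          # parameter count from the spec list
CTX_TOKENS = 8 * 1024   # ctx 8K
CONCURRENCY = 1         # one sequence at a time
HEADROOM = 0.15         # 15% safety headroom, applied as a multiplier (assumed)

# Assumed architecture values used only for the KV-cache estimate.
N_LAYERS = 32
N_KV_HEADS = 8
HEAD_DIM = 128
KV_BYTES = 2            # assumed FP16 cache entries

# Assumed idealized bytes per weight for each quantization level.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}

GB = 1e9  # decimal gigabytes


def weights_gb(quant):
    """Size of the weights alone at the given quantization."""
    return PARAMS * BYTES_PER_PARAM[quant] / GB


def kv_cache_gb(ctx=CTX_TOKENS, concurrency=CONCURRENCY):
    """KV cache size: keys and values, per layer, per KV head, per token."""
    per_token = 2 * N_LAYERS * N_KV_HEADS * HEAD_DIM * KV_BYTES
    return per_token * ctx * concurrency / GB


def budget_gb(quant):
    """Weights plus KV cache, scaled up by the safety headroom."""
    return (weights_gb(quant) + kv_cache_gb()) * (1 + HEADROOM)


if __name__ == "__main__":
    for quant in BYTES_PER_PARAM:
        print(f"{quant:>5}: weights {weights_gb(quant):5.1f} GB, "
              f"budget {budget_gb(quant):5.1f} GB")
```

// under these assumptions the FP16 weights come to 7.2B × 2 bytes = 14.4 GB, in line with the 14 GB listed above, and the KV cache grows linearly with both context length and concurrency.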

Alternatives at this size.

$ grep --params similar catalog.json

Discussion.

$ gh discussion list
