Microsoft

Phi-4 Mini

3.8B Phi-4 distill. Strong reasoning at edge sizes; 128K context.

Parameters
3.8B
Family
Microsoft
Context
128K tokens
FP16 weights
7.6GB
// where you can run it
phi4-miniLM StudiovLLMMLXoMLX
// hugging face stats (cached daily)
1.6M downloads · 739 likes · license: mit · updated 5 months ago

What you need to run this.

$ ./vrambudget --model phi-4-mini --by quant

// budgets shown at ctx 8K, concurrency 1, 15% safety headroom. Tune in the calculator →

Alternatives at this size.

$ grep --params similar catalog.json

Discussion.

$ gh discussion list

// sign in with github to leave a comment. threads live in the repo's discussions tab.