Google provider

Gemma 2 9B

Google's 2024 9B. Solid baseline; shorter context than Gemma 4.

Parameters
9B
Family
Google
Context
8K tokens
FP16 weights
18GB
// where you can run it
gemma2:9bLM StudiovLLMMLXoMLX
// hugging face stats (cached daily)
410K downloads · 803 likes · license: gemma · updated 1 year ago

What you need to run this.

$ ./vrambudget --model gemma-2-9b --by quant

// budgets shown at ctx 8K, concurrency 1, 15% safety headroom. Tune in the calculator →

Alternatives at this size.

$ grep --params similar catalog.json

Discussion.

$ gh discussion list

// sign in with github to leave a comment. threads live in the repo's discussions tab.