Alibaba's mid-range workhorse. Q4_K_M lands at ~18 GB, making it the canonical 30B-class model that "fits on a 24 GB card".
// budgets shown at 8K context, concurrency 1, 15% safety headroom.
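A rough sketch of how a budget like this could be checked by hand. It is not the calculator's actual formula. The architecture numbers (64 layers, 8 KV heads, head dimension 128, fp16 cache) are hypothetical placeholders rather than this model's published specs, the function names are made up for illustration, and sizes are treated as GiB for simplicity.

```python
GIB = 1024 ** 3

def kv_cache_gib(layers, kv_heads, head_dim, ctx_tokens,
                 concurrency=1, bytes_per_elem=2):
    """Estimate KV cache size in GiB.

    The leading factor of 2 covers the separate K and V tensors;
    bytes_per_elem=2 assumes an fp16 cache.
    """
    total_bytes = (2 * layers * kv_heads * head_dim
                   * ctx_tokens * concurrency * bytes_per_elem)
    return total_bytes / GIB

def fits(vram_gib, weights_gib, kv_gib, headroom=0.15):
    """True if weights plus KV cache stay within VRAM minus the safety headroom."""
    usable = vram_gib * (1.0 - headroom)
    return weights_gib + kv_gib <= usable

# Hypothetical architecture values, chosen only to make the arithmetic concrete.
kv = kv_cache_gib(layers=64, kv_heads=8, head_dim=128, ctx_tokens=8192)
print(f"KV cache: {kv:.2f} GiB, fits on 24 GiB: {fits(24, 18, kv)}")
```

Under these assumed values the cache adds about 2 GiB on top of the ~18 GB of weights, which stays inside the 85% usable share of a 24 GB card. Doubling concurrency doubles the cache and pushes the total past that limit.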