The best open-weight coding model you can run on a 24 GB GPU in 2026 is Qwen3.6-27B at Q4. It scores 77.2 on SWE-bench Verified while fitting in about 17 GB, the highest coding skill per gigabyte you can actually load at home. DeepSeek V4 wins the leaderboard, but no consumer card can hold it.
Key Takeaways
- Qwen3.6-27B at Q4 gives the most coding skill per GB on a 24 GB card.
- DeepSeek V4 tops the leaderboard, but no home GPU can run it.
- GLM-4.7-Flash fits 24 GB and still clears 59 percent on SWE-bench.
- Qwen and Devstral ship Apache 2.0; the big models lean on MIT.
- Pick by the GPU you own, not by the top of the leaderboard.
Why Capability Per GB Beats the Leaderboard
Most 2026 roundups rank coding models by the score of a flagship variant that needs a multi-GPU server. For anyone running models at home, that number is a fantasy. The only figure that counts is how much coding skill fits in the VRAM you actually own.
Botmonster Tech




