Local LLM model page
Qwen 3 (8B)
One of the best 8B models ever made. Thinking mode + lightning fast. The new king of 8B.
Parameters
8B
Minimum RAM
8 GB
Model size
5.5 GB
Quantization
Q5_K_M
Can Qwen 3 (8B) run locally?
Qwen 3 (8B) is best suited for entry-level laptops and desktops. LocalClaw recommends Q5_K_M as the default quantization, with at least 8 GB RAM.
Search term for LM Studio or compatible runtimes: qwen3-8b
Hugging Face repository: lmstudio-community/Qwen3-8B-GGUF
chatcodestandardgeneralreasoning
Strengths
- 128K context window
- Hybrid thinking mode
- Apache 2.0 license
- Strong at math and reasoning
- Excellent multilingual
Limitations
- Needs 8GB+ RAM
- Chinese-centric training may affect some English tasks
Best use cases
- General chat
- Coding assistance
- Math and reasoning
- Long document analysis
- Multilingual translation
Benchmarks
Speed: 8/10
Quality: 8/10
Coding: 8/10
Reasoning: 8/10
Technical details
Developer: Alibaba Cloud (Qwen Team)
License: Apache 2.0
Context window: 131,072 tokens
Architecture: Transformer with Thinking/Non-Thinking hybrid, 128K context
Released: 2025-04