Local LLM model page

Qwen 3 (8B)

One of the best 8B models ever made. Thinking mode + lightning fast. The new king of 8B.

Parameters
8B
Minimum RAM
8 GB
Model size
5.5 GB
Quantization
Q5_K_M

Can Qwen 3 (8B) run locally?

Qwen 3 (8B) is best suited for entry-level laptops and desktops. LocalClaw recommends Q5_K_M as the default quantization, with at least 8 GB RAM.

Search term for LM Studio or compatible runtimes: qwen3-8b

Hugging Face repository: lmstudio-community/Qwen3-8B-GGUF

chatcodestandardgeneralreasoning

Strengths

  • 128K context window
  • Hybrid thinking mode
  • Apache 2.0 license
  • Strong at math and reasoning
  • Excellent multilingual

Limitations

  • Needs 8GB+ RAM
  • Chinese-centric training may affect some English tasks

Best use cases

  • General chat
  • Coding assistance
  • Math and reasoning
  • Long document analysis
  • Multilingual translation

Benchmarks

Speed: 8/10

Quality: 8/10

Coding: 8/10

Reasoning: 8/10

Technical details

Developer: Alibaba Cloud (Qwen Team)

License: Apache 2.0

Context window: 131,072 tokens

Architecture: Transformer with Thinking/Non-Thinking hybrid, 128K context

Released: 2025-04