Local LLM model page
Cogito (8B)
Hybrid reasoning model outperforming peers. Strong general + reasoning at 8B. 558K downloads.
Parameters
8B
Minimum RAM
8 GB
Model size
5 GB
Quantization
Q4_K_M
Can Cogito (8B) run locally?
Cogito (8B) is best suited for entry-level laptops and desktops. LocalClaw recommends Q4_K_M as the default quantization, with at least 8 GB RAM.
Search term for LM Studio or compatible runtimes: cogito-8b
Hugging Face repository: deepcogito/cogito-v1-preview-llama-8B-GGUF
chatreasoningstandardgeneral
Strengths
- Hybrid reasoning — outperforms peers
- Apache 2.0
- Toggleable thinking mode
- 558K downloads
Limitations
- English-only
- New — less tested
- Reasoning overhead
Best use cases
- Reasoning tasks
- General chat
- Analysis
- Problem solving
Benchmarks
Speed: 8/10
Quality: 7/10
Coding: 7/10
Reasoning: 8/10
Technical details
Developer: Deep Cogito
License: Apache 2.0
Context window: 131,072 tokens
Architecture: Hybrid reasoning Transformer (Llama 8B base)
Released: 2025-04