Local LLM model page

Cogito (8B)

Hybrid reasoning model outperforming peers. Strong general + reasoning at 8B. 558K downloads.

Parameters
8B
Minimum RAM
8 GB
Model size
5 GB
Quantization
Q4_K_M

Can Cogito (8B) run locally?

Cogito (8B) is best suited for entry-level laptops and desktops. LocalClaw recommends Q4_K_M as the default quantization, with at least 8 GB RAM.

Search term for LM Studio or compatible runtimes: cogito-8b

Hugging Face repository: deepcogito/cogito-v1-preview-llama-8B-GGUF

chatreasoningstandardgeneral

Strengths

  • Hybrid reasoning — outperforms peers
  • Apache 2.0
  • Toggleable thinking mode
  • 558K downloads

Limitations

  • English-only
  • New — less tested
  • Reasoning overhead

Best use cases

  • Reasoning tasks
  • General chat
  • Analysis
  • Problem solving

Benchmarks

Speed: 8/10

Quality: 7/10

Coding: 7/10

Reasoning: 8/10

Technical details

Developer: Deep Cogito

License: Apache 2.0

Context window: 131,072 tokens

Architecture: Hybrid reasoning Transformer (Llama 8B base)

Released: 2025-04