Local LLM model page

Command A (111B)

Cohere open-weight flagship optimised for agentic workflows and long-context RAG. 256K context, excellent multilingual coverage (23 languages). CC-BY-NC 4.0 — non-commercial.

Parameters
111B
Minimum RAM
96 GB
Model size
68 GB
Quantization
Q4_K_M

Can Command A (111B) run locally?

Command A (111B) is best suited for large-memory workstations. LocalClaw recommends Q4_K_M as the default quantization, with at least 96 GB RAM.
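The listed figures can be sanity-checked with back-of-the-envelope arithmetic. The sketch below assumes Q4_K_M averages roughly 4.9 bits per weight (an approximation for llama.cpp's mixed 4/6-bit blocks, not a number from this page):

```python
def gguf_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Rough GGUF file size in decimal GB for a quantized model."""
    return params_billions * 1e9 * bits_per_weight / 8 / 1e9

# Q4_K_M averages roughly 4.9 bits per weight (mixed 4/6-bit blocks).
weights_gb = gguf_size_gb(111, 4.9)
print(f"weights: ~{weights_gb:.0f} GB")  # ~68 GB, matching the listed model size

# The runtime needs headroom beyond the weights: KV cache for the context,
# compute buffers, and the OS itself -- hence the 96 GB minimum-RAM figure.
```

The gap between the 68 GB file and the 96 GB recommendation is that headroom; longer contexts grow the KV cache and eat into it quickly.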

Search term for LM Studio or compatible runtimes: command-a-111b

Hugging Face repository: CohereLabs/command-a-03-2025-GGUF
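For llama.cpp users, a minimal fetch-and-run sketch looks like the following. It assumes the Hugging Face CLI and llama.cpp are already installed; the output filename is hypothetical (check what the download actually produces), and exact flags vary by version:

```shell
# Pull only the Q4_K_M shards from the repository listed above.
huggingface-cli download CohereLabs/command-a-03-2025-GGUF \
  --include "*Q4_K_M*" --local-dir ./models

# Start an interactive session; adjust -m to the downloaded filename
# (large models ship as multi-part GGUF files) and -c to your RAM budget.
llama-cli -m ./models/command-a-Q4_K_M.gguf -c 8192 \
  -p "You are a helpful assistant."
```

LM Studio users can skip both steps and search for command-a-111b directly, as noted above.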

Tags: chat, reasoning, quality, general, power

Strengths

  • Open-weight flagship from Cohere, optimised for agentic workflows
  • 256K context window, well suited to long-context RAG
  • Excellent multilingual coverage (23 languages)

Limitations

  • CC-BY-NC 4.0 licence restricts use to non-commercial applications
  • Performance depends heavily on quantization, RAM bandwidth and runtime support

Best use cases

  • chat
  • reasoning
  • quality
  • general
  • power

Benchmarks

Speed: 2/10

Quality: 10/10

Coding: 9/10

Reasoning: 10/10

Technical details

Developer: Cohere

License: CC-BY-NC 4.0 (non-commercial)

Context window: 256K tokens

Architecture: See model card

Released: 2025-03