Local TTS model

MOSS-TTS-Nano

Tiny 0.1B multilingual speech generation model for CPU-friendly real-time TTS. Supports voice cloning, streaming, ONNX CPU inference and MLX Audio on Apple Silicon.

Edge ready text-to-speech generation 20 languages Apache 2.0
Quality
8.5/10
Speed
9.7/10
Model size
0.1 GB
Voices
Voice cloning + long-text chunked generation

Can MOSS-TTS-Nano run locally?

MOSS-TTS-Nano can generate speech locally for private voice workflows. Start with pip install moss-tts-nano.

Apache 2.0 license. Still verify upstream usage notes before shipping.

cloningstreamingrealtimemultilinguallow-latency

Audio profile

Quality
8.5
Speed
9.7
Local
9.1

Best fit

MOSS-TTS-Nano is best for local voice cloning and expressive speech generation.

Hardware: cpugpuappleedge

Model details

Type
Local TTS model
Family
moss
Latency
ultra-low
Formats
pytorchonnxmlx
Languages
zh, en, de, es, fr, ja, it, hu, ko, ru, fa, ar, pl, pt, cs, da, sv, el, tr
Context
0.1B params, 20 languages, ONNX CPU version runs smoothly on MacBook Air M4

Install locally

01
Check runtimeConfirm the backend supports pytorch, onnx, mlx on your machine.
02
Install modelUse the upstream command or repository instructions.
03
Test locallyRun a short private audio prompt before moving into production workflows.
pip install moss-tts-nano

Good for

  • text-to-speech generation
  • Edge ready local workflows
  • cloning, streaming, realtime

Watch before shipping

  • Validate pronunciation, latency and artifacts with your own voice samples.
  • Review the upstream license and acceptable-use notes.
  • Benchmark on your target CPU, Apple Silicon or GPU setup.

Related TTS and speech models

CompareBrowse all TTS models Local AIBrowse LLM models macOS appGet LocalClaw