Question 1

What is MOSS-TTS-Nano?

Accepted Answer

MOSS-TTS-Nano is a compact, open-source text-to-speech model with 100 million parameters. It offers built-in voices in three languages and voice cloning across 18 languages. The model uses an autoregressive audio tokenizer combined with an LLM pipeline to produce high-quality 48 kHz stereo audio, and it is designed to run efficiently without a GPU.
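To put the 100 million parameter figure in perspective, the raw weight size can be estimated with simple arithmetic. This is only a rough sketch: the storage precisions below are assumptions for illustration, not values from the MOSS-TTS-Nano release, and the estimate ignores the audio tokenizer, activations, and runtime overhead.

```python
def weight_size_mb(num_params: int, bytes_per_param: int) -> float:
    """Size of the raw weights in megabytes (1 MB = 10**6 bytes)."""
    return num_params * bytes_per_param / 1_000_000


NUM_PARAMS = 100_000_000  # "100 million parameters" from the answer above

# Bytes per parameter for common storage formats.
# ASSUMPTION: these formats are illustrative; the source does not say
# which precision the released weights use.
PRECISIONS = {"float32": 4, "float16": 2, "int8": 1}

for name, width in PRECISIONS.items():
    print(f"{name}: about {weight_size_mb(NUM_PARAMS, width):.0f} MB")
```

Under these assumptions the weights come to a few hundred megabytes at most, which is consistent with a model intended to run on a CPU.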

Question 2

Who made MOSS-TTS-Nano?

Accepted Answer

MOSS-TTS-Nano was developed by MOSI.AI and the OpenMOSS team. It is released under the Apache 2.0 license, making it freely available for both personal and commercial use.

Question 3

What languages does MOSS-TTS-Nano support?

Accepted Answer

For built-in voices, MOSS-TTS-Nano supports English, Chinese, and Japanese. For voice cloning, it supports 18 languages: English, Chinese, Japanese, Korean, German, Spanish, French, Italian, Hungarian, Russian, Arabic, Polish, Portuguese, Czech, Danish, Swedish, Greek, and Turkish. MOSS-TTS-Nano is the only model in the free tool that supports cloning in languages other than English.
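The two language lists above can be captured as a small lookup, which is handy when deciding which mode to use for a given language. This is a minimal sketch: the names `BUILT_IN_VOICE_LANGUAGES`, `CLONING_LANGUAGES`, and `supported_modes` are hypothetical helpers, not part of any MOSS-TTS-Nano API.

```python
# Languages with built-in voices, as listed in the answer above.
BUILT_IN_VOICE_LANGUAGES = {"English", "Chinese", "Japanese"}

# Languages supported for voice cloning, as listed in the answer above.
CLONING_LANGUAGES = {
    "English", "Chinese", "Japanese", "Korean", "German", "Spanish",
    "French", "Italian", "Hungarian", "Russian", "Arabic", "Polish",
    "Portuguese", "Czech", "Danish", "Swedish", "Greek", "Turkish",
}


def supported_modes(language: str) -> list[str]:
    """Return the modes ("built-in", "cloning") available for a language."""
    modes = []
    if language in BUILT_IN_VOICE_LANGUAGES:
        modes.append("built-in")
    if language in CLONING_LANGUAGES:
        modes.append("cloning")
    return modes


print(supported_modes("Japanese"))
print(supported_modes("Korean"))
```

A language outside both lists returns an empty list, which a caller can treat as "not supported".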

Question 4

How does voice cloning work in MOSS-TTS-Nano?

Accepted Answer

Upload a short audio sample of the voice you want to clone, then type the text you want spoken. MOSS-TTS-Nano will generate speech in that voice. For best results, use a clear recording with a single speaker and minimal background noise. You can clone voices in any of the 18 supported languages.
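Before uploading a reference clip, it can help to check its basic properties. The sketch below uses Python's standard `wave` module and assumes the clip is an uncompressed PCM WAV file. The thresholds (`max_seconds`, `min_rate_hz`) are hypothetical values chosen for illustration; the source only says the clip should be short and clear. The check cannot detect multiple speakers or background noise.

```python
import io
import wave


def describe_wav(source) -> dict:
    """Read basic properties of a PCM WAV file (path string or file-like object)."""
    with wave.open(source, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "duration_s": frames / rate,
        }


def reference_clip_warnings(info: dict, max_seconds: float = 30.0,
                            min_rate_hz: int = 16000) -> list[str]:
    """Flag clips that are long or low-rate. Thresholds are ASSUMED, not official."""
    warnings = []
    if info["duration_s"] > max_seconds:
        warnings.append("clip is longer than the assumed maximum; trim it")
    if info["sample_rate_hz"] < min_rate_hz:
        warnings.append("sample rate is below the assumed minimum")
    return warnings


def silent_wav(seconds: int, rate_hz: int) -> io.BytesIO:
    """Build an in-memory mono 16-bit WAV of silence, used only for the demo."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(rate_hz)
        wav.writeframes(b"\x00\x00" * rate_hz * seconds)
    buffer.seek(0)
    return buffer


info = describe_wav(silent_wav(seconds=1, rate_hz=16000))
print(info, reference_clip_warnings(info))
```

In practice you would pass the path of your own recording to `describe_wav` instead of the synthetic clip.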

Question 5

What is the audio quality like?

Accepted Answer

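MOSS-TTS-Nano outputs 48 kHz stereo audio. Many other TTS models output mono audio at 22 kHz or 24 kHz, so MOSS-TTS-Nano delivers at least twice the sample rate and two channels instead of one. The higher sample rate preserves more high-frequency detail, which can make speech sound richer and more natural.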
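The difference between these formats can be made concrete with a little arithmetic. The sketch below compares samples per second, the highest representable frequency (half the sample rate, the Nyquist limit), and the uncompressed bit rate. The 16-bit sample depth is an assumption for illustration, and "22 kHz" is taken to mean the common 22.05 kHz rate; neither detail comes from the source.

```python
def samples_per_second(sample_rate_hz: int, channels: int) -> int:
    """Total samples produced per second across all channels."""
    return sample_rate_hz * channels


def nyquist_hz(sample_rate_hz: int) -> float:
    """Highest frequency a given sample rate can represent."""
    return sample_rate_hz / 2


def pcm_bits_per_second(sample_rate_hz: int, channels: int, bit_depth: int) -> int:
    """Uncompressed PCM bit rate. The bit depth is an ASSUMED parameter."""
    return sample_rate_hz * channels * bit_depth


# (label, sample rate in Hz, channel count)
FORMATS = [
    ("48 kHz stereo", 48_000, 2),
    ("24 kHz mono", 24_000, 1),
    ("22.05 kHz mono", 22_050, 1),  # assumed meaning of "22 kHz"
]

for label, rate, channels in FORMATS:
    print(
        f"{label}: {samples_per_second(rate, channels)} samples/s, "
        f"up to {nyquist_hz(rate):.0f} Hz, "
        f"{pcm_bits_per_second(rate, channels, 16)} bit/s at 16-bit"
    )
```

Under these assumptions, 48 kHz stereo carries four times as many samples per second as 24 kHz mono and can represent frequencies up to 24 kHz rather than 12 kHz.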

Question 6

Is MOSS-TTS-Nano free?

Accepted Answer

Yes. MOSS-TTS-Nano is open-source software released under the Apache 2.0 license. On this site, it runs directly in your browser with no account, no signup, and no usage limits.

Question 7

How does MOSS-TTS-Nano compare to the full MOSS-TTS model?

Accepted Answer

MOSS-TTS-Nano is a smaller, more efficient version designed to run in real time on CPUs and in the browser. The full MOSS-TTS model is larger and may produce higher-fidelity output in some cases, but MOSS-TTS-Nano retains the core capabilities, including multilingual voice cloning and 48 kHz stereo output.

Question 8

What browsers are supported?

Accepted Answer

Chrome, Edge, and other Chromium-based browsers work best. Firefox and Safari have limited support for the WebGPU and WASM features that MOSS-TTS-Nano uses for acceleration. For the best streaming experience, use Chrome.

Try MOSS-TTS-Nano Online for Free

Why Use MOSS-TTS-Nano

Multilingual Voice Cloning

48 kHz Stereo Output

Real-Time Streaming

Runs in Your Browser

How It Works

Open the free tool and choose your mode

Type your text

Generate and listen

Who Is MOSS-TTS-Nano For?

Multilingual Content Creators

Localization Teams

Developers and Researchers

Accessibility

Tips for Best Results

Use clean reference audio for cloning

Try different built-in voices for each language

Streaming works best in Chrome

Keep reference clips short and focused

Match the language to your use case

Need more languages, speed, or a commercial license?

Voice Cloning in 600+ Languages

GPU-Accelerated Processing

Voice Design from Text

Advanced Voice Cloning

Local REST API

Commercial License

Common Questions