Question 1

What is Supertonic?

Accepted Answer

Supertonic is an on-device multilingual text-to-speech system from Supertone. Supertonic 3 has about 99 million parameters and supports 31 languages with 10 preset voices. Although it is a fraction of the size of larger open TTS systems (0.7B to 2B parameters), it produces natural-sounding speech and runs entirely on-device with no cloud dependency.

Question 2

Who made Supertonic?

Accepted Answer

Supertonic is developed by Supertone, a voice AI company. The open-weight ONNX checkpoint is released on Hugging Face under the MIT license, so you can use it freely in personal or commercial projects.

Question 3

What languages does Supertonic support?

Accepted Answer

Supertonic 3 supports 31 languages: English, Korean, Japanese, Arabic, Bulgarian, Czech, Danish, German, Greek, Spanish, Estonian, Finnish, French, Hindi, Croatian, Hungarian, Indonesian, Italian, Lithuanian, Latvian, Dutch, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Swedish, Turkish, Ukrainian, and Vietnamese, plus a language-agnostic mode.

Question 4

How many voices does Supertonic have?

Accepted Answer

Supertonic ships with 10 preset voice styles: five male (M1 through M5) and five female (F1 through F5). Each voice can speak any of the supported languages.
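As an illustration of the naming scheme above, the ten preset IDs can be modelled as a closed set so that a typo such as "M6" is caught before synthesis. This is only a sketch: `PRESET_VOICES`, `isVoiceId`, and `voicesByGender` are hypothetical names, not part of any Supertonic API.

```typescript
// The ten preset voice IDs named in the answer above.
const PRESET_VOICES = [
  "M1", "M2", "M3", "M4", "M5",
  "F1", "F2", "F3", "F4", "F5",
] as const;

// Union type "M1" | "M2" | ... | "F5", derived from the list.
type VoiceId = (typeof PRESET_VOICES)[number];

// Hypothetical helper: narrows an arbitrary string to a known voice ID.
function isVoiceId(value: string): value is VoiceId {
  return (PRESET_VOICES as readonly string[]).indexOf(value) !== -1;
}

// Hypothetical helper: lists the voices for one group ("M" or "F").
function voicesByGender(gender: "M" | "F"): VoiceId[] {
  return PRESET_VOICES.filter((id) => id.charAt(0) === gender);
}

console.log(isVoiceId("F2")); // true
console.log(isVoiceId("M6")); // false
console.log(voicesByGender("F").join(", ")); // F1, F2, F3, F4, F5
```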

Question 5

Is Supertonic free?

Accepted Answer

Yes. Supertonic 3 is released under the MIT license, one of the most permissive open-source licenses. On this site it runs directly in your browser with no account, no signup, and no usage caps.

Question 6

How does Supertonic compare to larger TTS models?

Accepted Answer

At about 99 million parameters, Supertonic 3 is a fraction of the size of open TTS systems with 0.7B to 2B parameters while staying competitive on quality benchmarks. The smaller model size means faster cold starts, smaller downloads, and lower memory usage, which is what makes browser inference practical. For voice cloning, voice design, and 600+ languages, Voice Creator Pro is the desktop counterpart.
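To put "a fraction of the size" into numbers, the parameter counts quoted above work out to roughly a 7x to 20x difference. The sketch below does that arithmetic. The last line rests on an assumption the answers here do not state: if each weight were stored as a 32-bit float (4 bytes), 99 million parameters would occupy about 396 MB, which would be in line with the roughly 400 MB download quoted elsewhere in this FAQ.

```typescript
// Parameter count quoted for Supertonic 3.
const SUPERTONIC_PARAMS = 99e6;

// How many times larger another model is, by parameter count.
function sizeRatio(otherParams: number): number {
  return otherParams / SUPERTONIC_PARAMS;
}

// Raw weight size in megabytes (1 MB = 1e6 bytes).
// bytesPerParam is an ASSUMPTION supplied by the caller, not a published figure.
function weightMegabytes(params: number, bytesPerParam: number): number {
  return (params * bytesPerParam) / 1e6;
}

console.log(sizeRatio(0.7e9).toFixed(1)); // "7.1"
console.log(sizeRatio(2e9).toFixed(1)); // "20.2"
console.log(weightMegabytes(SUPERTONIC_PARAMS, 4)); // 396 (assuming 4-byte floats)
```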

Question 7

Do I need a powerful computer to run Supertonic?

Accepted Answer

No. Supertonic 3 is about a 400 MB download on first use and is cached locally afterwards. It runs best in browsers with WebGPU support (Chrome, Edge), where it can use your GPU for inference; in other browsers it falls back to WebAssembly automatically. Any recent laptop or desktop can run it, though the first download takes longer than it would for a smaller model.
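The WebGPU-first behaviour with a WebAssembly fallback described above could be sketched as follows. This is not the site's actual code: `pickBackend` and `NavigatorLike` are hypothetical names. The only real browser API relied on is `navigator.gpu.requestAdapter()`, which resolves to `null` when no suitable GPU adapter is available.

```typescript
type Backend = "webgpu" | "wasm";

// Minimal structural type for the part of `navigator` this sketch touches,
// so the function can also be exercised outside a browser.
interface NavigatorLike {
  gpu?: { requestAdapter(): Promise<unknown> };
}

// Hypothetical helper: prefer WebGPU, otherwise fall back to WebAssembly.
async function pickBackend(nav: NavigatorLike): Promise<Backend> {
  // Browsers without WebGPU do not expose `navigator.gpu` at all.
  if (!nav.gpu) return "wasm";
  try {
    // The API can exist while no usable adapter does (resolves to null).
    const adapter = await nav.gpu.requestAdapter();
    return adapter ? "webgpu" : "wasm";
  } catch {
    // Treat any failure while probing as "no GPU available".
    return "wasm";
  }
}
```

In a page, the call would look like `pickBackend(navigator as NavigatorLike)`, and the result would then be handed to whichever inference runtime the page uses.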

Question 8

What browsers are supported?

Accepted Answer

Chrome and Edge work best because they support WebGPU acceleration. Firefox and Safari work too but fall back to WebAssembly, which is slower. The first run downloads the model and caches it; subsequent runs load it from the cache and start much faster.
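The download-once, reuse-afterwards behaviour described above follows a common cache-aside pattern, sketched below. The answer does not say which browser storage the site uses, so the store is abstracted behind a hypothetical `ModelStore` interface; in a browser it could be backed by the Cache API or IndexedDB. `loadModelBytes`, `MemoryStore`, and the example URL are likewise illustrative names only.

```typescript
// Hypothetical storage abstraction for downloaded model files.
interface ModelStore {
  get(url: string): Promise<ArrayBuffer | undefined>;
  put(url: string, bytes: ArrayBuffer): Promise<void>;
}

// Function that fetches the model over the network.
type Downloader = (url: string) => Promise<ArrayBuffer>;

// Return cached bytes when present; otherwise download once and store them.
async function loadModelBytes(
  url: string,
  store: ModelStore,
  download: Downloader,
): Promise<ArrayBuffer> {
  const cached = await store.get(url);
  if (cached !== undefined) return cached; // later runs: no network
  const bytes = await download(url); // first run: slow path
  await store.put(url, bytes);
  return bytes;
}

// In-memory stand-in for a persistent store, for demonstration only.
class MemoryStore implements ModelStore {
  private items = new Map<string, ArrayBuffer>();
  async get(url: string): Promise<ArrayBuffer | undefined> {
    return this.items.get(url);
  }
  async put(url: string, bytes: ArrayBuffer): Promise<void> {
    this.items.set(url, bytes);
  }
}
```

Calling `loadModelBytes` twice with the same URL and store triggers the downloader only on the first call, which is why only the first run pays the download cost.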


Why Use Supertonic for Text-to-Speech

31 Languages

10 Preset Voices

WebGPU + WASM

Quality Tuning

How It Works

Open the free tool

Pick a voice and language

Type your text and generate

Who Is Supertonic For?

Multilingual Creators

Language Learners

Developers

Accessibility

Tips for Best Results

Match the language to your text

Tune quality with denoising steps

Use WebGPU where available

Need more? Go further with Voice Creator Pro.

Voice Cloning

600+ Languages

GPU-Accelerated Processing

Voice Design from Text

Local REST API

Commercial License
