Flexible pricing

Start free in the browser, scale up with Cloud, or own it forever.

Yearly pricing applies to Cloud plans. Lifetime is a one-time purchase.

Free

no card required

10,000 tokens / month

≈ up to 8 hrs of audio

Included

Voice cloning & Design
Voice changer
Speech to text
Subtitles generator
Files kept for 30 days

Get started free

Starter

$5/mo

billed monthly

250,000 tokens / month

≈ up to 80 hrs of audio

Everything in Free, plus

25× the monthly tokens
Video dubbing
Audiobook export
No 30-day storage limit

Premium

$20/mo

billed monthly

1,500,000 tokens / month

≈ up to 500 hrs of audio

Everything in Starter, plus

6× Starter's monthly tokens

Best value

Lifetime

from $54.99

One-time: pay once, keep forever

Unlimited generations

no tokens, no monthly cap

Included

Runs fully offline
Voice cloning & Design
Video dubbing & audiobook export
Speech to text & voice changer
Files stay on your device

See desktop options

Enterprise

Custom

Custom volume, security, and support for teams. Tell us what you need and we'll tailor a plan.

Custom token volume
SLA & uptime guarantee
Dedicated account manager
Priority processing

Custom pricing

How much audio will you get on Cloud?

For Cloud plans, estimate your monthly output by tier and model.

Estimate your usage

Choose a tier and a model to see an estimate. Sample result shown: OmniVoice only (voice cloning, voice design), an estimated 51.4 hours.

Availability depends on community GPUs being online. Jobs they do not pick up fall back to Standard pricing.

The desktop app

One-time purchase. Choose your platform, or bundle with Song Creator Pro.

Windows

Starting from $54.99

One-time payment, lifetime access
Unlimited generations
Runs fully offline

CPU: Supported (GPU recommended)

GPU: NVIDIA · AMD · Intel Arc

VRAM: 8 GB min · 12 GB+ recommended

macOS

$59.99

One-time payment, lifetime access
Unlimited generations
Runs fully offline

Mac App Store

$59.99

CPU: Apple Silicon (M1 or later)

RAM: 8 GB minimum

Pair the desktop app with Song Creator Pro (Windows only) for AI music generation.

Buy both for $72.50 on itch.io

Save $27.50 compared to buying separately

Already own Song Creator Pro? Get 50% off Voice Creator Pro.

Available on both itch.io and the Microsoft Store. The discount is applied automatically at checkout.

Compare all options

Own it outright, or scale up in the cloud. Same voice engine underneath.

	Cloud Free ($0)	Cloud Starter ($5 / mo)	Cloud Premium ($20 / mo)	Desktop Lifetime (from $54.99 once)	Cloud Enterprise (Custom)
Plan basics
Payment	Free	Subscription	Subscription	One-time	Custom
Where it runs	Browser	Browser	Browser	Your computer	Browser
Needs a GPU?	No	No	No	Yes	No
Runs fully offline	–	–	–	Yes	–
Monthly quota	10k tokens	250k tokens	1.5M tokens	Unlimited	Custom
File storage	30 days	No limit	No limit	On your device	No limit
Commercial rights
Voice generation
Voice cloning
Voice design
600+ languages
Community voice library
Beyond text-to-speech
Speech to text
Voice changer
Subtitles generator
Long-form audio
Audiobook export
Video dubbing
Teams & enterprise
SLA & uptime guarantee
Dedicated account manager
Priority processing

FAQ

Common questions

No. The desktop app runs entirely offline after installation. Your voice data never leaves your device.

Windows 10+ with a GPU (8 GB VRAM recommended) or macOS with Apple Silicon (M1+). 8 GB RAM minimum, 12 GB+ recommended.

No. The desktop app is a one-time purchase starting at $54.99 that gives you lifetime access. There are no monthly fees or usage limits.
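As a rough illustration of the one-time price against a subscription, the sketch below works out how many months of a Cloud plan add up to the starting Lifetime price. It uses only the prices listed on this page and compares cost alone. It does not account for the GPU the desktop app needs, for differences between plans, or for store-specific pricing, so treat it as a back-of-the-envelope check rather than a recommendation.

```python
import math

# Prices taken from the plan cards on this page (USD).
# LIFETIME_FROM is the lowest listed "from" price; your store price may differ.
LIFETIME_FROM = 54.99
MONTHLY = {"Starter": 5.00, "Premium": 20.00}


def breakeven_months(one_time: float, monthly: float) -> int:
    """Smallest whole number of months whose total subscription cost
    reaches or exceeds the one-time price."""
    return math.ceil(one_time / monthly)


for plan, price in MONTHLY.items():
    months = breakeven_months(LIFETIME_FROM, price)
    print(f"{plan}: {months} months of ${price:.2f}/mo reach ${LIFETIME_FROM}")
```

With the listed prices, that is 11 months of Starter or 3 months of Premium.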

Refunds for the desktop app are handled through the store where you purchased. For Microsoft Store, request through your purchase history. For Mac App Store, use Apple's report a problem page. For itch.io, contact us through the itch.io page.

Tokens are the unit of usage for Cloud. Each generation consumes tokens based on the model used and the length of audio produced. The usage calculator above gives you a rough estimate of how much audio you can generate per month.
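To make the token figures easier to compare, here is a small sketch that turns the plan-card numbers into a price per 1,000 tokens and a price per estimated hour. The token counts and prices come from this page. The hours are the page's "up to" estimates, so the per-hour figure is a best case and the real number depends on the model and job. The function names are made up for this example.

```python
# Figures copied from the Cloud plan cards on this page.
# "hours" is the page's "up to" estimate, not a guarantee.
PLANS = {
    "Starter": {"usd_per_month": 5.00, "tokens": 250_000, "hours": 80},
    "Premium": {"usd_per_month": 20.00, "tokens": 1_500_000, "hours": 500},
}


def usd_per_1k_tokens(plan: str) -> float:
    """Monthly price divided by the monthly token quota, per 1,000 tokens."""
    p = PLANS[plan]
    return p["usd_per_month"] / p["tokens"] * 1000


def usd_per_estimated_hour(plan: str) -> float:
    """Monthly price divided by the 'up to' hours estimate (best case)."""
    p = PLANS[plan]
    return p["usd_per_month"] / p["hours"]


for name in PLANS:
    print(
        f"{name}: ${usd_per_1k_tokens(name):.4f} per 1k tokens, "
        f"${usd_per_estimated_hour(name):.4f} per estimated hour"
    )
```

On these numbers Starter comes to about $0.02 per 1,000 tokens and Premium to about $0.013, so the larger plan is cheaper per token as well as larger overall.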

Cloud runs your jobs one of two ways. Standard uses our always-on cloud GPUs, and it is the amount of audio you can always count on. Peer network routes eligible jobs to community members who share their GPUs. That path is more efficient, so the same tokens stretch to far more audio. The calculator above lets you preview both: Standard is the conservative floor, Peer network is the best case when community capacity is available.

Peer network runs on our distributed compute platform, a network of community GPUs that costs far less to run than always-on cloud servers, and we pass that saving on to you as more audio for the same tokens. Peer capacity is not guaranteed for every job, so any job a community GPU does not pick up falls back to Standard pricing automatically, and you are never charged more than the Standard rate.
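The fallback rule described above can be sketched in a few lines. This is only an illustration of the stated behavior, not our actual billing code: the `Job` fields, the eligibility flag, and the token amounts are all hypothetical, and the only rules taken from this page are that a job no community GPU picks up is charged at the Standard rate and that a job never costs more than the Standard rate.

```python
from dataclasses import dataclass


@dataclass
class Job:
    standard_tokens: int   # hypothetical: tokens this job costs on Standard
    peer_tokens: int       # hypothetical: tokens it would cost on Peer network
    peer_eligible: bool = True  # hypothetical flag; not every job is eligible


def tokens_charged(job: Job, peer_picked_up: bool) -> int:
    """Tokens charged for a job under the fallback rule.

    If the job is eligible and a community GPU picked it up, charge the
    Peer amount, capped at the Standard amount. Otherwise charge Standard.
    """
    if job.peer_eligible and peer_picked_up:
        return min(job.peer_tokens, job.standard_tokens)
    return job.standard_tokens


job = Job(standard_tokens=100, peer_tokens=40)
print(tokens_charged(job, peer_picked_up=True))   # Peer path
print(tokens_charged(job, peer_picked_up=False))  # falls back to Standard
```

The `min` is what encodes "never charged more than the Standard rate": even if a Peer figure came out higher, the Standard amount would be the ceiling.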

Yes. Cloud includes a free tier so you can try it out before committing to a paid plan.

No. Cloud runs entirely in your browser. All processing happens on our servers, so any device with a modern browser will work.

Yes. Enterprise is a custom plan for teams that need higher volume, an SLA, a dedicated account manager, and priority processing. Pricing is tailored to your usage. Tell us what you need on our contact sales page and we'll put a plan together.

You do. Any audio you generate with the desktop app or Cloud is entirely yours. You retain full ownership and rights to all generated content, with no royalties or attribution required.

Yes. Both the desktop app and Cloud include full commercial rights. You can use generated voices in YouTube videos, podcasts, audiobooks, games, apps, and any other commercial projects.

Both the desktop app and Cloud use zero-shot voice cloning, meaning they can replicate a voice from a single short sample. 3 to 10 seconds of clean audio is the sweet spot. Longer samples don't improve quality.
