
Projects

Create audiobooks, podcasts, voiceovers, and other long-form audio content from scripts and documents.

Projects is where you produce long-form audio. Import a document, assign voices to different speakers or sections, fine-tune pacing, and export a finished audio file. The desktop app runs everything locally on your machine with no usage limits, and Projects is also available in your browser with VCP Cloud.

How It Works

1. Import Your Document

Import an EPUB, PDF, DOCX, or plain text file. Voice Creator Pro extracts the text and preserves chapter structure automatically. You can also paste text directly with Ctrl+V (Cmd+V on macOS). See Importing & Exporting for supported formats and options.

2. Assign Voices

Map different voices to different speakers or sections. Use any combination of built-in, cloned, and designed voices. Highlight text within a segment to assign a different voice to just that span. See Voice Assignment for details.

3. Fine-Tune

Click any segment to adjust its generation settings, pacing, and voice. Define custom pronunciations in the Lexicon section so names and unusual terms sound right across the entire project. Configure project-wide defaults in Project Settings.

4. Export

Export to MP3, M4B (audiobook with chapter markers), WAV, or FLAC. See Importing & Exporting for format details.

Features

Multi-Voice Assignment

Assign different voices to different speakers, characters, or sections. Highlight any part of a segment to assign it a different voice from the default, giving you precise control over who speaks what. Create multi-narrator audiobooks and dialogues with distinct voices for each role.
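
One way to picture span-level assignment is as a default voice per segment plus a list of character ranges that override it. The sketch below is a conceptual model only, not Voice Creator Pro's actual data format; `Segment`, `VoiceOverride`, and `resolve_voices` are hypothetical names introduced for illustration.

```python
from __future__ import annotations
from dataclasses import dataclass, field

@dataclass
class VoiceOverride:
    start: int   # inclusive character offset within the segment text
    end: int     # exclusive character offset
    voice: str   # voice used for this span only

@dataclass
class Segment:
    text: str
    default_voice: str
    overrides: list[VoiceOverride] = field(default_factory=list)

def resolve_voices(segment: Segment) -> list[tuple[str, str]]:
    """Split a segment into (voice, text) runs, applying span overrides."""
    runs: list[tuple[str, str]] = []
    cursor = 0
    for ov in sorted(segment.overrides, key=lambda o: o.start):
        # Reject empty, out-of-range, or overlapping spans.
        if ov.start >= ov.end or ov.start < cursor or ov.end > len(segment.text):
            raise ValueError(f"invalid override span {ov.start}-{ov.end}")
        if ov.start > cursor:
            runs.append((segment.default_voice, segment.text[cursor:ov.start]))
        runs.append((ov.voice, segment.text[ov.start:ov.end]))
        cursor = ov.end
    if cursor < len(segment.text):
        runs.append((segment.default_voice, segment.text[cursor:]))
    return runs

# Example: the quoted line goes to a character voice, the rest to the narrator.
text = 'She said, "Run!" and left.'
start = text.index('"Run!"')
segment = Segment(text, "narrator", [VoiceOverride(start, start + 6, "hero")])
for voice, chunk in resolve_voices(segment):
    print(f"{voice}: {chunk!r}")
```

In this model, text outside any highlighted span always falls back to the segment's default voice, which matches the behavior described above.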

Chapter-Aware

Automatically detects chapter boundaries from your source file. Chapters carry through to M4B exports so listeners can navigate by section.

Granular Pacing Controls

Fine-tune pauses between paragraphs and chapters and adjust speaking speed per section for a natural listening experience. For precise timing, you can also highlight any part of a segment and add a pause before it.
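
To see how these controls interact, the sketch below adds up a simple timeline from clip durations, pauses, and speed. It is an illustrative model under stated assumptions, not the app's implementation: `Clip`, `build_timeline`, the default pause lengths, and the idea that a speed of 2.0 halves a clip's duration are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Clip:
    chapter: int                # chapter the clip belongs to
    base_seconds: float         # duration at speed 1.0
    speed: float = 1.0          # assumed: 2.0 plays in half the time
    pause_before: float = 0.0   # extra pause inserted before this clip

def build_timeline(clips, paragraph_pause=0.5, chapter_pause=2.0):
    """Return (start time of each clip, total duration) in seconds."""
    starts = []
    t = 0.0
    prev_chapter = None
    for clip in clips:
        if prev_chapter is not None:
            # Longer gap at a chapter boundary, shorter gap between paragraphs.
            t += chapter_pause if clip.chapter != prev_chapter else paragraph_pause
        t += clip.pause_before
        starts.append(t)
        t += clip.base_seconds / clip.speed
        prev_chapter = clip.chapter
    return starts, t

clips = [
    Clip(chapter=1, base_seconds=10.0),
    Clip(chapter=1, base_seconds=8.0, speed=2.0),
    Clip(chapter=2, base_seconds=5.0, pause_before=1.0),
]
starts, total = build_timeline(clips)
print(starts, total)  # [0.0, 10.5, 17.5] 22.5
```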

Lexicon

Define once how names, places, and unusual terms should be pronounced, and they are spoken consistently across the entire project.
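
Conceptually, a lexicon behaves like a project-wide lookup applied to the text before it is spoken. The sketch below models it as whole-word respelling. This is an assumption for illustration: this page does not say whether Voice Creator Pro uses respellings, phonemes, or something else, and `apply_lexicon` and the sample entries are hypothetical.

```python
import re

def apply_lexicon(text: str, lexicon: dict[str, str]) -> str:
    """Replace each whole-word lexicon term with its respelling (case-sensitive)."""
    if not lexicon:
        return text
    # Longest terms first so multi-word entries win over shorter ones.
    terms = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b")
    return pattern.sub(lambda m: lexicon[m.group(1)], text)

# Hypothetical entries: the respellings are examples, not recommendations.
lexicon = {"Nguyen": "win", "Cairn": "kairn"}
print(apply_lexicon("Nguyen met Dr. Nguyen at Cairn Hill.", lexicon))
```

Because the match is on whole words, a term such as "Cairn" does not alter longer words like "Cairns" in this model.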

Per-Segment and Per-Selection Controls

Override the model, generation parameters, and voice at the segment level or even for a highlighted span of text within a segment.
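
A layered lookup is one way to think about how these overrides combine with project-wide defaults. In the sketch below, a selection-level value wins over a segment-level value, which wins over the project default. That precedence order, the setting names, and the values are assumptions for illustration, not documented behavior.

```python
from collections import ChainMap

def effective_settings(project, segment=None, selection=None):
    """Merge settings; earlier layers in the ChainMap take priority."""
    return dict(ChainMap(selection or {}, segment or {}, project))

# Hypothetical setting names and values.
project_defaults = {"model": "standard", "voice": "narrator", "speed": 1.0}
segment_overrides = {"voice": "hero"}
selection_overrides = {"speed": 0.9}

print(effective_settings(project_defaults, segment_overrides, selection_overrides))
```

Anything not overridden at a narrower level falls through to the project default, so a segment only needs to store the settings that differ.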

Use Cases

Audiobooks

Convert novels, non-fiction, and self-published books into professional audiobooks with chapter markers and multiple narrator voices.

Podcasts

Generate podcast episodes from scripts with different voices for host and guest roles. Export directly to MP3.

E-Learning and Training

Convert manuals, study guides, and training documents into narrated audio that learners can listen to anywhere.

Social Media Voiceovers

Turn scripts into voiceovers for YouTube, TikTok, Instagram, and other platforms.

Documentation and Reports

Turn lengthy documents, research papers, or reports into audio so you can absorb the content while commuting or exercising.

Accessibility

Make written content accessible to people with visual impairments or reading difficulties by converting it to high-quality spoken audio.

Self-Publishing

Self-published authors can create audiobook editions of their work without hiring a narrator or booking studio time.