Introducing Song Creator Pro — create music with AI, locally on your device. Try it now →

Segments

Manage segments in your project - edit text, adjust generation settings, add lexicon entries, and control pacing per segment.

When you import a document, Voice Creator Pro splits it into segments based on paragraph boundaries and your max-characters-per-segment setting. Each segment is an independently configurable unit of audio.

Working with Segments

Click any segment in the project reader to select it. A selected segment reveals its controls, letting you:

  • Edit the segment text inline
  • Assign a voice (or override the default voice)
  • Adjust advanced generation parameters for that segment
  • Play back or regenerate the audio
  • View and pick from previous takes

Editing Text

Click into a segment's text to edit it directly. Changes only affect that segment, so you can fix typos, reword sentences, or tweak phrasing without touching the rest of the project.

Splitting and Merging

  • Split a segment at the cursor position to break a long segment into two
  • Merge adjacent segments to combine them into one

This is useful when the automatic segmentation splits in an awkward place, or when you want finer control over pause timing between paragraphs.

Adding Segments and Sections

You can add new segments and sections directly in the reader without re-importing the source file. Rename chapter titles in place to keep your project organized.


Per-Segment Settings

Each segment can override the project-level generation parameters. This gives you control over how individual segments sound without affecting the rest of the project.

Click the settings icon on a selected segment to access:

  • Voice - Override the assigned voice
  • Generation parameters - Override speed, steps, guidance scale, and other model-specific settings

For a full list of model-specific parameters, see the Voice Cloning docs.


Selection-Level Overrides

For even finer control, highlight a span of text within a segment to adjust settings for just that portion. This lets you:

  • Assign a different voice to a highlighted span (useful for dialogue within narration)
  • Use a different model for varied degree of emotional control
  • Adjust generation parameters for specific words or phrases
  • Add a pause before the highlighted text for precise timing

This is especially powerful for novels with inline dialogue, where you want a character voice for quoted speech but a narrator voice for the surrounding prose.


Takes

When Takes per generation is set above 1 in Project Settings, each generation produces multiple takes for the segment. All takes are stored in the segment's history so you can audition each one and pick the best.

This is useful when you want variety to choose from, or when a particular segment is tricky and you want several attempts to find the best delivery.