Skip to main content

Summary

The Voice Node converts text into natural-sounding speech using AI text-to-speech models. Write or connect a prompt, select a voice, and the node generates an audio output that sounds like a real person speaking. Use it for voiceovers, narration, dialogue, or any workflow that needs spoken audio from text.

How to Use

1

Add the Node

Click the Add (+) button and select Voice from the Audio node category.
2

Write Your Text

Type what you want spoken into the prompt field, or connect a Prompt or AI Copilot node.
3

Select a Voice

Choose a voice from the Voice dropdown (e.g., Roger). Each voice has a distinct tone, pitch, and character.
4

Run

Click Run, and the AI generates an audio file of the selected voice speaking your text.

Choosing the Right Settings

SettingTypeImpact on Output
VoiceDropdown (e.g., Roger)Selects the voice character used for speech generation. Different voices vary in tone, gender, accent, and style.
PromptText InputThe text content that will be converted into speech.
StabilitySlider (0–100%)Controls how consistent the voice sounds across the output.
Similarity BoostSlider (0–100%)Controls how closely the output matches the selected voice.
SpeedSliderAdjusts the speaking pace of the generated audio.
TimestampsCheckboxWhen enabled, returns word-level or sentence-level timestamps alongside the audio output.

Sample Use Cases

Voiceovers for AI-Generated Videos

Generate a voiceover and connect it to a Combine Audio & Video node to add narration to any video in your workflow.

Multilingual Audio Content

Write the same script in multiple languages and generate a voice for each — perfect for localizing video content without hiring voice actors.

Podcast and Audio Previews

Quickly generate audio previews of scripts, blog posts, or ad copy to hear how they sound before recording with a real voice.

Audio Models

Visit Audio Models to explore all available models and find the one that fits your audio needs.