How to Add Audio Nodes
- Click the Add (+) button on the left toolbar in the workflow canvas.
- Select Audio under node categories.
- Choose from the available nodes listed below.
Generation Nodes
These nodes create audio content from text prompts.Generate Audio
Convert text into natural-sounding speech. Write what you want spoken, select a voice character, and generate realistic audio. Supports multiple voices and models including ElevenLabs TTS and Minimax 2.8 HD.
Generate Music
Generate original music tracks from text descriptions or lyrics. Describe the mood, genre, and instruments or write full lyrics and the AI composes a complete track. Powered by ElevenLabs Music.
Combining Nodes
Audio nodes are designed to feed into the rest of your workflow. Here are some common combinations:- Voice → Combine Audio & Video: Generate a voiceover and layer it onto an AI-generated video.
- Music → Combine Audio & Video: Create a custom soundtrack and pair it with a product video or brand reel.
- Sound Effects → Combine Audio & Video: Add ambient sounds or action effects to animated scenes.
- Voice → Lipsync: Generate speech, then use it as the audio input for a Lipsync node to create a talking-head video.
- Text Iterator → Voice: Enter multiple scripts and generate a separate voiceover for each — perfect for multilingual content or personalized outreach.
- Prompt → Music + Prompt → Generate Video → Combine Audio & Video: Create both a video and its soundtrack from text, then merge them into a finished piece.

