Audio Import Guide
How to prepare TTS dialogue and SFX sound effects for AutoFlowCut
🚀 Quick Start 3 Steps
Generate audio files externally, then import the audio package folder into AutoFlowCut.
Generate TTS Audio from Your Script
Use Typecast or ElevenLabs to generate per-character dialogue voices. Organize files by character folder.
Generate SFX Sound Effects
Use ElevenLabs Sound Generation API to create ambient sounds, footsteps, props, and other effects by category.
Import Audio Package in AutoFlowCut
Place all audio files in the media/ folder of your project directory. AutoFlowCut automatically detects and imports the audio package.
💡 Tip: Audio files with timecodes in their filenames are automatically matched to the corresponding SRT subtitle timestamps.
📁 Audio Package Structure
Organize your audio files in the following directory structure inside your project folder:
media/ ├── voices/ # Per-character TTS files │ ├── narrator/ │ │ ├── narrator_001_0000.mp3 │ │ └── narrator_002_0035.mp3 │ └── scholar/ │ └── scholar_001_0120.mp3 ├── sfx/ # Sound effects by category │ ├── 01_props/ │ ├── 02_ambience_wind/ │ ├── 03_breath/ │ ├── 04_footsteps/ │ ├── 05_metal_doors/ │ ├── 06_writing/ │ └── 07_crowd/ └── .audio_review.json # Tracks unsuitable audio
💡 Note: The voices/ folder contains per-character subfolders. The sfx/ folder contains category-based subfolders.
🎙️ TTS (Dialogue Voice)
Per-character dialogue audio generated via Text-to-Speech APIs.
Generation API: Typecast
API: https://api.typecast.ai/v1/text-to-speech
File Naming Convention
{character}_{number}_{MMSS}.mp3
Examples:
narrator/narrator_001_0000.mp3 # narrator, line 1, at 00:00
scholar/scholar_003_0245.mp3 # scholar, line 3, at 02:45
The timecode (MMSS) enables automatic matching with SRT subtitle timestamps.
Emotion Parameters
| Emotion | Description |
|---|---|
normal | Default tone |
happy | Bright / joyful |
sad | Sad / melancholic |
angry | Angry / intense |
🔊 SFX (Sound Effects)
Ambient sounds, foley, and effects generated via AI sound generation.
Generation API: ElevenLabs
API: https://api.elevenlabs.io/v1/sound-generation
7 SFX Categories
01_props/ — Object interactions, clicks02_ambience_*/ — Wind, rain, birds03_breath/ — Breathing, sighing04_footsteps/ — Walking, running05_metal_doors/ — Doors, locks, impacts06_writing/ — Brush, pen strokes07_crowd/ — Murmuring, chatterFile Naming
{category}/{descriptive_name}.mp3
Timecoded SFX (synced to a specific scene):
abacus_beads_dark_01_0015.mp3 # for the 00:15 scene
abacus_beads_01_0134.mp3 # for the 01:34 scene
Timecoded filenames (ending in _MMSS) are automatically matched to corresponding scenes.
🔍 Audio Review System
Flag unsuitable audio files through the app UI or via Claude Code MCP tools. Flagged files are tracked in .audio_review.json.
.audio_review.json Structure
{
"media/sfx/02_ambience_wind/wind_howl_01.mp3": {
"status": "flagged",
"reason": "No timecode",
"flaggedAt": "2026-03-16T05:44:55.228Z"
}
}
MCP Tools
list_audio_reviews
— View list of flagged audio files
update_audio_review
— Add or remove flags
🎧 Audio Tab in App
Once imported, the Audio tab provides a full overview of your audio package.
Voice count per character, SFX count per category, total duration breakdown.
Chronological list with automatic scene matching via timecodes.
Preview any audio file directly in the app with inline player.
Flag unsuitable audio for replacement. Sort by name, duration, or status.