Import Guide

File Import Guide

How to import CSV, SRT, and text files into AutoFlowCut

🚀 Get Started in 5 Minutes Recommended

No complex setup needed. A simple text file is all you need to start!

1

Create a Text File

Write scene descriptions, one per line, in any text editor

A young scholar reading under an oak tree
The scholar walking toward the castle gate
A celebration scene with banners and a joyful crowd
2

Import the File

In AutoFlowCut, click "📂 Import" button → Select "Prompt Text File"

📝 Image 🎬 Video 🎞️ F→V
📋 Scenes (0) 🎵 Audio 🖼️ Ref (0) 📂 Import
← Click here!
3

Generate Images!

Click "Start Generation" button at the bottom and Google Flow AI will automatically create images for each scene

Enabled when scenes exist

💡 Next: Start with a text file. When you need subtitles or consistent characters, use the CSV formats below.

📄 Supported File Types

📝 Plain Text Easiest

One line per scene, simple format

📊 Scene CSV

Scene information with prompts, subtitles, and durations

🖼️ Reference CSV

Character/background/style definitions (for consistency)

🎬 SRT Subtitles Narration Videos

Standard subtitle format, auto-converted to scenes

🔄 Workflow Overview

See how the files connect at a glance

● Basic (Text only)

📝 Text File
→
📋 Scene List
→
🎨 Generate Images
→
📦 CapCut Export

● Narration video (Script → TTS → SRT → Scene CSV)

📜 Script (.txt)
→
🎙️ TTS (mp3 + srt)
→
🤖 AI splits SRT → CSV
→
📊 Scene CSV

💡 duration is auto-calculated from timecodes, the subtitle text becomes subtitle, and the English prompt is generated by AI.

● Advanced (Character consistency)

📊 Scene CSV
characters: scholar
🖼️ Reference CSV
name: scholar
Same tag? ✓ Auto-match! → Consistent character appearance

💡 Summary: When characters, scene_tag, style_tag in Scene CSV match name in Reference CSV, reference images are automatically applied.

📂 Import Modal Preview

This modal appears when you click the "📂 Import" button. Click on the file format you want.

📂 Import File ✕

Select a file format

📝
Prompt Text File 🖼️ Image 🎬 Video
Line-separated prompts
One line = One scene
→
📊
Scene CSV File
Structured scene data
prompt, subtitle, characters...
→
🖼️
Reference CSV File
Character/Background/Style definitions
name, type, prompt
→
📺
Subtitle SRT File
Auto-generate scenes from subtitles
Timecode + subtitle → prompt
→

Help buttons below each option:

📖 Guide 📄 Sample 🤖 AI Prompt

📝 Prompt Text File Easiest

The simplest format - just write one scene description per line. Each line becomes a separate scene with a default duration.

Example

A sunrise rising over mountain peaks
Two travelers walking through a valley
A sudden storm, both running into a cave for shelter
After the rain, a peaceful village seen in the distance

💡 In video mode, including action verbs + camera movement leads to better results.

📊 Scene CSV File

The Scene CSV file defines each scene's content, including prompts for image generation, subtitles, and timing.

🤔 Why use CSV?

Text files are simple, but CSV gives you more control:

  • • Subtitles separate from prompts (different content)
  • • Character/Background/Style tags → Auto-match with reference images (see below)
  • • Per-scene duration settings (3s, 5s, etc.)

💡 When tags match references, you'll see ✓ in the scene list

Required Columns

Column Description Example
prompt Image generation prompt A scholar reading under an oak tree
subtitle Subtitle text for the scene Once upon a time, a young scholar lived in a village.
characters Character names (semicolon-separated) scholar;queen
scene_tag Scene category tag palace, village, journey
style_tag Visual style tag medieval historical drama
duration Scene duration in seconds 5

CSV Example

scene_sample.csv
prompt,subtitle,characters,scene_tag,style_tag,duration
"A young scholar reading under an ancient oak tree, medieval England","Once upon a time, a young scholar lived in a small village.",scholar,village,medieval historical drama,5
"The scholar walking toward a grand castle gate at dawn","The scholar set off for the royal court to prove his knowledge.",scholar,journey,medieval historical drama,4
"Throne room filled with nobles in formal attire","The great hall was filled with dignified lords and ladies.",scholar;officials,palace,medieval historical drama,5
"The scholar writing with a quill, intense focus","The scholar poured all his effort into the royal examination.",scholar,examination,medieval historical drama,4
"A celebration scene with banners and joyful crowd","At last, the scholar earned the king's favor and glory.",scholar;crowd,celebration,medieval historical drama,5

🖼️ Reference CSV File

The Reference CSV file defines characters, backgrounds, and styles that can be used across scenes for consistent visual generation.

✅ No images required!

You can leave image_path empty in the CSV. Add images later using:

  • • Direct upload - Click image area in References tab → Select file
  • • AI generation - Generate reference images with Flow AI based on description
  • • Add later - Set up the structure now, add images anytime

🔗 Tag Matching System (Very Important!)

The name field in Reference CSV automatically matches with tags in Scene CSV. When generating images, matched reference images are uploaded to Flow AI to maintain consistent style/characters.

👤 Character Matching
type: character
name: scholar
↔️ Scene CSV characters column
🏞️ Scene Matching
type: scene
name: palace
↔️ Scene CSV scene_tag column
🎨 Style Matching
type: style
name: medieval historical drama
↔️ Scene CSV style_tag column

Required Columns

Column Description Example
type Reference type (character / scene / style) character
name Name for tag matching (must match Scene CSV) scholar, palace, medieval historical drama
image_path Path or URL to reference image (optional) ./images/scholar.png
description Detailed description (for image generation) Young scholar in dark academic robes

CSV Example

reference_sample.csv
type,name,image_path,description
character,scholar,,"A young medieval English scholar in dark academic robes and white linen shirt, neatly combed hair, gentle and determined expression"
character,officials,,"Medieval English court officials in long dark robes with chain-of-office necklaces, dignified bearing, stern expressions"
character,crowd,,"Medieval English townsfolk in simple tunics and dresses, joyful expressions, clapping and cheering"
scene,village,,"Traditional medieval English village with thatched-roof cottages, stone walls, apple orchards, rolling hills backdrop"
scene,journey,,"Winding dirt road through medieval English countryside at dawn, rolling green hills, distant castle on the horizon"
scene,palace,,"Grand medieval castle interior with stone pillars, heraldic banners, vaulted ceiling, torch-lit corridors, ornate tapestries"
scene,examination,,"Medieval castle study hall with wooden desks, quill pens, parchment scrolls, candlelight, gothic windows"
scene,celebration,,"Medieval castle courtyard decorated with colorful banners, flower garlands, festive crowd, golden sunlight"
style,medieval historical drama,,"Medieval European historical drama style, Renaissance influences, cinematic composition, warm color palette, detailed period costumes"

🎬 Subtitle SRT File

Import standard SRT subtitle files. Each subtitle block is automatically converted to a scene with the subtitle text used as both the prompt and subtitle. Duration is calculated from the timecodes.

SRT Structure

sample-subtitles.srt
1
00:00:00,000 --> 00:00:03,204
Once upon a time, a young scholar lived in a small village.

2
00:00:03,504 --> 00:00:06,848
Every day he studied, reading books under the old oak tree.

3
00:00:07,148 --> 00:00:11,142
One day, he decided to travel to the royal court for the examination.

4
00:00:11,442 --> 00:00:15,204
After a long journey, the scholar finally arrived at the castle.

5
00:00:15,504 --> 00:00:18,383
Standing before the palace, he marveled at its grandeur.

Each numbered block becomes one scene. The timecode determines scene duration automatically.

🎙️ Auto-generate SRT from TTS Services

These TTS (Text-to-Speech) services automatically generate SRT subtitle files along with the audio. Get your narration and subtitles ready to import directly into AutoFlowCut.

💡 Tip: After generating audio with a TTS service, download the SRT file and import it into AutoFlowCut for perfectly timed scenes.

📝 Auto-generate Text Prompt List with AI

Paste the prompt below into Claude, ChatGPT, Gemini, or other AI tools along with your story to automatically generate a simple prompt list (.txt) with one scene per line. Pick the tab that matches your video format.

text_prompt_short_en.txt
You are a scene prompt list assistant for AutoFlowCut short-form video production.

Given a topic and video length (15s/30s/60s), output a .txt prompt list where each line is a single visual scene in English.

Rules:
1. One line = one visual scene (single image frame)
2. Write in English; include composition, lighting, mood, camera angle
3. Keep each prompt under 200 characters
4. The first scene MUST be a "Hook" — a striking image that sparks curiosity within 0~3 seconds
   (hint of the ending, mystery, dramatic contrast, visual shock, etc.)
5. Scene order: Hook → Build-up → Twist/Impact → Resolution
6. Scene count by length:
   - 15s → 4~6 scenes
   - 30s → 7~10 scenes
   - 60s → 10~15 scenes
7. Fast pacing, no breathing room
8. No blank lines between prompts
9. Output only the prompts — no numbering, no commentary

Example output (30s, 8 scenes):
A lone figure stands on a cliff at dawn, silhouette against blood-red sky, dramatic wide shot
A young scholar reading old scrolls in candlelight, close-up on focused eyes
A sudden gust scatters pages into the storm, dynamic motion blur
The scholar chases papers through rain-soaked streets, over-shoulder shot
A hidden door revealed behind ivy, mysterious green glow, low angle
Eyes widen as scholar steps inside, reaction shot, soft torchlight
Ancient chamber unveils a glowing artifact, symmetrical wide shot, golden rays
The scholar raises the artifact against sunrise, triumphant silhouette, epic composition

Now create a prompt list for this topic (length: pick one of [15s/30s/60s]):

🤖 Auto-generate CSV with AI

Paste the prompts below into Claude, ChatGPT, Gemini, or other AI tools along with your story to automatically generate CSV files.

📊 Scene CSV Generation Prompt

scene_csv_prompt_en.txt
You are a scene breakdown assistant for AutoFlowCut video production.

Given a story or script, create a Scene CSV file with these columns:
- prompt: English scene description for AI image generation (include composition, lighting, mood, camera angle)
- subtitle: Subtitle text for the scene (concise, under 50 chars)
- characters: Character names in the scene (semicolon-separated if multiple, e.g., scholar;queen)
- scene_tag: Background/location tag (use consistent tags like palace, forest, village)
- style_tag: Art style (keep consistent, e.g., medieval historical drama, ghibli, cinematic)
- duration: Scene duration in seconds (3-5 for normal, 2 for quick cuts, 6+ for dramatic)

Rules:
1. Each row = one visual scene (single image frame)
2. Keep prompts descriptive but under 200 characters
3. Use consistent character names throughout (these match reference images)
4. Group similar locations under one scene_tag
5. Consider pacing: vary duration for dramatic effect
6. Output as CSV with header row, quote values containing commas

Example output:
prompt,subtitle,characters,scene_tag,style_tag,duration
"A young scholar reading under an ancient oak tree, medieval England","Once upon a time, a young scholar lived in a small village.",scholar,village,medieval historical drama,5

Now break down this story into scenes:

🎬 SRT → Scene CSV Conversion Prompt Narration video

Paste an SRT file (subtitles + timecodes) from your TTS service into AI to auto-convert it into a scene CSV with timecode-based duration and English prompt columns.

srt_to_scene_csv_prompt_en.txt
You are an SRT → Scene CSV conversion assistant for AutoFlowCut narration video production.

Given an SRT subtitle file, convert each subtitle block into a single scene row in a CSV.

Input format (SRT):
1
00:00:00,000 --> 00:00:05,000
Subtitle text...

2
00:00:05,000 --> 00:00:09,000
Next subtitle text...

Output columns:
- prompt: English image prompt that visualizes the subtitle (composition, lighting, mood, camera angle)
- subtitle: Original subtitle text from the SRT (do NOT modify)
- characters: Character names mentioned (semicolon-separated, e.g., king;queen)
- scene_tag: Background/location tag (use consistent tags, e.g., palace, forest, village)
- style_tag: Art style (keep consistent across the whole video, e.g., cinematic, ghibli)
- duration: Auto-calculate from timecodes (end − start, 1 decimal place)

Rules:
1. 1 SRT block = 1 CSV row (strict 1:1 mapping, no merging or skipping)
2. duration MUST be computed from the SRT timecodes (e.g., 00:00:05,000 → 00:00:09,000 = 4.0)
3. subtitle must match the SRT text verbatim (no translation or summarization)
4. prompt is English, visualizes the subtitle, max 200 chars
5. Use consistent character/location names (for reference matching)
6. Include a header row; quote values that contain commas

Example output:
prompt,subtitle,characters,scene_tag,style_tag,duration
"A young scholar reading under an ancient oak tree, soft morning light","Once upon a time, a young scholar lived in a small village.",scholar,village,medieval historical drama,5.0
"The scholar walking toward a grand palace gate at dawn, wide shot","The scholar set off for the capital to take the imperial exam.",scholar,journey,medieval historical drama,4.0

Now convert the SRT below into a scene CSV:
[Paste SRT content here]

💡 End-to-end flow: Script (.txt) → generate mp3+srt with a TTS service → use the prompt above to convert SRT into a scene CSV → import into AutoFlowCut → generate images. Timing stays perfectly aligned with the narration.

🖼️ Reference CSV Generation Prompt

reference_csv_prompt_en.txt
You are a reference planning assistant for AutoFlowCut.

Given a story, create a Reference CSV with these columns:
- type: Reference type (character, scene, or style)
- name: Name for tag matching (MUST match scene CSV tags exactly)
- image_path: Leave empty (images will be generated by Flow AI)
- description: Detailed description for generating the reference image

For characters: List all unique characters with their visual appearance
For scenes: List all unique locations/settings mentioned
For styles: Suggest 1-2 art styles that fit the story mood

IMPORTANT: The 'name' field must match exactly with:
- character type → matches 'characters' column in scene CSV
- scene type → matches 'scene_tag' column in scene CSV
- style type → matches 'style_tag' column in scene CSV

Example output:
type,name,image_path,description
character,scholar,,"Young medieval English scholar in dark academic robes with neatly combed hair"
scene,palace,,"Grand medieval castle throne room with stone pillars and heraldic banners"
style,medieval historical drama,,"Medieval European historical drama style, cinematic composition, warm tones"

Now create the reference list for this story:

💡 AI Tool Tips

🟠 Claude

Great for long stories. Use Artifacts feature to download CSV directly.

🟢 ChatGPT

Use Code Interpreter to create and download CSV. Preview as table.

🔵 Gemini

Works with Google Docs. Easy export to spreadsheets.

🚀 How to Import Files

1

Open AutoFlowCut desktop app

Launch AutoFlowCut and sign in with your Google account

2

Click the Import button

Located in the sidebar or toolbar

3

Select your file

Choose a CSV, SRT, or TXT file from your computer

4

Review and generate

Check the imported scenes and start generating images with Google Flow AI

📥 Download Sample Files

Download these sample files to get started quickly: