Import Guide

File Import Guide

How to import CSV, SRT, and text files into Whisk2CapCut

πŸš€ Get Started in 5 Minutes Recommended

No complex setup needed. A simple text file is all you need to start!

1

Create a Text File

Write scene descriptions, one per line, in any text editor

Sunrise over mountain peaks
A traveler walking through the forest
The hero arrives at the village
2

Import the File

In Whisk2CapCut, click "πŸ“‚ Import" button β†’ Select "Prompt Text File"

πŸ“ Prompts πŸ“‹ Scenes (0)
πŸ–ΌοΈ Ref (0) πŸ“‚ Import
← Click here!
3

Generate Images!

Click "Start Generation" button at the bottom and Whisk will automatically create images for each scene

Enabled when scenes exist

πŸ’‘ Next: Start with a text file. When you need subtitles or consistent characters, use the CSV formats below.

πŸ“„ Supported File Types

πŸ“ Plain Text Easiest

One line per scene, simple format

πŸ“Š Scene CSV

Scene information with prompts, subtitles, and durations

πŸ–ΌοΈ Reference CSV

Character/background/style definitions (for consistency)

🎬 SRT Subtitles Narration Videos

Standard subtitle format, auto-converted to scenes

πŸ”„ Workflow Overview

See how the files connect at a glance

● Basic (Text only)

πŸ“ Text File
β†’
πŸ“‹ Scene List
β†’
🎨 Generate Images
β†’
πŸ“¦ CapCut Export

● Advanced (Character consistency)

πŸ“Š Scene CSV
characters: king
πŸ–ΌοΈ Reference CSV
name: king
Same tag? βœ“ Auto-match! β†’ Consistent character appearance

πŸ’‘ Summary: When characters, scene_tag, style_tag in Scene CSV match name in Reference CSV, reference images are automatically applied.

πŸ“‚ Import Modal Preview

This modal appears when you click the "πŸ“‚ Import" button. Click on the file format you want.

πŸ“‚ Import File βœ•

Select a file format

πŸ“
Prompt Text File
Line-separated prompts
One line = One scene
β†’
πŸ“Š
Scene CSV File
Structured scene data
prompt, subtitle, characters...
β†’
πŸ–ΌοΈ
Reference CSV File
Character/Background/Style definitions
name, type, prompt
β†’
πŸ“Ί
Subtitle SRT File
Auto-generate scenes from subtitles
Timecode + subtitle β†’ prompt
β†’

Help buttons below each option:

πŸ“– Guide πŸ“„ Sample πŸ€– AI Prompt

πŸ“ Prompt Text File Easiest

The simplest format - just write one scene description per line. Each line becomes a separate scene with a default duration.

Example

A sunrise over mountain peaks
Two travelers walking through a valley
A campfire under the stars
The journey's end at a peaceful village

πŸ“Š Scene CSV File

The Scene CSV file defines each scene's content, including prompts for image generation, subtitles, and timing.

πŸ€” Why use CSV?

Text files are simple, but CSV gives you more control:

  • β€’ Subtitles separate from prompts (different content)
  • β€’ Character/Background/Style tags β†’ Auto-match with reference images (see below)
  • β€’ Per-scene duration settings (3s, 5s, etc.)

πŸ’‘ When tags match references, you'll see βœ“ in the scene list

Required Columns

Column Description Example
prompt Image generation prompt A hero stands on a cliff at sunset
subtitle Subtitle text for the scene The journey begins here.
characters Character names (comma-separated) Hero, Mentor
scene_tag Scene category tag opening, action, dialogue
style_tag Visual style tag cinematic, anime, realistic
duration Scene duration in seconds 5

CSV Example

scene_sample.csv
prompt,subtitle,characters,scene_tag,style_tag,duration
"A wise old king sits on golden throne","The wise old king sits on his golden throne",King,palace,cinematic,5
"Beautiful queen enters through doors","The beautiful queen enters through grand doors",Queen,palace,cinematic,4
"King and queen discuss matters","The king and queen discuss important matters",King;Queen,palace,cinematic,5

πŸ–ΌοΈ Reference CSV File

The Reference CSV file defines characters, backgrounds, and styles that can be used across scenes for consistent visual generation.

βœ… No images required!

You can leave image_path empty in the CSV. Add images later using:

  • β€’ Direct upload - Click image area in References tab β†’ Select file
  • β€’ AI generation - Generate reference images in Whisk based on description
  • β€’ Add later - Set up the structure now, add images anytime

πŸ”— Tag Matching System (Very Important!)

The name field in Reference CSV automatically matches with tags in Scene CSV. When generating images, matched reference images are uploaded to Whisk to maintain consistent style/characters.

πŸ‘€ Character Matching
type: character
name: King
↔️ Scene CSV characters column
🏞️ Background Matching
type: background
name: palace
↔️ Scene CSV scene_tag column
🎨 Style Matching
type: style
name: cinematic
↔️ Scene CSV style_tag column
πŸ“‹ Scene List Preview Tag matching indicator
#
Prompt / Subtitle
Matching Tags
Image
1
A wise old king sits on golden throne
πŸ’¬ The wise king on his throne
βœ“
βœ“
βœ“
⏳
2
Beautiful queen enters the room
πŸ’¬ The queen arrives
βœ“
βœ“
βœ—
⏳
Reference match: βœ“ Yes βœ— No

Required Columns

Column Description Example
type Reference type (character / background / style) character
name Name for tag matching (must match Scene CSV) King, palace, cinematic
image_path Path or URL to reference image ./images/king.png
description Detailed description (for image generation) Wise old king with white beard

πŸ–ΌοΈ Plugin Reference Panel Preview

This is the Reference panel displayed when you click "πŸ–ΌοΈ Ref" button. When you import a CSV, cards are created here.

β–Ό πŸ–ΌοΈ Reference Images (3)
βœ•
πŸ‘‘ βœ…
King πŸ“
βœ•
πŸ‘€ Click to upload
Queen πŸ“
βœ•
🏞️ Click to upload
Palace πŸ“
+ Add

πŸ’‘ Cards with prompt but no image can generate AI images with "🎨 Generate" button

CSV Example

reference_sample.csv
type,name,image_path,description
character,King,./images/king.png,Wise old king with white beard and golden crown
character,Queen,./images/queen.png,Elegant queen in red dress
background,palace,./images/palace.png,Grand royal palace interior with ornate decorations
style,cinematic,./images/cinematic.png,Film-like dramatic lighting

🎨 Style Tag List 87 styles

Available style IDs for the style_tag column. These can also be used as the name for style type in Reference CSV.

πŸ’‘ Click a style tag to copy it.

🎬 Subtitle SRT File

Import standard SRT subtitle files. Each subtitle block is automatically converted to a scene with the subtitle text used as both the prompt and subtitle. Duration is calculated from the timecodes.

SRT Structure

sample.srt
1
00:00:00,000 --> 00:00:05,000
The hero awakens in a mystical forest.

2
00:00:05,000 --> 00:00:09,500
Strange lights guide the way forward.

3
00:00:09,500 --> 00:00:14,000
A mysterious figure appears in the distance.

Each numbered block becomes one scene. The timecode determines scene duration automatically.

πŸŽ™οΈ Auto-generate SRT from TTS Services

These TTS (Text-to-Speech) services automatically generate SRT subtitle files along with the audio. Get your narration and subtitles ready to import directly into Whisk2CapCut.

πŸ’‘ Tip: After generating audio with a TTS service, download the SRT file and import it into Whisk2CapCut for perfectly timed scenes.

πŸ€– Auto-generate CSV with AI

Paste the prompts below into Claude, ChatGPT, Gemini, or other AI tools along with your story to automatically generate CSV files.

πŸ“Š Scene CSV Generation Prompt

scene_csv_prompt_en.txt
You are a scene breakdown assistant for Whisk2CapCut video production.

Given a story or script, create a Scene CSV file with these columns:
- prompt: English scene description for AI image generation (include composition, lighting, mood, camera angle)
- subtitle: English subtitle text for the scene (concise, under 50 chars)
- characters: Character names in the scene (semicolon-separated if multiple, e.g., King;Queen)
- scene_tag: Background/location tag (use consistent tags like palace, forest, village)
- style_tag: Art style (keep consistent, e.g., cinematic, ghibli, ink-wash)
- duration: Scene duration in seconds (3-5 for normal, 2 for quick cuts, 6+ for dramatic)

Rules:
1. Each row = one visual scene (single image frame)
2. Keep prompts descriptive but under 200 characters
3. Use consistent character names throughout (these match reference images)
4. Group similar locations under one scene_tag
5. Consider pacing: vary duration for dramatic effect
6. Output as CSV with header row, quote values containing commas

Example output:
prompt,subtitle,characters,scene_tag,style_tag,duration
"A wise old king sits on golden throne, dramatic lighting","The wise king sits on his throne",King,palace,cinematic,5

Now break down this story into scenes:

πŸ–ΌοΈ Reference CSV Generation Prompt

reference_csv_prompt_en.txt
You are a reference planning assistant for Whisk2CapCut.

Given a story, create a Reference CSV with these columns:
- type: Reference type (character, background, or style)
- name: Name for tag matching (MUST match scene CSV tags exactly)
- image_path: Suggested image filename
- description: Detailed description for generating/finding the reference image

For characters: List all unique characters with their visual appearance
For backgrounds: List all unique locations/settings mentioned
For styles: Suggest 1-2 art styles that fit the story mood

IMPORTANT: The 'name' field must match exactly with:
- character type β†’ matches 'characters' column in scene CSV
- background type β†’ matches 'scene_tag' column in scene CSV
- style type β†’ matches 'style_tag' column in scene CSV

Example output:
type,name,image_path,description
character,King,./images/king.png,"Wise old king with white beard and golden crown"
background,palace,./images/palace.png,"Grand palace throne room with red carpets"
style,cinematic,./images/cinematic.png,"Dramatic movie lighting with depth"

Now create the reference list for this story:

πŸ’‘ AI Tool Tips

🟠 Claude

Great for long stories. Use Artifacts feature to download CSV directly.

🟒 ChatGPT

Use Code Interpreter to create and download CSV. Preview as table.

πŸ”΅ Gemini

Works with Google Docs. Easy export to spreadsheets.

πŸš€ How to Import Files

1

Open Whisk2CapCut extension

Click the extension icon in your Chrome toolbar

2

Click the Import button

Located in the sidebar or toolbar

3

Select your file

Choose a CSV, SRT, or TXT file from your computer

4

Review and generate

Check the imported scenes and start generating images

πŸ“₯ Download Sample Files

Download these sample files to get started quickly:

πŸ“š Related Guides

Export to CapCut Guide