Whisk2CapCut Tutorial: From AI Images to CapCut Video in One Click
240 Images. One by One. Three All-Nighters.
Picture this: you've written the perfect script for your AI-generated YouTube video. You've got your voiceover, your subtitles, your scene breakdowns. Now all you need is 240 AI images, one for every scene.
So you open Google Whisk. Generate an image. Download it. Generate the next one. Download it. Drag it into CapCut. Adjust the timing. Add a subtitle. Repeat.
After three all-nighters doing exactly this, it hit me: this isn't something humans should be doing.
There had to be a better way. So I built one.
Meet Whisk2CapCut: Generate + Export in One Click
Whisk2CapCut is a desktop app that connects Google Whisk's AI image generation directly to CapCut's video editor. Instead of generating images one at a time and manually assembling your project, Whisk2CapCut lets you:
- Bulk generate 200+ consistent AI images from your prompts
- Automatically export everything into a ready-to-edit CapCut project, complete with timeline, subtitles, and Ken Burns animations
No more copy-pasting prompts. No more downloading files one by one. No more dragging 200 images into a timeline at 3 AM.
Complete Video Creation Workflow
Creating an AI-generated video from scratch involves several steps. Here's the full workflow, and where Whisk2CapCut fits in:
Step 1: Write Your Script
Start with your story, tutorial, or narration. Whether it's a fairy tale, a motivational short, or an educational explainer, write out the full script first.
Step 2: Generate Voiceover and Subtitles
Use a text-to-speech (TTS) tool to turn your script into audio. Most TTS tools will also generate an SRT subtitle file with precise timestamps. This is key: those timestamps define your scenes.
Step 3: Split Scenes from SRT Timestamps
Each subtitle line in your SRT file represents a scene. Whisk2CapCut reads these timestamps to understand how many images you need and how long each one should display.
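To make the idea concrete, here is a minimal Python sketch of how SRT timestamps can be turned into scene durations. It is an illustration only, not Whisk2CapCut's actual code: the `Scene` class, the `parse_srt` function, and the sample subtitles are all made up for this example.

```python
import re
from dataclasses import dataclass

# Matches an SRT timing line such as "00:00:01,000 --> 00:00:04,500".
TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*"
    r"(\d{2}):(\d{2}):(\d{2})[,.](\d{3})"
)


@dataclass
class Scene:
    """One subtitle cue, treated as one scene (hypothetical structure)."""
    index: int
    start: float  # seconds
    end: float    # seconds
    text: str

    @property
    def duration(self) -> float:
        return self.end - self.start


def _seconds(h: str, m: str, s: str, ms: str) -> float:
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000


def parse_srt(srt_text: str) -> list[Scene]:
    """Split SRT text into cues and read each cue's start and end time."""
    scenes: list[Scene] = []
    normalized = srt_text.replace("\r\n", "\n").strip()
    # Cues are separated by blank lines.
    for block in re.split(r"\n\s*\n", normalized):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        for i, line in enumerate(lines):
            match = TIMING.search(line)
            if match:
                g = match.groups()
                text = " ".join(lines[i + 1:])
                scenes.append(
                    Scene(len(scenes) + 1, _seconds(*g[:4]), _seconds(*g[4:]), text)
                )
                break
    return scenes


SAMPLE = """1
00:00:00,000 --> 00:00:03,500
Once upon a time, a fox lived in the forest.

2
00:00:03,500 --> 00:00:07,000
Every morning she watched the sunrise.
"""

scenes = parse_srt(SAMPLE)
for scene in scenes:
    print(scene.index, scene.duration, scene.text)
```

The number of cues gives you the image count, and each cue's duration gives you how long that image stays on screen.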
Step 4: Write Image Prompts for Each Scene
For every scene, write a prompt describing the image you want. You can do this manually or use AI to help generate prompts from your script. Save them as a TXT, CSV, or SRT file.
Step 5: Bulk Generate Images + Export to CapCut
This is where Whisk2CapCut takes over. Load your prompt file, hit generate, and the app will:
- Send each prompt to Google Whisk
- Generate all your images in bulk
- Maintain visual consistency using tag matching (character, background, style)
- Auto-save every image as it's generated
- Export everything into a CapCut project file with timeline placement, Ken Burns pan-and-zoom effects, and subtitle tracks
Step 6: Final Edit in CapCut and Publish
Open the exported project in CapCut. Your images are already on the timeline, synced to your subtitles, with smooth animations applied. Add your voiceover, make any final tweaks, and publish.
What used to take three all-nighters now takes under an hour.
How to Use Whisk2CapCut (Step by Step)
1. Prepare Your Prompts
Create a file with your image prompts. Whisk2CapCut supports three formats:
- TXT: one prompt per line, simple and straightforward
- CSV: structured format with columns for prompt, character tags, background tags, and style tags
- SRT: subtitle format with timestamps, so the app knows the exact duration for each scene
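If you want to sanity-check a prompt file before loading it, a short Python script can help. The sketch below reads the TXT and CSV layouts described above. The CSV column names (`prompt`, `character`, `background`, `style`) are assumptions made for this example; check the app for the exact headers it expects.

```python
import csv
import io


def load_txt_prompts(text: str) -> list[dict]:
    """One prompt per non-empty line."""
    return [{"prompt": line.strip()} for line in text.splitlines() if line.strip()]


def load_csv_prompts(text: str) -> list[dict]:
    """Read prompts plus tag columns. Column names here are hypothetical."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        rows.append({
            "prompt": (row.get("prompt") or "").strip(),
            "character": (row.get("character") or "").strip(),
            "background": (row.get("background") or "").strip(),
            "style": (row.get("style") or "").strip(),
        })
    return rows


TXT_SAMPLE = """A fox watches the sunrise
The fox crosses a river
"""

CSV_SAMPLE = """prompt,character,background,style
"A fox watches the sunrise",fox,forest,watercolor
"The fox crosses a river",fox,river,watercolor
"""

txt_rows = load_txt_prompts(TXT_SAMPLE)
csv_rows = load_csv_prompts(CSV_SAMPLE)
print(len(txt_rows), "TXT prompts;", len(csv_rows), "CSV prompts")
print(csv_rows[0])
```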
2. Configure Your Tags
Whisk2CapCut uses an auto tag matching system for visual consistency. Set your character reference, background style, and art style once, and the app applies them across all generated images. This is how you get 200 images that actually look like they belong in the same video.
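The app's internal matching logic isn't documented here, but the general idea can be sketched in a few lines of Python. This is one plausible approach under stated assumptions, not the real implementation: `TagLibrary`, `match_tags`, the keyword-in-prompt matching rule, and the reference file paths are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class TagLibrary:
    """References configured once and reused for every prompt (hypothetical)."""
    characters: dict[str, str] = field(default_factory=dict)   # tag -> reference
    backgrounds: dict[str, str] = field(default_factory=dict)  # tag -> reference
    style: str = ""                                            # one style for all


def _first_match(prompt: str, refs: dict[str, str]) -> Optional[str]:
    """Return the reference of the first tag that appears in the prompt."""
    lowered = prompt.lower()
    for tag, ref in refs.items():
        if tag.lower() in lowered:
            return ref
    return None


def match_tags(prompt: str, library: TagLibrary) -> dict:
    """Attach character, background, and style references to one prompt."""
    return {
        "prompt": prompt,
        "character": _first_match(prompt, library.characters),
        "background": _first_match(prompt, library.backgrounds),
        "style": library.style,
    }


library = TagLibrary(
    characters={"fox": "refs/fox.png"},
    backgrounds={"forest": "refs/forest.png", "river": "refs/river.png"},
    style="refs/watercolor.png",
)

job = match_tags("The fox crosses a river", library)
print(job)
```

Because every prompt that mentions the same tag gets the same reference, the character and setting stay consistent from scene to scene.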
3. Generate Images in Bulk
Click generate, and Whisk2CapCut sends your prompts to Google Whisk one by one, automatically. It handles the waiting, the downloading, and the organizing. Every image is auto-saved to your local drive as it's generated.
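Conceptually, this step is a loop that generates one image per prompt and writes it to disk right away. Here is a rough Python sketch of that pattern. The `generate` callable is a stand-in for the real image request (no actual Whisk API is shown or implied), and the `scene_001.png` naming scheme and the skip-if-already-saved behavior are assumptions added for illustration.

```python
import tempfile
from pathlib import Path
from typing import Callable


def generate_all(
    prompts: list[str],
    generate: Callable[[str], bytes],
    out_dir: Path,
) -> list[Path]:
    """Generate one image per prompt and save each one immediately."""
    out_dir.mkdir(parents=True, exist_ok=True)
    saved: list[Path] = []
    for number, prompt in enumerate(prompts, start=1):
        target = out_dir / f"scene_{number:03d}.png"  # hypothetical naming
        # Skip files that already exist so an interrupted run can resume.
        if not target.exists():
            target.write_bytes(generate(prompt))
        saved.append(target)
    return saved


def fake_generate(prompt: str) -> bytes:
    """Placeholder generator: returns the prompt text as bytes, not an image."""
    return prompt.encode("utf-8")


out_dir = Path(tempfile.mkdtemp()) / "images"
paths = generate_all(
    ["A fox watches the sunrise", "The fox crosses a river"],
    fake_generate,
    out_dir,
)
print([p.name for p in paths])
```

Saving inside the loop, rather than after it, is what protects finished images if a long batch is interrupted partway through.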
4. Export to CapCut
Once generation is complete, hit the export button. Whisk2CapCut creates a .capcut project file that includes:
- All your images placed on the timeline
- Duration matched to your SRT timestamps
- Ken Burns animations (pan and zoom) applied to each image
- Subtitles automatically added to the subtitle track
Open it in CapCut, and you're ready to edit.
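If you're curious what a Ken Burns effect does under the hood, it comes down to moving and resizing a crop window over the still image as the scene plays. The Python sketch below computes that crop window using simple linear interpolation. It illustrates the general technique only; the function name, parameters, and default zoom values are made up for this example and say nothing about how Whisk2CapCut or CapCut store animations.

```python
def ken_burns_crop(
    width: float,
    height: float,
    t: float,
    start_zoom: float = 1.0,
    end_zoom: float = 1.2,
    start_center: tuple[float, float] = (0.5, 0.5),
    end_center: tuple[float, float] = (0.5, 0.5),
) -> tuple[float, float, float, float]:
    """Return (left, top, crop_width, crop_height) at progress t in [0, 1].

    Zoom values must be >= 1.0. Centers are fractions of the image size,
    so (0.5, 0.5) is the middle of the image.
    """
    t = min(max(t, 0.0), 1.0)
    zoom = start_zoom + (end_zoom - start_zoom) * t
    cx = (start_center[0] + (end_center[0] - start_center[0]) * t) * width
    cy = (start_center[1] + (end_center[1] - start_center[1]) * t) * height
    crop_w = width / zoom
    crop_h = height / zoom
    # Clamp so the crop window never leaves the image.
    left = min(max(cx - crop_w / 2, 0.0), width - crop_w)
    top = min(max(cy - crop_h / 2, 0.0), height - crop_h)
    return left, top, crop_w, crop_h


# A slow push-in on a 1920x1080 image, sampled at five points in the scene.
for step in range(5):
    progress = step / 4
    print(progress, ken_burns_crop(1920, 1080, progress))
```

Scaling each crop window back up to the full frame size produces the zoom, and changing the center between start and end produces the pan.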
Key Features
- Bulk AI Image Generation: Generate 200+ images from a single prompt file. No more one-at-a-time downloads.
- Visual Continuity: Auto tag matching keeps your characters, backgrounds, and art style consistent across every image.
- Auto-Save: Every image is saved to your local drive the moment it's generated. No lost work, ever.
- One-Click CapCut Export: Instantly create a CapCut project with your images placed on the timeline and ready to edit.
- Ken Burns Animations: Automatic pan-and-zoom effects bring your still images to life without any manual keyframing.
- Auto Subtitles: Subtitles from your SRT file are automatically placed on the subtitle track, synced to your scenes.
Who Is This For?
Whisk2CapCut is built for anyone who creates video content from AI-generated images:
- AI Content Creators: If you're building videos with AI art, this tool eliminates the most tedious part of your workflow.
- Faceless YouTube Channels: Running a channel where AI images tell the story? This is your production pipeline.
- Storytellers and Authors: Turn your stories into visual content without filming a single frame.
- Shorts and Reels Creators: Rapidly produce short-form vertical content with consistent AI visuals.
- Educators and Explainers: Create illustrated tutorials and educational content at scale.
Pricing
Free Tier: Unlimited image generation with 5 CapCut exports every 7 days. Perfect for trying the tool and small projects.
Pro Plan: $4.99/month or $39.99/year. Unlimited exports, priority generation, and full access to all features. That's less than the cost of one stock image subscription.
Get Started
Whisk2CapCut is available on multiple platforms:
- Windows: Download from Microsoft Store
- macOS: Download from GitHub Releases
- Chrome Extension: Install from Chrome Web Store
Stop dragging images one by one. Stop pulling all-nighters for something a tool can do in minutes. Download Whisk2CapCut and turn your AI prompts into a finished CapCut project in one click.
