The storyboard is the pre-production review step between script generation and final video rendering. You see every scene laid out visually -- image, voiceover text, timing, mood -- and decide what stays, what gets regenerated, and what gets edited before committing to the full render.
When you create a video in ILLIXIS, the AI first generates a script with scene breakdowns, narration text, visual descriptions, and music direction. The storyboard takes that script and produces the actual assets you will hear and see in the final video: scene images, a voiceover audio track, and background music.
The storyboard exists so you never produce a video blind. You review each element, approve what works, regenerate what does not, and edit narration text directly. Only after you approve every scene does the "Generate Video" button unlock. This prevents wasted render time and cost on a video you would have rejected.
The storyboard is audio-first. Voiceover is generated from your narration text using AI voice generation, and each scene's video clip duration adapts to the actual spoken length of that scene's narration. This means what you hear in the storyboard preview is exactly what plays in the final video -- no speed manipulation, no compression artifacts.
The full pipeline from idea to finished video follows this sequence:
| Step | What Happens | Time |
|------|-------------|------|
| 1. Create Video | Choose platform, duration, ad type, and topic or article | Instant |
| 2. Script Generation | AI writes scenes, narration, hooks, CTA, music direction | 30-60 seconds |
| 3. Script Review | Review and optionally edit the script before asset generation | Your pace |
| 4. Generate Storyboard | Scene images, voiceover, and music are produced | 1-2 minutes |
| 5. Storyboard Review | Review each scene, edit narration, regenerate elements | Your pace |
| 6. Approve All Scenes | Approve each scene individually or all at once | Your pace |
| 7. Generate Video | Full video render (clips, composition, multi-aspect export) | 3-10 minutes |
Steps 3 and 5 are where you spend most of your time. The generation and rendering steps (2, 4, and 7) are fully automated.
Once storyboard assets finish generating, the page shows three sections:
The top card displays your full narration text organized by scene. Each scene shows:
A duration bar at the top shows the total video duration compared to your target (e.g., "Video: 28s / 30s target"). This is the sum of per-scene clip durations, which is what ad platforms measure.
Two audio players let you preview the voiceover and the background music.
Each has a regenerate button with remaining count displayed.
A grid of cards, one per scene. Each card shows:
| Element | Description |
|---------|-------------|
| Scene image | AI-generated 2K image |
| Visual pill | What the image depicts |
| Narration pill | The voiceover text for this scene |
| Text Overlay pill | On-screen text that appears during the scene |
| Duration pill | Actual clip duration in seconds (from voiceover timing) |
| Mood pill | Emotional tone (e.g., "urgent", "hopeful", "confident") |
| Segment pill | Ad structure segment (Hook, Problem, Solution, etc.) |
| People pill | Whether the scene includes people |
| Approval badge | "Approved" (green) or "Pending" (amber) |
| Action buttons | Approve, Unapprove, Regenerate Image |
Work through the storyboard from top to bottom:
You can edit narration text at two stages:
On the script review screen (before clicking "Generate Storyboard"):
Once the storyboard exists with voiceover and images:
This saves your text changes and immediately triggers a new voiceover generation. The page polls automatically and reloads when the new voiceover is ready. Scene durations update to reflect the new spoken timing.
This is the primary way to fix duration compliance issues. If your video is 5 seconds over the target, shorten the narration across a few scenes, then Save & Regenerate.
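The poll-then-reload behavior described above can be pictured with a small sketch. This is only an illustration of the general pattern, not the ILLIXIS implementation: the `is_ready` callback, the 2-second interval, and the 120-second timeout are all assumptions chosen for the example.

```python
import time
from typing import Callable


def wait_for_voiceover(
    is_ready: Callable[[], bool],
    interval_s: float = 2.0,    # assumed polling interval
    timeout_s: float = 120.0,   # assumed upper bound on waiting
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Poll is_ready() until it returns True or the timeout elapses.

    is_ready is a hypothetical callback that reports whether the
    regenerated voiceover is available yet.
    """
    deadline = clock() + timeout_s
    while clock() < deadline:
        if is_ready():
            return True   # new voiceover is ready, the page can reload
        sleep(interval_s)
    return False          # gave up waiting


# Usage with a stand-in status check that succeeds on the third poll.
responses = iter([False, False, True])
ready = wait_for_voiceover(lambda: next(responses), sleep=lambda _s: None)
```

Passing `sleep` and `clock` as parameters keeps the sketch testable without real waiting.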
The voiceover speed slider sits next to the voiceover regenerate button. It controls the speaking rate of the AI voice.
| Speed | Effect |
|-------|--------|
| 0.70x | Slowest -- deliberate, dramatic pacing |
| 1.00x | Default -- natural conversational speed |
| 1.20x | Fastest -- energetic, urgent delivery |
When your video duration exceeds the target, the storyboard suggests a speed value. For example, if your voiceover is 35 seconds but your target is 30 seconds, you will see "Suggest: 1.17x" as a clickable link that pre-fills the speed input.
To use:
Editing narration text is generally more effective than increasing speed. Faster speech can sound unnatural. The recommended approach is to trim a few words from the longest scenes first, then use speed as a fine-tuning tool for the last 1-2 seconds.
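The suggested speed in the example above (35 seconds of voiceover against a 30-second target giving 1.17x) is consistent with dividing the current duration by the target. The sketch below assumes that formula and also assumes the suggestion is clamped to the slider range of 0.70x to 1.20x; neither detail is confirmed by the product, and `suggest_speed` is a hypothetical name.

```python
from typing import Optional

MIN_SPEED = 0.70  # slowest slider value
MAX_SPEED = 1.20  # fastest slider value


def suggest_speed(voiceover_s: float, target_s: float) -> Optional[float]:
    """Return a suggested speaking rate, or None if already within target.

    Assumes suggestion = current duration / target duration, clamped to
    the slider range and rounded to two decimals.
    """
    if voiceover_s <= target_s:
        return None
    raw = voiceover_s / target_s
    return round(min(max(raw, MIN_SPEED), MAX_SPEED), 2)


print(suggest_speed(35, 30))  # 1.17
print(suggest_speed(40, 30))  # 1.2 (clamped to the slider maximum)
print(suggest_speed(28, 30))  # None
```

Under these assumptions, a video that is more than 20% over target cannot be fixed by speed alone, which is one more reason to trim narration first.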
Each element type has its own regeneration limit per video:
| Element | Limit | What Changes |
|---------|-------|--------------|
| Scene Image | Equal to number of scenes | New image from same prompt |
| Voiceover | 10 per video | New audio from current narration text |
| Music | 10 per video | New music track matching script mood |
When remaining regenerations drop to 2 or fewer, the button turns amber as a visual warning. At 0 remaining, the button is disabled.
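As a rough illustration of the limits and warning states above, the following sketch maps a remaining count to a button state. The function names and the state labels ("normal", "amber", "disabled") are assumptions made for the example.

```python
def regeneration_limit(element: str, scene_count: int) -> int:
    """Per-video regeneration limit for an element type (from the table above)."""
    limits = {
        "scene_image": scene_count,  # equal to the number of scenes
        "voiceover": 10,
        "music": 10,
    }
    return limits[element]


def regenerate_button_state(remaining: int) -> str:
    """Map remaining regenerations to a button state."""
    if remaining <= 0:
        return "disabled"  # no regenerations left
    if remaining <= 2:
        return "amber"     # visual warning
    return "normal"


# Usage: a 6-scene video after 4 image regenerations.
remaining = regeneration_limit("scene_image", scene_count=6) - 4
print(regenerate_button_state(remaining))  # amber
```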
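Ad platforms reject videos that run longer than the standard duration they were booked for (for example, a 31-second video in a 30-second slot). The storyboard flags duration problems visually so you can fix them before rendering.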
Target durations are set when you create the video (15s, 30s, 45s, 60s, or custom).
The duration bar at the top of the Script & Narration section shows:
| State | Display |
|-------|---------|
| At or under target | Green bar: "Video: 28s / 30s target" |
| Over target | Amber bar: "Video: 34s / 30s target (+4s)" with guidance to shorten narration |
The displayed duration is the video duration (sum of per-scene clip durations), not the raw voiceover duration. Video clips round each scene's spoken duration up to the nearest whole second, so the video is never shorter than the voiceover and is usually slightly longer.
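The rounding rule can be worked through in a few lines. This is a minimal sketch of the arithmetic described above, with hypothetical function names and made-up spoken durations; the label text mirrors the examples in the table, and the color strings are placeholders.

```python
import math


def clip_durations(spoken_s: list[float]) -> list[int]:
    """Round each scene's spoken duration up to the next whole second."""
    return [math.ceil(s) for s in spoken_s]


def duration_bar(spoken_s: list[float], target_s: int) -> tuple[str, str]:
    """Return (color, label) for the duration bar."""
    total = sum(clip_durations(spoken_s))
    if total <= target_s:
        return "green", f"Video: {total}s / {target_s}s target"
    over = total - target_s
    return "amber", f"Video: {total}s / {target_s}s target (+{over}s)"


# Five scenes whose narration adds up to about 28 seconds of speech.
spoken = [4.2, 6.8, 5.1, 7.3, 4.6]
print(clip_durations(spoken))    # [5, 7, 6, 8, 5]
print(duration_bar(spoken, 30))  # ('amber', 'Video: 31s / 30s target (+1s)')
```

In this example the voiceover itself fits the 30-second target, but rounding each clip up pushes the video to 31 seconds. That is why the bar reports video duration rather than voiceover duration.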
Per-scene durations on each scene card show the actual clip duration with color coding:
To fix an over-target video:
Scene images are AI-generated at 2K resolution in 9:16 (vertical) aspect ratio. Key details:
Both audio elements have standard audio players in the storyboard:
Audio URLs include cache-busting parameters, so after regeneration the player always loads the fresh version.
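Cache busting generally works by changing the URL's query string whenever the underlying file changes, so the browser treats it as a new resource. The sketch below shows one common way to do that; the parameter name `v`, the version counter, and the example URL are assumptions, not the actual ILLIXIS URL format.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_cache_buster(url: str, version: int, param: str = "v") -> str:
    """Return url with a version query parameter set to the given value.

    Any existing value for the same parameter is replaced; other query
    parameters are preserved.
    """
    parts = urlsplit(url)
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != param
    ]
    query.append((param, str(version)))
    return urlunsplit(parts._replace(query=urlencode(query)))


# Usage: bump the version after each regeneration.
print(with_cache_buster("https://example.com/audio/voiceover.mp3", 3))
# https://example.com/audio/voiceover.mp3?v=3
```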
Once you have reviewed and are satisfied with all elements:
You can unapprove a scene at any time before starting production by clicking "Unapprove." This re-disables the Generate Video button until you approve it again.
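The approval gate amounts to a simple all-scenes check. The sketch below uses a hypothetical `Scene` type to show how a single unapproved scene keeps the Generate Video button locked; treating an empty storyboard as locked is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Scene:
    number: int
    approved: bool = False  # new scenes start as "Pending"


def can_generate_video(scenes: list[Scene]) -> bool:
    """The button unlocks only when every scene is approved."""
    return bool(scenes) and all(scene.approved for scene in scenes)


scenes = [Scene(1, approved=True), Scene(2, approved=True), Scene(3)]
print(can_generate_video(scenes))  # False: scene 3 is still pending

scenes[2].approved = True
print(can_generate_video(scenes))  # True

scenes[0].approved = False         # "Unapprove" on scene 1
print(can_generate_video(scenes))  # False again
```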
The cost summary at the bottom shows: