Custom Voice Cloning lets you create AI voices that sound like real people—your CEO, brand spokesperson, or anyone else whose voice you want to use in marketing videos. Upload 1-5 audio samples, and ILLIXIS uses AI voice cloning to create a synthetic voice that matches the original speaker's tone, cadence, and characteristics.
Once created, your custom voice appears alongside preset voices in the video production workflow. Every video you produce can use your brand's unique voice instead of generic AI voices.
|
Requirement |
Specification |
|---|---|
|
Format |
MP3, WAV, or M4A |
|
File size |
Maximum 10MB per file |
|
Number of files |
1-5 samples recommended |
|
Total duration |
1-2 minutes of audio works best |
|
Content |
Natural speech only (no singing, music, or sound effects) |
|
Quality |
Clear recording without background noise or echo |
For high-quality voice clones:
Avoid these common issues:
If you have no custom voices yet, you'll see an empty state with a "Create Your First Voice" button.
Click "Clone Voice" (or "Create Your First Voice" if this is your first).
You'll be taken to a full-page form for creating your voice clone.
Fill in the voice information:
|
Field |
Required |
Description |
|---|---|---|
|
Voice Name |
Yes |
Display name (e.g., "CEO Voice", "Brand Spokesperson") |
|
Gender |
No |
Male, Female, or Neutral (helps categorize in dropdowns) |
|
Language |
No |
Primary language: English, Spanish, Portuguese, French, German, or Italian |
|
Description |
No |
Optional notes about when to use this voice |
As you add files, they appear in a list below the upload zone. Click the X icon on any file to remove it.
By default, "Remove background noise" is enabled. This applies audio isolation to clean up your samples before cloning. Leave this enabled unless your recordings are already studio quality.
Click "Create Voice Clone".
A processing overlay appears while ILLIXIS:
This takes 30-60 seconds. Do not close the page during processing.
After creation, you're redirected to the Voice Library. Your new voice appears in the "Your Custom Voices" section with:
Click the play button to hear a preview sample of your cloned voice.
When you have multiple custom voices, you can designate one as your default:
The default custom voice is automatically selected when:
To change your default:
Your custom voices appear in the voice selection dropdown when:
To use a custom voice:
When creating video posts from articles or briefs:
For programmatic access, custom voices are included in the voice options API:
```
GET /content_hub/api/voices/
```
Returns both preset and custom voices for your tenant, with the type: "custom" field indicating cloned voices.
You can create up to 10 custom voices per account. If you've reached the limit, delete an existing voice before creating a new one.
Deleting a voice:
|
Status |
Meaning |
|---|---|
|
Ready |
Voice clone complete, available for use |
|
Pending |
Voice is being created (wait ~60 seconds) |
|
Failed |
Cloning failed—error message shown; try again with different samples |
If the preview sample doesn't play or sounds wrong:
Good content for voice samples:
Poor content for voice samples:
This message appears when voice cloning is not enabled for your account. Contact support to enable it.
Delete an existing voice before creating a new one:
Causes and fixes:
Common causes:
Voice cloning captures general characteristics but isn't perfect reproduction. Results depend on:
For critical brand voices, test with a few video generations before full production use.
Before cloning someone's voice:
Transform your articles into platform-ready video scripts with scene breakdowns, timing, and B-roll suggestions. Perfect for creating TikToks, Reels, and Shorts from existing content.
ILLIXIS offers voice cloning to give your brand a consistent, recognizable voice across all video and audio content. Clone your CEO's voice, brand spokesperson, or any voice that represents your brand.
ILLIXIS generates fully animated videos from still images using advanced AI models. Unlike Ken Burns animations that simply pan and zoom, AI video generation creates actual motion — hair blowing, fabric rippling, water flowing.
Ken Burns Mode transforms static images into engaging videos with smooth pan and zoom animations. It's the faster, lower-cost alternative to full AI video generation while still creating professional motion effects.
Social video captions in ILLIXIS have two configuration dimensions: Caption Type content angle and Caption Style visual presentation.