Tutorials

AI Voice Cloning: How to Clone Your Voice (2024 Guide)

October 12, 2024 4 min read

AI Voice Cloning: How to Clone Your Voice

Want to create content without recording every word? Clone your voice.

Here’s how, done ethically and effectively.

Why Clone Your Voice?

Legitimate uses:

  • Narrate content without recording
  • Create consistent voiceovers at scale
  • Accessibility (help those who’ve lost their voice)
  • Localize content into other languages
  • Audiobook production
  • Podcast editing (fix mistakes)

Not legitimate:

  • Impersonating others without consent
  • Scams or fraud
  • Non-consensual content

Be ethical. Only clone voices you have rights to use.

Best Voice Cloning Tools

ElevenLabs — Best Overall

Price: Free tier / $5+ per month Quality: Excellent Clone time: 1-5 minutes of audio

How to use:

  1. Create account at elevenlabs.io
  2. Go to “Voice Lab”
  3. Click “Add Voice” → “Instant Voice Clone”
  4. Upload audio samples
  5. Name your voice and save

Tips for best results:

  • Use high-quality audio (no background noise)
  • Include varied speech (not monotone)
  • 3-5 minutes of audio is ideal
  • Multiple samples beat one long recording

Play.ht

Price: $31+/month Quality: Very good Best for: Podcasters, content creators

More affordable for high-volume use. Good API for automation.

Resemble AI

Price: Custom pricing Quality: Excellent Best for: Enterprises, app developers

Professional-grade. Better for production applications.

Descript Overdub

Price: Included in Descript ($12+/month) Best for: Podcast/video editors already using Descript

Train your voice, then type to generate speech. Great for fixing recording mistakes.

Step-by-Step: Creating Your Voice Clone

Step 1: Record Training Audio

What you need:

  • Quiet room
  • Decent microphone (phone mic works, but better is better)
  • 3-5 minutes of varied speech

Recording tips:

  • Read diverse content (articles, fiction, instructions)
  • Use natural pacing
  • Include questions, statements, different emotions
  • Avoid background noise completely

Script suggestion: Read this article out loud. Then read a Wikipedia article. Then a poem. Variety helps.

Step 2: Prepare Audio

Clean up:

  • Remove long silences
  • Cut any mistakes
  • Export as WAV or high-quality MP3

Don’t:

  • Add music or effects
  • Over-process with EQ
  • Include multiple speakers

Step 3: Upload and Train

ElevenLabs process:

  1. Upload your audio file(s)
  2. Add description labels
  3. Click “Add Voice”
  4. Wait 30-60 seconds

Test immediately: Enter test text and listen. Adjust if needed.

Step 4: Refine

If it doesn’t sound right:

  • Record more varied content
  • Check audio quality
  • Try different samples
  • Adjust stability/clarity settings

Settings Explained

Stability

Higher = more consistent, less emotional Lower = more expressive, less predictable

For narration: Higher stability For conversational: Lower stability

Clarity + Similarity Enhancement

Higher = closer to original voice Lower = more AI interpretation

Start high, adjust based on results.

Use Cases and Workflows

Content Narration

  1. Write script
  2. Generate audio with voice clone
  3. Edit in audio software
  4. Add to video/podcast

Time saved: Hours per piece of content

Podcast Production

  1. Record podcast normally
  2. Use voice clone to fix mistakes
  3. Generate additional segments as needed

Audiobooks

  1. Train high-quality voice clone
  2. Chapter-by-chapter generation
  3. Professional editing

Note: Some platforms restrict AI narration. Check terms.

Localization

  1. Clone voice in English
  2. Use tools that maintain voice in other languages
  3. Generate foreign-language versions

ElevenLabs supports this increasingly well.

Quality Comparison

ToolQualityEmotionLanguages
ElevenLabs9/10Excellent29+
Play.ht8/10Good20+
Descript7/10LimitedEnglish
Resemble9/10ExcellentMultiple

Only clone voices when you have explicit permission. That includes:

  • Your own voice ✓
  • Employees with written consent ✓
  • Voice actors with contracts ✓
  • Random people’s voices ✗
  • Celebrities ✗
  • Deceased persons (complex) ⚠️

Disclosure

Be transparent when content uses AI voices. Many platforms require disclosure.

Terms of Service

Check each tool’s terms. Most prohibit:

  • Deceptive use
  • Impersonation
  • Adult content
  • Political manipulation

Pricing Comparison

ToolFree TierPaid
ElevenLabs10K chars/month$5-330/month
Play.htTrial$31-99/month
DescriptLimited$12-24/month
ResembleTrialCustom

For hobbyist use, ElevenLabs free tier is sufficient.

Getting Started

Today:

  1. Sign up for ElevenLabs (free)
  2. Record 3-5 minutes of yourself reading varied content
  3. Create your voice clone
  4. Generate test content

This week:

  1. Refine your clone with more samples
  2. Create one piece of content with it
  3. Compare to manual recording

This month:

  1. Integrate into your workflow
  2. Identify where voice clone saves time
  3. Scale usage if valuable

Voice cloning is powerful. Use it responsibly and it’s a game-changer for content creation.

Disclosure: This post contains affiliate links. If you click through and make a purchase, we may earn a commission at no extra cost to you. We only recommend tools we genuinely believe in.