How to Use TTSMP3

Convert text to speech MP3 in seconds with our comprehensive guide. Learn basic conversion, advanced features, and pro tips.

Try TTSMP3 Now →

Enter Your Text

Start by typing or pasting your text into the main editor. TTSMP3 supports up to 5,000 characters per generation.

💡 Pro Tip:

Use proper punctuation! The AI uses commas, periods, and question marks to add natural pauses and intonation.

You can also:

Upload a TXT file: Click “📁 Upload TXT” to import text from a file
Paste from anywhere: Copy from Word, Google Docs, or any text editor
Use emojis: They’ll be automatically filtered out during processing

Choose Your Voice

Select from 15+ professional AI voices in the right panel. Each voice has unique characteristics:

American Female: Heart, Alloy, Bella, Sarah, Sky, Nicole, River
American Male: Adam, Echo, Michael, Ryan
British Female: Emma, Jessica
British Male: George, Lewis

🎙️ Voice Selection Tips:

Heart is warm and friendly (perfect for storytelling), Adam is clear and professional (ideal for business), Emma has a sophisticated British accent (great for audiobooks).

Adjust Settings (Optional)

Fine-tune your audio output with these settings:

Output Format:
- • WAV: Lossless quality, larger file size (~10MB per minute)
- • MP3: Compressed, smaller file size (~1MB per minute)
Playback Speed: Adjust from 0.5x (slow) to 1.5x (fast) without affecting pitch

⚙️ Recommended Settings:

Use WAV for professional productions and MP3 for quick sharing or web use.

Generate Audio

Click the “🎵 Generate Audio” button. Here’s what happens:

First Generation (10-20 seconds): The 82M-parameter AI model downloads and loads into your browser
Subsequent Generations (2-3 seconds): Instant processing using the cached model
Long Texts: Automatically split into chunks and seamlessly stitched together

You’ll see a progress bar showing:

Model loading status
Processing chunks (e.g., “3 / 8” means chunk 3 of 8)
Final assembly and crossfade application

Download Your Audio

Once generation is complete:

Preview: Use the built-in audio player to listen before downloading
Download: Click “⬇️ Download Audio” to save your MP3/WAV file
Filename: Files are named ttsmp3_[timestamp].[format]

✅ Commercial Use:

All generated audio is 100% royalty-free. Use in YouTube videos, podcasts, courses, ads, or any commercial project without attribution.

Advanced Features

🎬

Director Mode

Create multi-voice conversations by assigning different voices to dialogue segments.

✂️

Smart Chunking

Long texts are automatically split at sentence boundaries with crossfade transitions for seamless playback.

🔒

Privacy-First

All processing happens in your browser. Your text never touches our servers—100% private and offline.

🎬

Using Director Mode

Create dynamic conversations with multiple AI voices using simple syntax:

[Heart]: Welcome to our podcast!

[Adam]: Today we’re discussing AI technology.

[Emma]: It’s fascinating how far we’ve come.

[Michael]: Absolutely. Let’s dive into the details.

Director Mode Rules:

Start each line with [VoiceName]:
Voice names must match exactly: Heart, Adam, Emma, Michael, etc.
Each voice segment can be multiple sentences
Leave blank lines between speakers for natural pauses

🎭 Creative Uses:

Perfect for podcast intros, audiobook dialogues, educational content with narration + character voices, or interview simulations.

Common Questions

Q: Why does the first generation take 10-20 seconds?

A: The 82M-parameter AI model (about 40MB) needs to download and load into your browser on first use. After that, it’s cached locally and generation takes only 2-3 seconds.

Q: Can I generate audio longer than 5,000 characters?

A: Yes! Split your text into multiple 5,000-character batches, generate each separately, then use audio editing software to merge them.

Q: Does TTSMP3 work offline?

A: After the initial model download, yes! All processing happens locally in your browser using WebAssembly.

Q: Which audio format should I choose?

A: Use WAV for professional video production or editing (lossless quality). Use MP3 for web sharing, podcasts, or quick distribution (smaller file size).

Q: Can I adjust voice pitch or tone?

A: Currently, TTSMP3 offers speed adjustment (0.5x-1.5x). Each voice has its own natural pitch and tone. For pitch adjustment, use audio editing software after download.

Q: Is there a daily generation limit?

A: No! Generate unlimited audio, completely free, forever.

Ready to Start Generating?

Launch TTSMP3 →