How to Use TTSMP3
Convert text to speech and download it as MP3 in seconds. This guide covers basic conversion, advanced features, and pro tips.
Enter Your Text
Start by typing or pasting your text into the main editor. TTSMP3 supports up to 5,000 characters per generation.
Use proper punctuation! The AI uses commas, periods, and question marks to add natural pauses and intonation.
You can also:
- Upload a TXT file: Click “📁 Upload TXT” to import text from a file
- Paste from anywhere: Copy from Word, Google Docs, or any text editor
- Use emojis: They’ll be automatically filtered out during processing
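If you prepare text in a script before pasting it in, the two constraints above (the 5,000-character limit and emoji filtering) can be checked ahead of time. The following is a minimal sketch, not TTSMP3's own code: `prepare_text` and `EMOJI_PATTERN` are hypothetical names, and the emoji ranges are a rough approximation of what a filter might remove.

```python
import re

MAX_CHARS = 5000  # per-generation limit stated in this guide

# Assumption: a rough pattern covering common emoji code points.
# TTSMP3's real filter is not documented here and may differ.
EMOJI_PATTERN = re.compile(
    "["
    "\U0001F300-\U0001FAFF"  # pictographs, emoticons, transport, extended symbols
    "\u2600-\u27BF"          # miscellaneous symbols and dingbats
    "\uFE0F"                 # variation selector-16
    "\u200D"                 # zero-width joiner
    "]+"
)


def prepare_text(raw: str) -> str:
    """Strip emojis, collapse runs of spaces/tabs, and enforce the length limit."""
    cleaned = EMOJI_PATTERN.sub("", raw)
    # Newlines are left alone so blank lines between paragraphs survive.
    cleaned = re.sub(r"[ \t]+", " ", cleaned).strip()
    if len(cleaned) > MAX_CHARS:
        raise ValueError(
            f"Text is {len(cleaned)} characters; the limit is {MAX_CHARS}."
        )
    return cleaned
```

For example, `prepare_text("Hello 👋 world! 🎵")` returns `"Hello world!"`, and text longer than 5,000 characters raises a `ValueError` so you can split it before generating.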
Choose Your Voice
Select from 15+ professional AI voices in the right panel. Each voice has unique characteristics:
- American Female: Heart, Alloy, Bella, Sarah, Sky, Nicole, River
- American Male: Adam, Echo, Michael, Ryan
- British Female: Emma, Jessica
- British Male: George, Lewis
Heart is warm and friendly (perfect for storytelling), Adam is clear and professional (ideal for business), and Emma has a sophisticated British accent (great for audiobooks).
Adjust Settings (Optional)
Fine-tune your audio output with these settings:
- Output Format:
  - WAV: Lossless quality, larger file size (~10MB per minute)
  - MP3: Compressed, smaller file size (~1MB per minute)
- Playback Speed: Adjust from 0.5x (slow) to 1.5x (fast) without affecting pitch
Use WAV for professional productions and MP3 for quick sharing or web use.
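To get a feel for the trade-off, the per-minute figures above can be turned into a quick size estimate. This is a rough sketch: `estimate_size_mb` is a hypothetical helper, the rates are the approximate values quoted in this guide, and it assumes the speed setting shortens or lengthens the rendered audio proportionally.

```python
# Approximate sizes quoted in this guide, in megabytes per minute of audio.
MB_PER_MINUTE = {"wav": 10.0, "mp3": 1.0}


def estimate_size_mb(duration_seconds: float, fmt: str, speed: float = 1.0) -> float:
    """Estimate file size for audio lasting duration_seconds at 1.0x speed."""
    if not 0.5 <= speed <= 1.5:
        raise ValueError("speed must be between 0.5 and 1.5")
    # Assumption: faster playback produces proportionally shorter audio.
    actual_seconds = duration_seconds / speed
    return MB_PER_MINUTE[fmt.lower()] * actual_seconds / 60.0
```

By this estimate, three minutes of narration comes to about 30MB as WAV and about 3MB as MP3 at normal speed.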
Generate Audio
Click the “🎵 Generate Audio” button. Here’s what happens:
- First Generation (10-20 seconds): The 82M-parameter AI model downloads and loads into your browser
- Subsequent Generations (2-3 seconds): Fast processing using the cached model, with no download step
- Long Texts: Automatically split into chunks and seamlessly stitched together
You’ll see a progress bar showing:
- Model loading status
- Processing chunks (e.g., “3 / 8” means chunk 3 of 8)
- Final assembly and crossfade application
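The chunk counter is easier to interpret with a picture of how splitting might work. The sketch below is an illustration only, not TTSMP3's implementation: it assumes chunks are built by packing whole sentences together, and the `max_chars` value of 400 is a made-up default, since the real chunk size is not stated in this guide.

```python
import re


def split_into_chunks(text: str, max_chars: int = 400) -> list[str]:
    """Greedily pack whole sentences into chunks of roughly max_chars characters.

    A single sentence longer than max_chars is kept intact as its own chunk.
    """
    # Split after ., !, or ? when followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)  # current chunk is full; start a new one
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def progress_labels(chunks: list[str]) -> list[str]:
    """Build labels in the "3 / 8" style shown by the progress bar."""
    return [f"{i} / {len(chunks)}" for i in range(1, len(chunks) + 1)]
```

With a small limit such as `max_chars=10`, the text `"One. Two! Three? Four."` splits into `["One. Two!", "Three?", "Four."]`, and the labels read `1 / 3`, `2 / 3`, `3 / 3`.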
Download Your Audio
Once generation is complete:
- Preview: Use the built-in audio player to listen before downloading
- Download: Click “⬇️ Download Audio” to save your MP3/WAV file
- Filename: Files are named ttsmp3_[timestamp].[format]
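If you post-process downloads in a script, it can help to produce names in the same pattern. This is a small sketch under one loudly stated assumption: the guide does not say what form the timestamp takes, so the hypothetical `make_filename` helper below uses Unix time in milliseconds purely for illustration.

```python
import time
from typing import Optional


def make_filename(fmt: str, timestamp_ms: Optional[int] = None) -> str:
    """Build a name following the ttsmp3_[timestamp].[format] pattern."""
    fmt = fmt.lower()
    if fmt not in ("mp3", "wav"):
        raise ValueError("format must be 'mp3' or 'wav'")
    if timestamp_ms is None:
        # Assumption: Unix time in milliseconds; the real format is not documented.
        timestamp_ms = int(time.time() * 1000)
    return f"ttsmp3_{timestamp_ms}.{fmt}"
```

For instance, `make_filename("MP3", 1700000000000)` gives `ttsmp3_1700000000000.mp3`.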
All generated audio is 100% royalty-free. Use in YouTube videos, podcasts, courses, ads, or any commercial project without attribution.
Advanced Features
Director Mode
Create multi-voice conversations by assigning different voices to dialogue segments.
Smart Chunking
Long texts are automatically split at sentence boundaries with crossfade transitions for seamless playback.
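A crossfade blends the end of one chunk into the start of the next so the join is not audible as a click. The sketch below shows the general idea with a linear fade over plain lists of samples. It is not TTSMP3's code: the fade shape and the `overlap` length are assumptions, and `crossfade` is a hypothetical name.

```python
def crossfade(a: list[float], b: list[float], overlap: int) -> list[float]:
    """Join two sample sequences, blending the last `overlap` samples of `a`
    with the first `overlap` samples of `b` using a linear fade."""
    overlap = min(overlap, len(a), len(b))
    if overlap == 0:
        return a + b  # nothing to blend; plain concatenation
    head, tail = a[:-overlap], a[-overlap:]
    blended = []
    for i in range(overlap):
        # Weight of `b` rises linearly while the weight of `a` falls.
        w = (i + 1) / (overlap + 1)
        blended.append(tail[i] * (1 - w) + b[i] * w)
    return head + blended + b[overlap:]
```

The result is shorter than the two inputs combined by `overlap` samples, because the overlapping region is played once rather than twice.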
Privacy-First
All processing happens in your browser. Your text is never sent to our servers, and after the initial model download everything runs offline.
Using Director Mode
Create dynamic conversations with multiple AI voices using simple syntax:
[Adam]: Today we’re discussing AI technology.
[Emma]: It’s fascinating how far we’ve come.
[Michael]: Absolutely. Let’s dive into the details.
Director Mode Rules:
- Start each line with [VoiceName]:
- Voice names must match exactly: Heart, Adam, Emma, Michael, etc.
- Each voice segment can be multiple sentences
- Leave blank lines between speakers for natural pauses
Perfect for podcast intros, audiobook dialogues, educational content with narration + character voices, or interview simulations.
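To check a script before pasting it in, the rules above can be expressed as a small validator. This is a sketch of how such a check could look, not TTSMP3's actual parser: `parse_script` is a hypothetical function, the voice list is taken from the Choose Your Voice section, and treating untagged lines as a continuation of the previous speaker is an assumption.

```python
import re

# Voice names listed in this guide; matching is exact and case-sensitive.
VOICES = {
    "Heart", "Alloy", "Bella", "Sarah", "Sky", "Nicole", "River",
    "Adam", "Echo", "Michael", "Ryan",
    "Emma", "Jessica",
    "George", "Lewis",
}

LINE_PATTERN = re.compile(r"^\[([^\]]+)\]:\s*(.*)$")


def parse_script(script: str) -> list[tuple[str, str]]:
    """Turn a Director Mode script into (voice, text) segments."""
    segments: list[tuple[str, str]] = []
    for line in script.splitlines():
        line = line.strip()
        if not line:
            continue  # blank lines separate speakers
        match = LINE_PATTERN.match(line)
        if match:
            voice, text = match.group(1), match.group(2)
            if voice not in VOICES:
                raise ValueError(f"Unknown voice: {voice!r}")
            segments.append((voice, text))
        elif segments:
            # Assumption: an untagged line continues the previous speaker.
            prev_voice, prev_text = segments[-1]
            segments[-1] = (prev_voice, f"{prev_text} {line}".strip())
        else:
            raise ValueError("Script must begin with a [VoiceName]: tag")
    return segments
```

Running it on a two-speaker script returns one `(voice, text)` pair per speaker turn, and a misspelled or lowercase name such as `[adam]:` raises a `ValueError` instead of silently falling back to another voice.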
Common Questions
Q: Why does the first generation take 10-20 seconds?
A: The 82M-parameter AI model (about 40MB) needs to download and load into your browser on first use. After that, it’s cached locally and generation takes only 2-3 seconds.
Q: Can I generate audio longer than 5,000 characters?
A: Yes! Split your text into multiple 5,000-character batches, generate each separately, then use audio editing software to merge them.
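If you would rather merge WAV batches with a script than with an audio editor, Python's standard `wave` module can concatenate files that share the same format. The following is a minimal sketch: `merge_wav_files` is a hypothetical helper, it joins files end to end with no crossfade, and it does not handle MP3.

```python
import wave


def merge_wav_files(input_paths: list[str], output_path: str) -> None:
    """Concatenate WAV files that share channels, sample width, and sample rate."""
    if not input_paths:
        raise ValueError("no input files given")
    params = None
    frames = []
    for path in input_paths:
        with wave.open(path, "rb") as src:
            fmt = (src.getnchannels(), src.getsampwidth(), src.getframerate())
            if params is None:
                params = fmt  # the first file defines the expected format
            elif fmt != params:
                raise ValueError(f"{path} has a different audio format")
            frames.append(src.readframes(src.getnframes()))
    with wave.open(output_path, "wb") as dst:
        dst.setnchannels(params[0])
        dst.setsampwidth(params[1])
        dst.setframerate(params[2])
        for chunk in frames:
            dst.writeframes(chunk)
```

Generate every batch with the same voice, format, and speed so the pieces match, then pass the downloaded files in reading order, for example `merge_wav_files(["part1.wav", "part2.wav"], "full.wav")`.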
Q: Does TTSMP3 work offline?
A: After the initial model download, yes! All processing happens locally in your browser using WebAssembly.
Q: Which audio format should I choose?
A: Use WAV for professional video production or editing (lossless quality). Use MP3 for web sharing, podcasts, or quick distribution (smaller file size).
Q: Can I adjust voice pitch or tone?
A: Currently, TTSMP3 offers speed adjustment (0.5x-1.5x). Each voice has its own natural pitch and tone. For pitch adjustment, use audio editing software after download.
Q: Is there a daily generation limit?
A: No! Generate unlimited audio, completely free, forever.