Will students notice it is AI-generated?

Modern TTS, especially from models like Fish Audio S2 Pro and Qwen3, sounds remarkably natural. Some students will notice, some will not. What matters more is clarity and consistency. Students care about learning the content, not analyzing the voice. That said, if you prefer, Murmur's voice cloning lets you use your own voice without actually recording.

Does Udemy accept AI-narrated courses?

Udemy's content policy requires that courses meet quality standards for audio clarity and consistency. They do not explicitly ban AI-generated narration. The key requirement is that the audio is clear, well-paced, and free of artifacts. Murmur's output meets these standards.

How do I update a single lesson without re-doing everything?

Edit the script for that lesson, regenerate the audio in Murmur with the same voice and speed settings, and replace the audio file in your video editor. The voice will be consistent with the rest of the course because you are using the same model and settings.

Can I generate narration for slides without video?

Yes. Many course platforms accept audio-only lessons or audio paired with PDF slides. Generate the audio in Murmur, export it, and upload alongside your slide deck. This is especially common for supplementary material and Q&A recaps.

What about courses in multiple languages?

Murmur supports 9 languages. For courses in English, Spanish, French, German, Italian, Portuguese, Japanese, Chinese, or Korean, you can generate narration locally. For other languages, you may need to supplement with a cloud TTS service for those specific modules.

Use Case

Text to Speech for Online Course Narration

How course creators use AI narration to produce consistent, updatable lessons without a recording studio. Workflow, costs, and platform tips.

April 19, 2026·4 min read

The Course Creator's Recording Problem

Creating an online course is already a massive effort. Writing the curriculum, building slides, recording screen captures, editing video. Then comes narration: sitting in a quiet room, recording 50+ lessons, re-recording when you stumble over a sentence, re-recording again when you update the content six months later.

For a typical 10-hour course, recording and editing narration takes 40 to 60 hours. That is before post-production (noise removal, volume normalization, breath editing). If you hire a professional narrator, expect to pay $2,000 to $4,000 for a 10-hour course. And every time you update a lesson, you either re-record it yourself or pay for another session.

AI text-to-speech changes this equation fundamentally. Write your script, generate audio, and export. Need to update lesson 37? Change the text and regenerate. The voice stays consistent, the cost stays zero (after the initial purchase), and the turnaround drops from hours to minutes.

Choosing a Voice for Educational Content

The right voice for course narration is one students can listen to for hours without fatigue. That means clear diction, moderate pace, and a warm but neutral tone. Avoid voices that are overly expressive or dramatic. Education content needs to inform, not perform.

In Murmur, Kokoro voices are the best starting point for course narration. The model maintains remarkably consistent tone and pacing across long sessions, which matters when students are listening to lesson after lesson. For courses that benefit from a more conversational feel (soft skills, creative topics), Qwen3 voices add warmth without sacrificing clarity.

Critical rule: use the same voice across every lesson in your course. Switching voices between modules is jarring and undermines the professional feel you are building. Pick one voice, test it with a sample lesson, and commit to it.

Batch Processing Workflow

For a full course, you will generate dozens of audio files. Here is the workflow that works:

Write all lesson scripts first. Do not generate audio until the scripts are reviewed and finalized. Regenerating is easy, but iterating on scripts while also managing audio files creates unnecessary complexity.
Name your scripts consistently: "01-introduction.txt", "02-getting-started.txt", etc. This keeps your audio exports organized.
Generate one test lesson and listen to the full thing. Check for mispronounced technical terms, awkward pauses, and pacing issues.
Adjust speed settings. For technical courses, 0.9x to 1.0x gives students time to absorb. For overview/conceptual lessons, 1.0x to 1.1x keeps energy up.
Generate all lessons with the same voice and speed settings. Consistency is non-negotiable.
Review each audio file. Regenerate any sections that sound off. With Murmur, there is no cost penalty for regeneration.

Handling Technical Terms

TTS models occasionally mispronounce specialized terminology, acronyms, or brand names. The fix is simple: spell it out phonetically in your script. Write "Kubernetes" as "koo-ber-net-eez" if the model stumbles. Write "SQL" as "sequel" if that is your preferred pronunciation. Most models handle common tech terms correctly, but always verify with a test generation.

Cost Comparison for a 10-Hour Course

Method	Initial Cost	Per Update	Total (Year 1)
Professional narrator	$2,000-4,000	$200-500/lesson	$3,000-6,000+
Cloud TTS (ElevenLabs Pro)	$99/month	Included (quota limits)	$1,188/year
Self-recording	Free (mic: $100-300)	Free (your time)	$100-300 + 60 hours
Murmur	$49 one-time	Free, unlimited	$49 total

Platform Requirements

Different course platforms have different audio requirements. Udemy requires audio at 48kHz sample rate with consistent volume levels. Teachable and Skillshare accept standard formats (MP3, WAV) without strict technical specifications. Murmur exports WAV at 44.1kHz or 48kHz, which meets or exceeds all major platform requirements.

For video lessons (which most platforms prefer), pair your TTS audio with screen recordings or slides. Import the audio track into your video editor (DaVinci Resolve, iMovie, or ScreenFlow) and sync it with your visuals. This workflow is significantly faster than recording voiceover live while screencasting.

Frequently Asked Questions

Narrate your entire course for $49.

Consistent AI voice across every lesson. Update any time, regenerate for free. No studio, no subscriptions, no per-minute billing.

Buy Murmur · $49

macOS 14+ · Apple Silicon required · 7-day refund policy