How large are the audio files?

A 1,500-word blog post generates roughly 8 to 10 minutes of audio. As a WAV file, that is about 80 to 100MB. Compressed to MP3 at 128kbps, it shrinks to 7 to 10MB. For web embedding, MP3 is the practical choice.

Can I use this with WordPress or Ghost?

Yes. Both platforms support custom HTML blocks where you can embed a standard audio element. Some WordPress plugins also handle audio embedding with additional features like download buttons and speed controls.

Should I use different voices for different blogs?

Generally, no. Consistency builds brand recognition. Pick one voice and use it across all your posts. The exception is if you run multiple distinct publications with different audiences.

How do I handle quotes from other people in the article?

You have two options. The simpler approach is to preface quotes with the speaker's name: "John Smith said, quote..." The more polished approach is to generate quotes with a different voice in Murmur and splice the audio files together in an editor like Audacity or GarageBand.

Does adding audio help with SEO?

Indirectly, yes. Audio increases time-on-page and reduces bounce rate, both of which are positive engagement signals. Google does not directly index audio content from embedded players, but the behavioral improvements can support rankings.

Tutorial

How to Turn Blog Posts into Audio (Step-by-Step)

A practical guide to converting your blog posts and newsletters into audio. Prepare text, choose a voice, generate, and embed on your site.

April 17, 2026·4 min read

Why Add Audio to Your Blog

Adding an audio version of your blog posts is not just a nice-to-have. Nielsen Norman Group research shows that pages with audio see 2.5x longer time-on-page compared to text-only versions. Readers can listen while commuting, cooking, or exercising. Mobile users, who often prefer listening over reading, stay engaged longer.

Accessibility is another strong reason. Visually impaired readers, people with dyslexia, and anyone experiencing screen fatigue benefit from an audio option. It also signals to your audience that you care about how they consume your content, not just that they consume it.

The practical barrier has always been effort. Recording yourself takes time, editing takes more, and hiring a voice actor is expensive. AI text-to-speech removes that barrier entirely. You can go from finished blog post to embedded audio in under ten minutes.

Step 1: Prepare Your Text

TTS engines read exactly what you give them. Blog posts often contain elements that sound wrong when read aloud: markdown formatting, bare URLs, image alt text, and code blocks. Before generating audio, clean your text.

Remove markdown syntax (headers, bold/italic markers, link formatting).
Replace URLs with descriptive text. Instead of "visit https://example.com" use "visit our website."
Remove or rewrite image captions and alt text that would be confusing out of context.
Strip code blocks entirely, or replace them with a spoken description like "the code example below shows how to initialize the client."
Break very long paragraphs into shorter ones. TTS handles moderate-length passages better than wall-of-text blocks.
Spell out abbreviations on first use. "TTS" should be "text-to-speech" the first time it appears.

Step 2: Choose the Right Voice

Voice selection matters more than most people think. A warm, steady voice works well for long-form articles. A slightly faster, more energetic voice suits listicles and news-style posts. Murmur includes 860+ voices across its model library, so you have options.

For blog narration, prioritize clarity and consistency over expressiveness. You want a voice that sounds natural across 1,000 to 3,000 words without becoming distracting. Kokoro voices tend to excel here because the model maintains even pacing over long passages. If you want more personality, Qwen3 and Fish Audio offer warmer, more dynamic options.

Pick one voice and stick with it across all your posts. Consistency builds familiarity. Your readers will start to associate that voice with your brand.

Step 3: Generate and Export

Paste your cleaned text into Murmur, select your voice, and set the speed. For blog narration, a speed setting between 0.9x and 1.1x works best. Slower speeds feel like a lecture. Faster speeds lose clarity. The sweet spot is natural conversational pace.

Generate the audio and listen through the first 30 seconds. If the voice and pacing feel right, export the file. Murmur exports WAV by default. For web embedding, convert to MP3 to reduce file size (a 10-minute WAV is roughly 100MB; the same MP3 is under 10MB).

Step 4: Embed on Your Site

The simplest approach is an HTML5 audio tag at the top of your post. Upload the MP3 to your hosting provider or a CDN, then add this to your post template:

Use the native HTML audio element with the controls attribute. Point the source to your hosted MP3 file. This works on every modern browser, requires no JavaScript, and loads efficiently. Most CMS platforms (WordPress, Ghost, Webflow) support custom HTML blocks where you can paste this directly.

Your Options Compared

Option	Cost	Quality	Privacy	Setup Time
Browser TTS plugins	Free	Low (robotic)	Varies	Minutes
Cloud TTS (ElevenLabs, etc.)	$5-99/month	High	Text sent to servers	Minutes
Murmur (local TTS)	$49 one-time	High	100% local	10 min first time
Hire voice talent	$50-200/post	Highest	Good (with NDA)	Days per post
Record yourself	Free (time cost)	Variable	Full control	30-60 min/post

Tips for Better Blog Audio

Add a brief intro. Something like "This is the audio version of [post title], published on [date]." It sets listener expectations.
Keep audio files under 15 minutes. For longer posts, consider splitting into parts.
Host your audio on a CDN rather than your web server. Services like Cloudflare R2 or AWS S3 are affordable and fast.
Add the audio player above the fold. If readers have to scroll to find it, they will not use it.
Include a speed control or mention that the player supports playback speed adjustment.

Frequently Asked Questions

Give your blog a voice.

Turn any blog post into audio in under 10 minutes. 860+ voices, 6 AI models, no subscription. $49 and your blog sounds better.

Buy Murmur · $49

macOS 14+ · Apple Silicon required · 7-day refund policy