0 / 5,000

Engine

Cloud · fast

Language

Voice

Loading voices…

Speed 1.0x

WAV · 24 kHz Checking...

Credits

— / —

Free text-to-audio converter

Text to Audio

Paste text, pick a voice, download a clean WAV file. 54 voices across 9 languages, commercial use, no signup.

The whole text-to-audio category

One tool, the right output for any project

Text-to-audio covers everything from a quick voice memo to a full audiobook master. The output format you actually need depends on where the audio is going. FreeTextoSpeech generates lossless 24 kHz WAV by default because it is the cleanest source for any downstream workflow, keep the WAV as your master, derive MP3 or streaming formats when distribution requires them.

Related use cases

Text to MP3 Text to WAV YouTube Podcasters Audiobooks Read PDF aloud

The quick answer

Paste your text (up to 5,000 characters), pick a voice from 54 Kokoro options across 9 languages, hit generate, and download a 24 kHz WAV file. No signup, no watermark, free for commercial use. Need MP3 instead? Generate the WAV here, then convert via the dedicated /text-to-mp3 workflow.

In four steps

From text to a downloadable audio file

01

Paste your text

Up to 5,000 characters per request. Split longer scripts into chapters or sections, each one becomes a separate audio file.
02

Pick a voice

54 Kokoro voices across 9 languages. Sample a few before committing, voice choice matters more than any post-processing.
03

Generate

Synthesis runs in your browser session. No queue, no waiting room, no signup. Most reads finish in a few seconds.
04

Download the WAV

You get a lossless 24 kHz WAV file by default. Need MP3 instead? Use the dedicated /text-to-mp3 page for the conversion workflow.

Common workflows

What people use text-to-audio for

04 scenarios

01 / 04

Video editing source audio

Drop the WAV straight into Premiere, DaVinci Resolve, or CapCut. Lossless input means cleaner ducking, EQ, and noise gating later.

02 / 04

Podcast production

Use TTS for show intros, segment bumpers, ad reads, or full episodes. WAV master goes into your DAW, MP3 ships to the host.

03 / 04

Audiobook draft listening

Catch awkward sentences and pacing problems by listening to your manuscript before recording. Cheaper than reshooting a chapter.

04 / 04

Accessibility & e-learning

Convert articles, PDFs, and course modules into audio for dyslexic readers, commuters, or learners who absorb better by ear.

Practical guidance

Picking the right format and settings

01

Default to WAV, convert when needed

WAV is lossless, generate once, derive MP3 or AAC for distribution. Going the other way (MP3 to WAV) does not recover the lost frequencies.
02

MP3 at 128 kbps is transparent for speech

For voice-only content, 128 kbps MP3 is sonically indistinguishable from the WAV master in blind tests. No reason to pay the file-size cost of 320 kbps for podcasts or audiobooks.
03

24 kHz is the right sample rate for voice

Human speech tops out around 8 kHz, so 24 kHz captures everything with headroom. 44.1 kHz or 48 kHz is overkill for narration and just inflates file size.
04

File size: roughly 1.4 MB per minute (WAV)

A 5-minute read is about 7 MB as WAV, 3.7 MB as 128 kbps MP3, or 1.2 MB as 64 kbps Opus. Pick the format your delivery channel actually accepts.
05

Stream vs download

For web playback, host an MP3 or AAC and stream it. For DAW work, download the WAV. Mixing the two, streaming a WAV, wastes bandwidth without sounding any better through laptop speakers.
06

Generate at 1.0× speed

Adjust pacing later in your editor. Compounding speed changes at both generation and editing introduces pitch artefacts that are hard to undo.

FAQ

Frequently Asked Questions

What is the difference between text to audio and text to speech?

They are the same thing in practice. "Text to audio" is the broader search term, it covers any text-to-audio-file workflow regardless of output format (WAV, MP3, OGG). "Text to speech" specifically refers to the speech synthesis step. FreeTextoSpeech does both: synthesises speech and exports it as a downloadable audio file.

What audio format does FreeTextoSpeech output?

Lossless 24 kHz, 16-bit mono WAV. WAV is the cleanest format for further editing, drop it into any DAW, video editor, or podcast tool with no decode step. If you need MP3 for distribution, the /text-to-mp3 page covers the conversion workflow.

Is the text-to-audio converter actually free?

Yes. No signup, no card, no watermark on the audio file. The free anonymous tier is 5,000 characters per request and a monthly cap to keep abuse manageable. Sign in with Google for higher limits, still free.

How many characters can I convert at once?

Up to 5,000 characters per request. For longer pieces (book chapters, full courses), split the text into sections and concatenate the WAV files in any audio editor. Audacity, Reaper, and Premiere all do this in seconds.

Can I use the audio commercially?

Yes. The Kokoro model is Apache 2.0 licensed, and FreeTextoSpeech does not impose additional restrictions. Use the audio in YouTube videos, paid courses, podcasts, audiobooks, ads, no royalties, no attribution required.

Which voices and languages are available?

54 Kokoro voices across 9 languages: English (US and UK), Spanish, French, Italian, Portuguese, Japanese, Mandarin, and Hindi. Each voice has a distinct tone, sample a few before settling on one for a long project.

Does the audio play in the browser before I download?

Yes. Generation produces an in-page audio player so you can listen before downloading. If you do not like the result, regenerate with a different voice or split the text differently.

What if I need an MP3 instead of a WAV?

Generate the WAV here, then follow the conversion steps on /text-to-mp3. On Mac, Apple Music does it in one click. On Windows, Audacity exports MP3 with the free LAME encoder. Online, cloudconvert.com handles WAV-to-MP3 free.

Still wondering? Get in touch →

Format & use-case pages

Pick the page that matches your project

All tools

Audio format ↗

Ready to generate your audio file?

One paste, one click, lossless WAV in seconds.

Open the tool See all voices

Text to Audio

One tool, the right output for any project

From text to a downloadable audio file

Paste your text

Pick a voice

Generate

Download the WAV

What people use text-to-audio for

Video editing source audio

Podcast production

Audiobook draft listening

Accessibility & e-learning

Picking the right format and settings

Default to WAV, convert when needed

MP3 at 128 kbps is transparent for speech

24 kHz is the right sample rate for voice

File size: roughly 1.4 MB per minute (WAV)

Stream vs download

Generate at 1.0× speed

Frequently Asked Questions

What is the difference between text to audio and text to speech?

What audio format does FreeTextoSpeech output?

Is the text-to-audio converter actually free?

How many characters can I convert at once?

Can I use the audio commercially?

Which voices and languages are available?

Does the audio play in the browser before I download?

What if I need an MP3 instead of a WAV?

Pick the page that matches your project

Text to MP3

Text to WAV

Text to Speech for YouTube

Text to Speech for Podcasters

Text to Speech for Audiobooks

Read PDF Aloud

Ready to generate your audio file?

Text to Audio - Free Online Text-to-Audio Converter