Home

Audio

11 tools

AssemblyAI

AssemblyAI

Audio

AssemblyAI is a developer-focused AI speech-to-text API delivering best-in-class transcription accuracy, real-time processing, and powerful audio intelligence features for any application.

freemium
ElevenLabs

ElevenLabs

Audio

Leading AI voice synthesis platform offering ultra-realistic text-to-speech, voice cloning, and real-time voice conversion in 32+ languages.

freemium
Maum AI

Maum AI

Audio

Maum AI (formerly MINDs Lab) is a Korean AI company offering enterprise-grade speech synthesis, speech recognition, vision AI, and NLP solutions with industry-leading Korean voice quality.

freemium
Murf AI

Murf AI

Audio

AI voice generator with 120+ studio-quality voices in 20+ languages for creating professional voiceovers for videos, e-learning content, and presentations.

freemium
Play.ht

Play.ht

Audio

Play.ht is an AI voice generation platform with 900+ ultra-realistic voices, voice cloning from a 30-second sample, and a real-time API used for podcasts, audiobooks, IVR systems, and multi-speaker conversational AI.

freemium
Speechify

Speechify

Audio

Speechify is an AI text-to-speech platform that turns any text, PDF, document, or web page into natural-sounding audio with 200+ voices in 60+ languages, helping students, professionals, and people with dyslexia consume content faster.

freemium
Suno

Suno

Audio

Suno is an AI music generation platform that creates full songs with vocals, instruments, and lyrics from simple text prompts using the state-of-the-art Suno v4 model.

freemium
Typecast

Typecast

Audio

Typecast is a Korean AI voice platform by Neosapience offering 400+ AI voices with emotion and style control, voice cloning, and professional text-to-speech for content creators.

freemium
Udio

Udio

Audio

Udio is an AI music generation platform that creates full songs with vocals from text prompts, known for exceptional audio quality and wide genre support.

freemium
Vito

Vito

Audio

Vito by Return Zero is Korea's best-in-class AI speech recognition platform offering real-time meeting transcription, audio file transcription, and developer APIs with industry-leading Korean STT accuracy.

freemium
Whisper

Whisper

Audio

Whisper is OpenAI's open-source speech recognition model offering state-of-the-art transcription accuracy across 99 languages, available free to run locally or via the OpenAI API.

free