Audio
11 tools
AssemblyAI
AudioAssemblyAI is a developer-focused AI speech-to-text API delivering best-in-class transcription accuracy, real-time processing, and powerful audio intelligence features for any application.
ElevenLabs
AudioLeading AI voice synthesis platform offering ultra-realistic text-to-speech, voice cloning, and real-time voice conversion in 32+ languages.
Maum AI
AudioMaum AI (formerly MINDs Lab) is a Korean AI company offering enterprise-grade speech synthesis, speech recognition, vision AI, and NLP solutions with industry-leading Korean voice quality.
Murf AI
AudioAI voice generator with 120+ studio-quality voices in 20+ languages for creating professional voiceovers for videos, e-learning content, and presentations.
Play.ht
AudioPlay.ht is an AI voice generation platform with 900+ ultra-realistic voices, voice cloning from a 30-second sample, and a real-time API used for podcasts, audiobooks, IVR systems, and multi-speaker conversational AI.
Speechify
AudioSpeechify is an AI text-to-speech platform that turns any text, PDF, document, or web page into natural-sounding audio with 200+ voices in 60+ languages, helping students, professionals, and people with dyslexia consume content faster.
Suno
AudioSuno is an AI music generation platform that creates full songs with vocals, instruments, and lyrics from simple text prompts using the state-of-the-art Suno v4 model.
Typecast
AudioTypecast is a Korean AI voice platform by Neosapience offering 400+ AI voices with emotion and style control, voice cloning, and professional text-to-speech for content creators.
Udio
AudioUdio is an AI music generation platform that creates full songs with vocals from text prompts, known for exceptional audio quality and wide genre support.
Vito
AudioVito by Return Zero is Korea's best-in-class AI speech recognition platform offering real-time meeting transcription, audio file transcription, and developer APIs with industry-leading Korean STT accuracy.
Whisper
AudioWhisper is OpenAI's open-source speech recognition model offering state-of-the-art transcription accuracy across 99 languages, available free to run locally or via the OpenAI API.