Discover the most powerful AI tools in this category with pricing, features, demo and use cases
Amazon Echo is a smart speaker powered by Alexa, an AI assistant designed for voice interaction, sma...
Google Cloud Text-to-Speech is a service that converts text into lifelike speech using advanced deep...
Synthesia is an AI-powered video generation platform that allows users to create professional videos...
AssemblyAI is a leading AI company providing powerful Speech-to-Text and Audio Intelligence APIs for...
Otter.ai is an AI-powered transcription and meeting assistant that records, transcribes, and summari...
ElevenLabs is a leading AI audio platform specializing in realistic text-to-speech (TTS) and voice c...
Amazon Transcribe is a fully managed machine learning service that provides highly accurate speech-t...
Rev AI is a cutting-edge speech recognition and natural language understanding platform that provide...
Murf AI is a text-to-speech AI voice generator that allows users to create realistic voiceovers for ...
RVC Tools is an open-source toolkit designed for voice conversion, enabling users to transform audio...
Descript is an AI-powered audio and video editing platform that allows users to edit media by editin...
Krisp is an AI-powered noise-canceling and voice enhancement application that removes background noi...
LibriSpeech is a large-scale, open-source dataset of read English speech used for training and evalu...
VoxCeleb is a large-scale dataset for speaker recognition and speaker diarization, comprising a vast...
Synthesia AI Templates provides pre-built video templates and AI avatars to quickly generate profess...
Microsoft Azure Speech Service offers a suite of AI-powered speech capabilities, including speech-to...
Deepgram is a leading AI service specializing in advanced speech-to-text and natural language unders...
Sonos AI Speakers are integrated smart speakers that leverage AI for enhanced audio experiences, voi...
Voicemod AI is a real-time voice changer and vocal effects application that transforms a user's voic...
ElevenLabs Voice is an advanced AI platform specializing in realistic and emotive text-to-speech (TT...
Resemble AI is a leading platform for generating realistic, high-quality synthetic voice using advan...
HeyGen is a leading AI-powered video generation platform that allows users to create professional vi...
Colossyan is a leading AI-powered video creation platform that enables users to generate professiona...
Amazon Polly is a cloud-based service that converts text into lifelike speech, offering a wide range...
Descript Overdub is an AI-powered voice cloning and text-to-speech service that allows users to crea...
Voicemod is a real-time voice changer and vocal effects application that allows users to modify thei...
ReadSpeaker provides advanced text-to-speech (TTS) and speech-to-text (STT) solutions, enabling natu...
Apple HomePod is a smart speaker that acts as a home hub for Apple's HomeKit ecosystem, offering voi...
Voicebox is an advanced AI system developed by Meta AI capable of generating speech in various voice...
Espnet is an open-source toolkit for end-to-end speech processing, supporting research and developme...