Speech to Text AI tools
Explore 60 AI tools tagged with Speech to Text across PromptStack's directory.
Observe.AI Studio
Productivity
AI agent evaluation and coaching studio
Observe AI Studio provides advanced tools for evaluating agent performance, building coaching workflows, and optimizing contact center operations with AI insights.
Castmagic AI
Audio
AI audio to content tool for creators
Castmagic AI converts podcast and video recordings into ready-to-publish content including show notes, blog posts, social clips, and newsletters with one click.
Swell AI
Audio
AI content repurposing for podcasters
Swell AI automatically repurposes podcast and video content into written articles, social posts, transcripts, and newsletters to maximize content reach and ROI.
Podsqueeze AI
Audio
AI podcast content repurposing tool
Podsqueeze AI generates show notes, timestamps, newsletters, tweets, and blog posts from podcast episodes automatically using AI transcription and content generation.
Deciphr AI
Audio
AI content repurposer for podcasters
Deciphr AI transforms podcast audio into timestamps, show notes, blog posts, and social media content automatically using AI transcription and summarization.
Transistor AI
Audio
AI podcast hosting with analytics
Transistor AI is a podcast hosting platform with AI transcription, smart show notes generation, and advanced analytics for professional podcast publishers.
Buzzsprout AI
Audio
AI podcast hosting and promotion platform
Buzzsprout AI adds artificial intelligence to podcast hosting with automated transcription, chapter generation, and AI-powered episode optimization for discoverability.
Zencastr AI
Audio
AI podcast recording and editing platform
Zencastr AI is a podcast production platform with AI audio enhancement, automatic transcription, video recording, and one-click audiogram creation for podcasters.
Alitu AI
Audio
AI podcast maker tool
Alitu AI is an all-in-one podcast maker that automates audio cleanup, transcript creation, chapter markers, and episode publishing for independent podcast creators.
Spext AI
Audio
AI podcast editing by text
Spext AI is an intelligent audio and video editing tool that lets you edit recordings by editing the transcript text, with automatic filler word and silence removal.
Flixier AI
Video Generation
Fast AI cloud video editor
Flixier AI is a fast cloud-based video editor with AI transcription, auto subtitles, text-to-speech, and collaboration tools for professional content production.
Veed IO AI
Video Generation
AI online video editor with subtitles
VEED IO AI is a browser-based video editing platform with AI subtitles, transcription, translation, eye contact correction, and avatar video creation for creators.
Whisper JAX
Audio
Accelerated Whisper transcription model
Whisper JAX is an optimized implementation of OpenAI's Whisper speech recognition model using JAX, providing up to 70x faster transcription for audio processing.
Speechbrain
Audio
Open-source AI speech processing toolkit
SpeechBrain is an open-source PyTorch-based speech processing toolkit for building speech recognition, speaker verification, speech enhancement, and synthesis systems.
Picovoice
Audio
On-device AI voice recognition platform
Picovoice is an on-device voice AI platform that provides wake word detection, speech recognition, and natural language understanding for privacy-first voice applications.
Vosk AI
Audio
Offline speech recognition toolkit
Vosk is an offline open-source speech recognition toolkit supporting 20+ languages with small model sizes suitable for mobile, IoT, and embedded device deployment.
Mozilla DeepSpeech
Audio
Open-source AI speech recognition engine
Mozilla DeepSpeech is an open-source speech-to-text engine based on Baidu's Deep Speech research, enabling developers to build offline speech recognition applications.
Kaldi AI
Audio
Open-source speech recognition toolkit
Kaldi is a state-of-the-art open-source speech recognition toolkit written in C++ widely used by researchers and developers for building custom speech recognition systems.
Speechmatics
Audio
Enterprise AI speech recognition API
Speechmatics provides enterprise-grade automatic speech recognition with high accuracy across 50+ languages and accents for real-time and batch transcription needs.
Nuance AI
Productivity
Microsoft AI for healthcare and enterprise
Nuance AI by Microsoft provides AI-powered solutions for healthcare documentation, enterprise virtual assistants, and conversational AI across voice and digital channels.
Bandwidth AI
Audio
AI voice and messaging network platform
Bandwidth AI is a cloud communications platform with AI-powered voice intelligence, call transcription, and real-time conversation analytics for enterprise applications.
Claap AI
Productivity
AI async video collaboration tool
Claap AI is an async video collaboration tool for teams with AI meeting recording, screen capture, automatic summaries, and searchable video libraries.
SalesAI
Marketing
AI sales call analysis and coaching
SalesAI records, transcribes, and analyzes sales calls using AI to identify winning behaviors, coaching opportunities, and deal risks for sales teams.
Enthu AI
Marketing
AI call monitoring for sales teams
Enthu AI automatically monitors and analyzes sales calls to provide conversation intelligence, compliance tracking, and performance coaching for sales managers.
Tethr AI
Productivity
AI conversation analytics platform
Tethr AI analyzes customer conversations across phone, chat, and email to surface insights on effort, sentiment, and performance for customer experience teams.
Observe AI
Productivity
AI contact center intelligence platform
Observe AI is a conversational intelligence platform for contact centers that transcribes calls, scores interactions, and provides real-time agent guidance and coaching.
Prodigal AI
Productivity
AI conversation intelligence for finance
Prodigal AI analyzes customer interactions in financial services to automate compliance checks, surface agent coaching insights, and improve collection performance.
Yoodli AI
Education
AI speech and communication coach
Yoodli AI is a private AI speech coach that analyzes video recordings of speeches and meetings to provide feedback on filler words, pacing, and body language.
Orai AI
Education
AI speech coach for presentations
Orai AI is an AI-powered speech coaching app that listens to presentations and provides real-time feedback on pacing, filler words, confidence, and energy.
Elsa Speak
Audio
AI English pronunciation coach
ELSA Speak is an AI-powered English pronunciation app that uses speech recognition to identify pronunciation errors and provide personalized accent coaching.
Pimsleur AI
Audio
AI-enhanced audio language learning
Pimsleur uses AI-powered speech recognition to evaluate pronunciation and personalize language learning through its proven audio-based method for 51 languages.
Limitless AI
Productivity
AI wearable for capturing conversations
Limitless AI is a personal AI platform with a wearable device that captures and summarizes conversations, meetings, and interactions to augment human memory.
Read AI
Productivity
AI meeting summaries and engagement scores
Read AI generates meeting summaries, transcripts, and engagement scores using AI, helping teams understand meeting effectiveness and improve communication.
MeetGeek
Productivity
AI meeting automation and insights tool
MeetGeek is an AI meeting assistant that auto-records, transcribes, and summarizes meetings while providing performance insights and topic tracking for teams.
Grain AI
Productivity
AI meeting recording and highlight tool
Grain AI records and transcribes meetings and uses AI to create highlight reels, coaching clips, and shareable moments from sales and customer calls.
Avoma
Productivity
AI meeting lifecycle management platform
Avoma is an AI meeting assistant that helps sales and customer success teams with agenda templates, live transcription, AI summaries, and CRM auto-sync.
Clearword
Productivity
AI meeting assistant for real-time summaries
Clearword is a real-time AI meeting assistant that writes meeting summaries, creates tasks, and updates your tools automatically as the meeting happens.
Airgram
Productivity
AI meeting recorder and notes tool
Airgram is an AI meeting assistant that records, transcribes, and summarizes video meetings with speaker identification, timestamps, and CRM integration.
Tactiq AI
Productivity
AI meeting transcription for Google Meet
Tactiq AI transcribes Google Meet, Zoom, and Teams meetings in real time, generating AI summaries, action items, and shareable meeting notes automatically.
Corti AI
Productivity
AI assistant for emergency medical calls
Corti AI assists emergency dispatchers by listening to calls in real time, detecting cardiac arrests, providing decision support, and documenting patient information automatically.
Nabla Copilot
Productivity
AI medical assistant for doctors
Nabla Copilot is an AI ambient scribe for healthcare that listens to patient consultations and automatically generates clinical notes, saving doctors hours of documentation.
Chorus AI
Marketing
AI conversation intelligence for sales
Chorus AI by ZoomInfo records and analyzes sales conversations to provide coaching recommendations, competitive intelligence, and deal risk insights for sales leaders.
Gladia AI
Audio
Real-time AI speech recognition API
Gladia AI provides a fast, accurate speech transcription API with speaker diarization, word-level timestamps, and real-time streaming for developers building voice apps.
Verbit AI
Audio
AI transcription for legal and education
Verbit AI is an AI-powered transcription and captioning platform specialized for legal, education, and media industries with human-in-the-loop quality assurance.
Rev AI
Audio
AI speech recognition and transcription API
Rev AI offers enterprise-grade speech recognition and transcription APIs with human review options, captions, and audio intelligence for developers and businesses.
Sonix AI
Productivity
AI automated transcription platform
Sonix is an AI-powered automated transcription, translation, and subtitling service used by media professionals, researchers, and legal teams for accurate, fast results.
Happy Scribe
Productivity
AI transcription and subtitle generator
Happy Scribe is an AI-powered transcription and subtitle platform supporting 120+ languages, used by journalists, podcasters, and video creators for accurate captions.
Ava AI
Productivity
AI captioning for deaf and hard-of-hearing
Ava AI provides real-time AI-powered captioning for conversations, meetings, and events, making communication accessible for deaf and hard-of-hearing individuals.
Podcastle AI
Audio
AI studio for podcast production
Podcastle AI Studio provides browser-based remote podcast recording with AI noise removal, voice enhancement, automatic transcription, and editing for independent podcasters.
Supernormal AI
Productivity
AI meeting notes and action items
Supernormal AI automatically generates meeting notes, action items, and summaries from your video calls with Google Meet, Zoom, and Teams integrations.
Notta AI
Productivity
AI transcription and meeting summary tool
Notta AI is an all-in-one transcription and meeting assistant that records audio and video, provides real-time transcription, generates meeting minutes, and exports notes.
Tldv AI
Productivity
AI meeting recorder with CRM sync
tl;dv is an AI meeting tool that records, transcribes, and summarizes meetings with timestamps and speaker identification, syncing insights to CRM and collaboration tools.
Fathom AI
Productivity
Free AI meeting recorder and summarizer
Fathom AI is a free AI meeting assistant that records, transcribes, highlights, and summarizes your video calls so you can focus on conversation instead of note-taking.
Otter Pilot
Productivity
AI meeting assistant and note-taker
Otter Pilot is an AI meeting assistant that joins Zoom, Teams, and Google Meet calls to take notes, write summaries, identify action items, and answer questions.
Adobe Podcast AI
Audio
AI audio enhancement for podcasters
Adobe Podcast AI offers AI-powered audio recording, transcription, and enhancement tools that remove background noise and improve voice quality for podcasters and creators.
Deepgram
Audio
Enterprise AI speech recognition API
Deepgram is an AI-powered speech recognition platform offering real-time and batch transcription APIs with high accuracy, low latency, and custom model training for enterprises.
AssemblyAI
Audio
AI speech recognition API for developers
AssemblyAI provides a powerful speech-to-text API with speaker diarization, sentiment analysis, content moderation, and auto-highlights for developers building audio-powered apps.
Whisper AI
Audio
OpenAI's speech recognition model
Whisper is an open-source automatic speech recognition system by OpenAI trained on large-scale multilingual data, offering highly accurate transcription across dozens of languages.
Sembly AI
Productivity
AI team meeting notes and analytics
Sembly AI records, transcribes, and analyzes meetings to generate smart summaries, decisions, risks, and task assignments automatically for distributed teams.