Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
Meetings & notes
Page 3 of 5 · 102 tools
A meeting assistant that records audio, writes notes, automatically captures slides, and generates summaries.
AI Powered Text to Voice Generator. — Generate realistic Text to Speech (TTS) audio using our online AI Voice Generator and the best synthetic voices. Instantly convert text in to natural-sounding speech and download as MP3 and WAV audio files.
Podcast.Ai. — Welcome to podcast.ai, a podcast that is entirely generated by artificial intelligence. Every week, we explore a new topic in-depth, and listeners can suggest topics or even guests and hosts for future episodes.
poly.ai — listed in “1000 AI collection tools” (no English blurb in source data).
Synthesize Voice AI and Natural Sounding Text-to-Speech — Replica. — Try today with 30 minutes of free voice credit.
A professional tool widely used in the entertainment industry to create emotion-rich, realistic voice clones.
Convert Audio to Text With Rythmex Converter. — Transcribe audio to text easily, quickly, and effectively.
Your Complete Generative Voice AI Toolkit. — Resemble's AI voice generator lets you create human–like voice overs in seconds.
Speechelo - Generate Voice From Text With Only 3 Clicks. The Most Realistic Souding Text to Audio Converter. — We GUARANTEE no one will tell your voiceover is A.I. generated with a text to voice tool.
With 150 text-to-speech languages and numerous accent options, Speechgen.io ensures a wide range of choices for users, catering to global and multicultural audiences.
The #1 Text to Speech Reader. — Power through docs, articles, PDFs, email — anything you read — by listening with our leading text-to-speech reader.
Stable Audio is Stability AI's first product for music and sound effect generation.
Stable Audio — AI music and sound effect generation application by stability.ai — Free/Paid
Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. — Free
Supertranslate - Add Subtitles to Videos Automatically. — Powered by OpenAI's Whisper, the world's most accurate speech-to-text engine!.
A multi-voice text-to-speech system trained with an emphasis on quality. #opensource