News

Discover the key differences between Moshi and Whisper speech-to-text models. Speed, accuracy, and use cases explained for your next project.
Key features, accuracy, and usability factors to consider when selecting the right speech-to-text converter for your needs ...
The company took a step in another technological direction by launching its first stand-alone speech-to-text model called Scribe.
Meta says that it's the biggest open-source multimodal dataset, containing 270,000 hours' worth of mined speech and text alignment on which its AI was trained.
Meta’s “massively multilingual” AI model translates up to 100 languages, speech or text Meta aims for a universal translator like "Babel Fish" from Hitchhiker’s Guide.
ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest ...
The single-system approach of SeamlessM4T reduces errors and delays, Meta claims, increasing the efficiency and quality of the translation.
Parrot, an AI-powered transcription platform offering speech-to-text depositions, raised $11 million in Series A funding.
The defining metric of the speech-to-text industry is accuracy. However, what accuracy really means and how it can be measured accurately is a subject of huge debate within the speech-to-text ...
OpenAI’s new voice AI model gpt-4o-transcribe lets you add speech to your existing text apps in seconds ...