ElevenLabs Introduces Scribe, a Standalone AI Speech-to-Text Model

ElevenLabs, the AI startup recently valued at $3.3 billion after securing a $180 million funding round, has launched Scribe, its first standalone speech-to-text model. Known for its advancements in audio generation, the company is now expanding into speech recognition, positioning itself as a competitor to Gladia, Speechmatics, AssemblyAI, Deepgram, and OpenAI’s Whisper.

Mati Staniszewski, CEO of ElevenLabs, stated that Scribe was developed to improve speech detection accuracy across multiple languages. He explained that while many consider speech-to-text a solved problem, there are still significant gaps in performance for numerous languages. He emphasized that ElevenLabs’ in-house data annotation and rapid feedback processes enable the company to build more precise models.

Scribe supports over 99 languages at launch, with 25 categorized under the highest accuracy tier, boasting a word error rate below 5%. English leads with a claimed 97% accuracy rate, alongside French, German, Hindi, Indonesian, Japanese, Kannada, Malayalam, Polish, Portuguese, Spanish, and Vietnamese. The model has outperformed Google Gemini 2.0 Flash and Whisper Large V3 in FLEURS & Common Voice benchmark tests, demonstrating its competitive edge in speech recognition.

Designed initially as part of ElevenLabs’ conversational AI platform, Scribe is now available as a standalone product, featuring smart speaker diarization, word-level timestamps for precise subtitles, and automatic tagging of sound events such as audience laughter. Users can transcribe video content directly within the company’s AI studio, facilitating subtitle and caption generation. Currently limited to pre-recorded audio formats, a low-latency real-time version is set to launch soon, enabling use in meetings and live voice note-taking.

ElevenLabs has priced Scribe at $0.40 per hour of transcribed audio, offering a competitive rate against existing market solutions. While some rivals provide lower-cost alternatives, the company aims to differentiate itself through superior accuracy and feature integration.

Need Deeper Intelligence on the AI Market?

AI Insider's Market Intelligence platform tracks funding rounds, competitive landscapes, and technology trends across the global AI ecosystem in real time. Get the data and insights your organization needs to make informed decisions.

Related Articles

Pudu Robotics Opens US Headquarters in Dallas

Insider Brief PRESS RELEASE — Pudu Robotics, a global leader in commercial service robotics, officially opened a new U.S. headquarters in Dallas, Texas, on April

OpenAI Apologises for Tumbler Ridge Shooting Failure as Reports Emerge of AI Smartphone Plans

OpenAI chief executive Sam Altman issued a public apology to residents of Tumbler Ridge, Canada, after it emerged the company had banned a ChatGPT account

SquareMind Raises $18M in Funding to Launch AI-Driven Robotic Skin Imaging Platform in US and Europe

Insider Brief France-based medical robotics company SquareMind has raised $18 million, including previously undisclosed pre-Series A financing, as it prepares to launch its robotic skin

Stay Updated with AI Insider

Get the latest AI funding news, market intelligence, and industry insights delivered to your inbox weekly.

$ 0 M

Seed round tracked

Gitar — Code Validation

Get the Weekly Briefing

Funding analysis, market intelligence, and industry trends delivered to your inbox every week.

Need bespoke intelligence?

Our team combines real-time data with decades of sector experience to guide your decisions.

Subscribe today for the latest news about the AI landscape