ElevenLabs Outlines Model-Plus-Product Strategy as AI Audio Advances Toward Multimodal Future

At TechCrunch Disrupt 2025, Mati Staniszewski, co-founder and CEO of ElevenLabs, underscored his belief that core AI models in audio will become commoditized within a few years, even as they remain crucial today. He said the company’s researchers have already tackled key model-architecture challenges and will continue advancing proprietary audio technology in the near term, because high-quality voice interactions still rely on building models in-house.

Staniszewski said the industry is shifting toward multimodal systems, citing the growing ability to generate audio and video simultaneously or to integrate audio with large language models for conversational use. He noted that ElevenLabs intends to partner with other companies and open-source communities to combine its audio capabilities with external strengths, and framed the company’s long-term value as pairing advanced models with applied products, a strategy he compared to the hardware-software synergy that defined Apple’s success.

James Dargan

James Dargan is a writer and researcher at The AI Insider. He focuses on the AI startup ecosystem and writes about it in a tone accessible to the average reader.
