Microsoft AI Launches Multimodal Foundation Models to Expand In-House AI Capabilities

Microsoft AI has announced the release of three new multimodal foundation models designed to generate text, voice, and images, marking a continued expansion of its internal AI stack. The models (MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2) were developed by the MAI Superintelligence team led by Microsoft AI CEO Mustafa Suleyman.

The transcription model supports 25 languages and delivers faster performance than existing Azure offerings, while the voice model enables rapid audio generation and custom voice creation. The image model, previously introduced via MAI Playground, is now being deployed more broadly across Microsoft Foundry.

Leadership has positioned the release as part of a broader push toward human-centered AI design and cost-efficient model deployment, as Microsoft strengthens its position in the competitive multimodal AI landscape while maintaining its partnership with OpenAI.
