AudioShake Launches Breakthrough AI Model to Separate Overlapping Voices in Audio

Press Release · San Francisco, CA, USA · March 11 2025

This article is at least a year old

AudioShake, the leader in AI sound separation technology, today announced the launch of Multi-Speaker, a powerful new model designed to separate an unlimited number of speakers into individual audio tracks. It is the first model of its kind to achieve multi-speaker separation with high-resolution audio opening up new creative uses for voice AI, film, podcasts, UGC, and TV content.

Multi-Speaker represents a significant technical achievement, addressing one of the most persistent challenges in audio: overlapping speech. Multi-Speaker leverages AudioShake’s proprietary AI technology to handle complex audio environments–including crowd dialogues, panel discussions, and fast-paced interviews–and separate them into individual speaker streams. This model allows users to easily isolate individual speakers to improve transcription and captioning accuracy, enable more precise editing workflows, isolate voice for speech AI tasks, and clean up overlapping dialogue for dubbing and localization.

Click to play this video from YouTubeYouTube’s privacy policy

“With the launch of Multi-Speaker, we’re pushing the boundaries of what’s possible in sound separation,” said Jessica Powell, CEO of AudioShake. “This model is designed for any professional dealing with complicated audio mixes—whether in broadcasting, film, or even transcription. Multi-Speaker makes it easier than ever to work with voices that were previously impossible to isolate.”

Fabian-Robert Stotter, AudioShake’s Head of Research, emphasized how the new model was designed to handle real-world scenarios: “Separating multiple voices in overlapping situations is one of the most diﬃcult challenges in audio separation. Our team worked to create a solution that is not only robust but accurate, even in highly challenging environments.”

The Multi-Speaker model represents a significant advancement for professionals in the media and content industries. By providing a powerful tool for separating overlapping voices, it enhances both workflow eﬃciency and audio clarity for uses including:

Media & Entertainment: Achieve cleaner dialogue tracks, even in chaotic soundscapes, enhancing the overall listening experience for audiences.
Localization & Dubbing: Translators and voice-over artists can work with precise, isolated speech tracks, enabling more accurate and natural dubbing, especially in fast-paced or overlapping dialogue scenarios.
Transcription & Captioning Services: Provide clearer and more accurate transcriptions of conversations for journalism, accessibility, and automated summarization purposes.
Live Broadcasting & Events: Broadcasters can extract distinct voices for clearer speech during interviews, sports commentary, and panel discussions, improving audience engagement and understanding.
AI Voice Synthesis & Research: Enhanced separation allows for more realistic and natural-sounding AI-generated voices, improving user interactions and applications in voice recognition and customer service.

AudioShake’s Multi-Speaker technology is empowering new workflows for companies like Bridging Voice and Wondercraft. Bridging Voice used AudioShake’s speaker separation technology to isolate the voices of ALS patients and feed them into voice cloning models built by Eleven Labs. These voices were then used for communications technologies for patients to “speak” in their actual voice, after they’d lost the ability to talk on their own. Wondercraft integrated AudioShake’s Multi-Speaker into its audio studio so users could separate generated podcasts from NotebookLM into distinct speaker tracks, giving them more control over the conversation and final edit.

Multi-Speaker is now available through our web-based platform and API, enabling seamless integration into existing workflows. For inquiries or to experience Multi-Speaker firsthand, reach out to info@audioshake.ai.

This is a press release which we link to from Podnews, our daily newsletter about podcasting and on-demand. We may make small edits for editorial reasons.

The latest...

Loading
.
.
.

AudioShake Launches Breakthrough AI Model to Separate Overlapping Voices in Audio

The latest...

Get a global view on podcasting and on-demand with our daily news briefing