OpenAI Unveils Advanced AI Voice Generator Capable of Replicating Human Speech

AI in Healthcare

OpenAI has introduced Voice Engine, an AI tool adept at creating highly accurate human voice replicas. This innovation can transform text into speech mimicking any voice from a 15-second sample, with vast implications for accessibility and language translation, yet it raises concerns about potential misuse in scams or spreading disinformation.

“We recognize that generating speech that resembles people’s voices has serious risks, which are especially top of mind in an election year,” OpenAI said in a blog post.

The company recognized the necessity for significant adjustments with the broader availability of AI-generated audio, opting not to immediately release Voice Engine to the public. It proposed, for instance, discontinuing voice-based authentication for banking.

“Any broad deployment of synthetic voice technology should be accompanied by voice authentication experiences that verify that the original speaker is knowingly adding their voice to the service and a no-go voice list that detects and prevents the creation of voices that are too similar to prominent figures,” OpenAI said.

Currently, Voice Engine is in limited use with select partners under strict ethical guidelines, including consent for voice replication and disclosure of AI involvement to listeners. This tool can also translate a voice into multiple languages while retaining the original’s tone and accent, demonstrating its versatility. Alongside, OpenAI made ChatGPT accessible without signup, though without an account, certain features remain restricted.