ChatGPT has introduced a new advanced voice mode, offering users access to GPT-4o’s highly realistic audio responses.
The update follows criticism that a voice in the first voice mode resembled that of the character played by Scarlett Johansson in the film Her.
As a result, the initial launch was delayed from May to the end of July.
Currently, the new voice mode is available to a small group of ChatGPT Plus users.
Expansion and Functionality
The new voice feature is expected to be available to all users by the end of the year. Unlike the previous version, which converted spoken questions into text before responding, the updated voice mode processes audio input directly with GPT-4o, so voice interactions no longer require an intermediate text conversion.
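The difference between the two designs can be illustrated with a toy sketch. Everything in it is hypothetical: the Audio class and the transcribe, text_model, synthesize, and direct_voice_mode functions are placeholders invented for illustration, not OpenAI's API or implementation. It only shows why removing the text step matters: anything a transcript cannot carry, such as tone of voice, is lost before the model sees it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Audio:
    """Toy stand-in for a spoken clip: the words plus a non-verbal cue."""
    words: str
    tone: str  # e.g. "cheerful", "hesitant"


def transcribe(clip: Audio) -> str:
    # Speech-to-text keeps only the words; the tone is discarded here.
    return clip.words


def text_model(prompt: str) -> str:
    # Placeholder for a text-only language model.
    return f"reply to '{prompt}'"


def synthesize(text: str) -> Audio:
    # Text-to-speech has no tone information to work with, so it uses a default.
    return Audio(words=text, tone="neutral")


def cascaded_voice_mode(clip: Audio) -> Audio:
    """Earlier design as described: audio -> text -> text -> audio."""
    return synthesize(text_model(transcribe(clip)))


def direct_voice_mode(clip: Audio) -> Audio:
    """Updated design as described: one model maps audio to audio."""
    # The placeholder model sees the whole clip, so it can react to the tone.
    return Audio(words=f"reply to '{clip.words}'", tone=clip.tone)


question = Audio(words="what's the weather", tone="cheerful")
print(cascaded_voice_mode(question).tone)  # neutral
print(direct_voice_mode(question).tone)    # cheerful
```

In this sketch both pipelines produce the same words, but only the direct one can carry the speaker's tone through to the response.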
Preset Voices and Rollout
The advanced voice mode includes four preset voices: Juniper, Breeze, Cove, and Ember. These voices were developed in collaboration with professional actors.
The feature is being rolled out gradually, with users in the alpha group receiving a notification in the ChatGPT app.
They will also receive an email explaining how to use the new feature.
The GPT-4o voice feature has undergone testing in 45 languages by over 100 external experts to ensure safety and performance.
A detailed report on these tests is expected in August.