OpenAI Building New Voice Model Ahead of AI Device Launch: Report

Last year, OpenAI acquired ex-Apple design chief Jony Ive's AI hardware startup io for $6.5 billion. The post OpenAI Building New Voice Model Ahead of AI Device Launch: Report appeared first on Analytics India Magazine.

OpenAI Building New Voice Model Ahead of AI Device Launch: Report

OpenAI is working on a new AI audio model architecture, which is slated to release in the first quarter of this year, reported The Information.

This model is also being developed for the new voice-based device the company is working on, added the report. 

Furthermore, OpenAI has restructured and brought together several engineers and researchers in a team to build the new AI model. It is expected to bring significant improvements in accuracy, emotion, more natural responses while also being able to handle interruptions like a real conversation partner.

Last year, OpenAI deepened its push into hardware by partnering with former Apple design chief Jony Ive. In May last year, Ive’s startup io, focused on building hardware products around artificial intelligence, was acquired by the AI giant in a nearly $6.5 billion all-stock deal. 

It also marked the next phase of a two-year collaboration between Ive’s design firm LoveFrom and OpenAI, to lead design for the AI giant’s future hardware and software.

In July last year, OpenAI ramped up its hiring across multiple positions in the consumer hardware sector, with positions open for hardware systems product designer, to help build the ‘next generation of world’s most innovative mobile devices’.

According to a Wall Street Journal report from last year, Altman and Ive hinted that these AI companion devices would be fully aware of the user’s surroundings while offering an ‘unobtrusive’ experience. The report added that these devices would be standalone units, and will be released later this year. 

Last August, OpenAI made its Realtime API generally available with new features and released its “most advanced” speech-to-speech model, gpt-realtime. 

The company claimed that gpt-realtime is better at interpreting system messages and developer prompts. 

This includes reading disclaimer scripts word-for-word on a support call, repeating back alphanumerics, or switching seamlessly between languages mid-sentence. 

The post OpenAI Building New Voice Model Ahead of AI Device Launch: Report appeared first on Analytics India Magazine.

What's Your Reaction?

like

dislike

love

funny

angry

sad

wow