OpenAI bets on sound AI, preparing for personal devices without screens

HẠO THIÊN (THEO techcrunch) |

OpenAI is promoting a strategy to develop sound artificial intelligence, not just stopping at improving ChatGPT's conversational capabilities.

OpenAI dat cuoc lon vao AI am thanh va chuan bi cho thiet bi ca nhan khong man hinh. Do hoa: AI
OpenAI makes a big bet on sound AI and prepares for non-screen personal devices. Graphics: AI

According to The Information, in the past two months, OpenAI has merged many technical groups, products and research to comprehensively restructure audio models, in preparation for a personal device prioritizing voice communication, expected to be launched in about a year.

This move reflects the general trend of the technology industry, where sound is gradually becoming the central interface, replacing the dominant role of the screen. Voice assistants have appeared in more than one-third of households in the US through smart speakers. Many large technology corporations are also following this trend.

Meta recently added directional listening feature to the Ray-Ban smart glasses, using a multi-micro system to help users hear more clearly in noisy environments. Google is testing the feature of converting search results into audio-conference summaries. Meanwhile, Tesla integrates xAI's Grok chatbot into electric vehicles, allowing users to control many functions with natural voice.

Not only technology "giants", many startups are also pursuing the ambition to build non-screen AI devices. However, this path is not easy. Some products that have attracted attention such as Humane AI Pin or Friend AI necklace have faced failures or controversies related to privacy, showing the great risk of bringing sound AI into personal life.

However, this trend continues to be promoted. Some startups, including Sandbar and the company founded by Eric Migicovsky, are developing AI rings that allow users to chat directly through wearable devices, expected to be launched in 2026.

According to The Information, OpenAI's new sound model, expected to be launched in early 2026, will have a more natural voice, handle interruptions flexibly and can even "speak in parallel" with the user, creating a feeling like a real conversation. OpenAI is also said to be imagining a new device ecosystem, which may include glasses or smart speakers without screens, operating as a companion rather than a tool.

This strategy is associated with the sound-prioritized design orientation of Jony Ive - former Apple Design Director, who joined the hardware division of OpenAI after the 6.5 billion USD acquisition of the company io. He is said to want to reduce dependence on screens and see the audio interface as an opportunity to reshape how people interact with consumer technology in the future.

In that context, sound AI is no longer a supporting feature, but is being seen as the foundation for the next generation of personal devices, where voice becomes the new "control surface" of humans.

HẠO THIÊN (THEO techcrunch)
RELATED NEWS

Countdown Hue is stirred up by a vibrant sound and light performance

|

HUE - The Hue Countdown program opened with a sound and lighting performance, stirring up the entire large square.

Google Notebook expands learning capabilities with sound

|

Google Notebook adds a sound lecture mode of up to 30 minutes, helping users learn passively with a seamless reading voice and clear structure.

FPT's unusual sound detection AI model is protected in the US

|

Thanks to its novelty and high applicablebility, FPT's AI (artificial intelligence) unusual sound detection model is protected in the US.

Designer Duc Hung: For me, ao dai is perfectly beautiful, it needs to be promoted all over the world

|

In the program "Saturday Afternoon Coffee", designer Duc Hung shared his perspective on Vietnamese fashion in the context of cultural industry development.

Clean water brings many positive changes to rural people

|

Hanoi - After a few years of using clean water, the worries and hardships in daily life of people in Phong Trieu village, Phu Xuyen commune have been improved.

Central Highlands Regional General Hospital has not completed legal procedures, infrastructure is degraded

|

Dak Lak - Despite operating for nearly 7 years, the Central Highlands Regional General Hospital has not yet completed legal documents, while the infrastructure has seriously deteriorated.

Ho Chi Minh City uses 33 land plots to exchange for a series of key infrastructure projects

|

Ho Chi Minh City uses 33 land plots in the area to pay investors to implement projects in the form of BT (build - transfer) contracts.

Poland proposes expanding NATO fuel lines to deal with Russia

|

Poland and France are discussing expanding the NATO fuel pipeline network to areas near the Russian border to increase military logistics capabilities.

Countdown Hue is stirred up by a vibrant sound and light performance

PHÚC ĐẠT - NGUYỄN LUÂN |

HUE - The Hue Countdown program opened with a sound and lighting performance, stirring up the entire large square.

Google Notebook expands learning capabilities with sound

Cát Tiên |

Google Notebook adds a sound lecture mode of up to 30 minutes, helping users learn passively with a seamless reading voice and clear structure.

FPT's unusual sound detection AI model is protected in the US

NGUYỄN ĐĂNG |

Thanks to its novelty and high applicablebility, FPT's AI (artificial intelligence) unusual sound detection model is protected in the US.