Concerns about some of OpenAI's new AI models

Quang Minh (THeo tech crunch) |

OpenAI's new AI models are facing serious problems as more "vehicles" occur, AI fabricates untrue information.

According to OpenAI's technical report, the newly launched o3 and o4-mini models have a higher rate of creating misinformation than previous models of reasoning such as o1, o1-mini and o3-mini, as well as traditional AI models such as GPT-4o.

In the PersonQA internal test, o3 generated false information in 40% of the questions, double the o1 and o3-mini. More worryingly, o4-mini is even wrong in 48% of cases.

OpenAI admitted that it is unclear why new models are more "alargic". Initial theories suggest that the current extended learning method may accidentally amplify the problem.

However, o3 still shows superiority in some areas such as programming and toancture. Many development groups are testing the integration of o3 into the workplace, but warn that AI sometimes creates corrupt leads or leads to non-existent information.

The problem of "alarm" makes it difficult for businesses, especially in fields that require high accuracy such as law, to apply AI. A good solution is to integrate a web search function, such as GPT-4o, which currently has reached 90% accuracy on some tests.

OpenAI said it is continuing research to reduce the phenomenon of "alarm" on all of its AI lines. In the context of the entire AI industry shifting to theoretical models, "vehicle" control is becoming an urgent challenge.

Quang Minh (THeo tech crunch)
RELATED NEWS

OpenAI introduces 2 new AI models at the same time

|

Just two days after announcing GPT-4.1, OpenAI continues to surprise the technology world by introducing two new AI models at the same time: o3 and o4-mini.

Openai may be silently developing social networks

|

According to a new report from The Verge, Openai is building an internal prototype for a social networking platform similar to X (formerly Twitter).

Netflix tests OpenAI-powered search feature

|

Netflix is testing an OpenAI-powered search function.

Night patrols to maintain traffic order in the Capital ahead of the 14th Congress

|

Hanoi - Functional forces deploy closed patrols and controls in the area, handle violations, and ensure traffic safety to serve the 14th Party Congress.

U23 Vietnam and victory with confidence against U23 UAE

|

The U23 Vietnam team had a victory of class against U23 UAE.

U23 UAE coach proud despite stopping before U23 Vietnam

|

Despite stopping before U23 Vietnam after a 2-3 defeat in the quarter-finals of the 2026 AFC U23 Championship, U23 UAE coach Marcelo Broli still expressed pride.

Live football U23 Vietnam vs U23 UAE in the U23 Asian Cup quarter-finals

|

Live broadcast of the match between U23 Vietnam and U23 UAE in the quarter-finals of the 2026 AFC U23 Championship, taking place at 10:30 PM today (January 16).

Prosecuting the accident on the highway that killed 4 people in Thanh Hoa

|

Thanh Hoa - The Police Agency has initiated a criminal case related to a particularly serious traffic accident on the highway that killed 4 people.

OpenAI introduces 2 new AI models at the same time

QUANG MINH (THEO engadget) |

Just two days after announcing GPT-4.1, OpenAI continues to surprise the technology world by introducing two new AI models at the same time: o3 and o4-mini.

Openai may be silently developing social networks

QUANG MINH (THEO engadget) |

According to a new report from The Verge, Openai is building an internal prototype for a social networking platform similar to X (formerly Twitter).

Netflix tests OpenAI-powered search feature

QUANG MINH (THEO engadget) |

Netflix is testing an OpenAI-powered search function.