Concerns about some of OpenAI's new AI models

Quang Minh (THeo tech crunch) |

OpenAI's new AI models are facing serious problems as more "vehicles" occur, AI fabricates untrue information.

According to OpenAI's technical report, the newly launched o3 and o4-mini models have a higher rate of creating misinformation than previous models of reasoning such as o1, o1-mini and o3-mini, as well as traditional AI models such as GPT-4o.

In the PersonQA internal test, o3 generated false information in 40% of the questions, double the o1 and o3-mini. More worryingly, o4-mini is even wrong in 48% of cases.

OpenAI admitted that it is unclear why new models are more "alargic". Initial theories suggest that the current extended learning method may accidentally amplify the problem.

However, o3 still shows superiority in some areas such as programming and toancture. Many development groups are testing the integration of o3 into the workplace, but warn that AI sometimes creates corrupt leads or leads to non-existent information.

The problem of "alarm" makes it difficult for businesses, especially in fields that require high accuracy such as law, to apply AI. A good solution is to integrate a web search function, such as GPT-4o, which currently has reached 90% accuracy on some tests.

OpenAI said it is continuing research to reduce the phenomenon of "alarm" on all of its AI lines. In the context of the entire AI industry shifting to theoretical models, "vehicle" control is becoming an urgent challenge.

Quang Minh (THeo tech crunch)
TIN LIÊN QUAN

OpenAI introduces 2 new AI models at the same time

|

Just two days after announcing GPT-4.1, OpenAI continues to surprise the technology world by introducing two new AI models at the same time: o3 and o4-mini.

Openai may be silently developing social networks

|

According to a new report from The Verge, Openai is building an internal prototype for a social networking platform similar to X (formerly Twitter).

Netflix tests OpenAI-powered search feature

|

Netflix is testing an OpenAI-powered search function.

Overall review of the display and conservation of national treasures

|

Deputy Prime Minister Mai Van Chinh requested to review and evaluate the overall display and conservation of national treasures nationwide.

The Ministry of Public Security proposes to remove the regulation on not driving for more than 48 hours a week

|

The Ministry of Public Security proposed to remove the regulation that the driving time of car drivers must not exceed 48 hours per week; but still maintain the regulation that continuous driving cannot exceed 4 hours.

Traders at Dong Xuan market operate at a low level when eliminating contract tax

|

Hanoi - Traders at Dong Xuan market are operating at a low level, facing many difficulties in implementing electronic invoices when selling goods.

Proposal to keep the death penalty for the crime of manufacturing and trading counterfeit medicine

|

There are opinions suggesting keeping the death penalty for the crime of illegally transporting drugs, the crime of producing and trading counterfeit goods as medicine.

Two ways to look up 10th grade exam scores in Hanoi in 2025

|

Hanoi - The 10th grade entrance exam scores of more than 102,000 candidates will be announced on the Electronic Information Portal of the Department of Education and Training and the City's Early Admission Portal.

OpenAI introduces 2 new AI models at the same time

QUANG MINH (THEO engadget) |

Just two days after announcing GPT-4.1, OpenAI continues to surprise the technology world by introducing two new AI models at the same time: o3 and o4-mini.

Openai may be silently developing social networks

QUANG MINH (THEO engadget) |

According to a new report from The Verge, Openai is building an internal prototype for a social networking platform similar to X (formerly Twitter).

Netflix tests OpenAI-powered search feature

QUANG MINH (THEO engadget) |

Netflix is testing an OpenAI-powered search function.