Concerns about some of OpenAI's new AI models

Quang Minh (according to TechCrunch) |

OpenAI's new AI models face a serious problem: they "hallucinate" more often, meaning the AI fabricates untrue information.

According to OpenAI's technical report, the newly launched o3 and o4-mini models hallucinate at a higher rate than the company's previous reasoning models, such as o1, o1-mini and o3-mini, as well as traditional AI models such as GPT-4o.

On PersonQA, OpenAI's internal benchmark, o3 generated false information in 40% of the questions, roughly double the rate of o1 and o3-mini. More worryingly, o4-mini was wrong in 48% of cases.

OpenAI admitted that it is unclear why the new models hallucinate more. An initial hypothesis is that the reinforcement learning method currently in use may inadvertently amplify the problem.

However, o3 still outperforms other models in some areas, such as programming and mathematics. Many development teams are testing the integration of o3 into their workflows, but warn that the AI sometimes generates broken links or points to information that does not exist.

Hallucination makes it difficult for businesses to adopt AI, especially in fields that require high accuracy, such as law. One promising approach is to integrate web search: GPT-4o with web search currently reaches 90% accuracy on some tests.

OpenAI said it is continuing research to reduce hallucination across all of its AI model lines. As the entire AI industry shifts toward reasoning models, controlling hallucination is becoming an urgent challenge.

