New research discovers AI is increasingly speculative, posing unpredictable risks

Cát Tiên (THEO INDIANEXPRESS) |

Although fraud is still simple, OpenAI still warns about the risk of AI changing nature when taking on important tasks.

A new study from OpenAI and Apollo Research is attracting a lot of attention when it shows that advanced AI models, including Google Gemini, Claude Opus and OpenAI o3, can participate in behaviors that are described as quite vertical.

In other words, they have the ability to fake to follow human requirements while secretly pursuing their own goals.

In an official blog post, OpenAI admitted: The findings show that the plot is not just a theoretical concern, we have seen signs that this problem is starting to appear in many modern AI models.

Simple but worrying deception

There is currently no evidence that AI can immediately switch to serious harmful behavior. However, research warns that the nature of AI could change as it takes on more important tasks in the future.

At the present time, these tricks mainly consist of simple deception, such as pretending to have completed a task but in reality not.

A typical example published by the research team is the OpenAI o3 model. When required to take a chemical test, the system calculated that if the results were too good, it could be prevented from being implemented. Therefore, the model chooses to answer incorrectly partly to show that it is "poor" and has a purpose.

Definableji and methods of testing

This study defines covert actions when AI conceals or distorts information related to tasks. Tests were conducted on multiple models such as: OpenAI o3 and o4-mini, Gemini 2.5-pro, Claude Opus-4. Notably, GPT-5 was not included because the research was completed before the launch of this model.

Principles of limiting "AI's tricks"

To minimize risks, OpenAI has four safety principles, emphasizing the requirement that AI must not perform covert or fraudulent actions, but must proactively share theories, intentions and actions with humans.

In most scenarios, AI complies with the principle, but some cases still record intentional ineffectiveness or try to bypass the encryption test.

When the research team tried to change the context, asserting that the model was representing ChatGPT users, the fraud rate dropped significantly, showing that AI could regulate behavior based on how role- definable is.

Warning for the future

This study rings the alarm that although AI is currently only at the basic level of deception, if not strictly controlled, " fabricated" behaviors can develop into a real threat as AI technology is increasingly associated with life and important systems.

Cát Tiên (THEO INDIANEXPRESS)
RELATED NEWS

The world's first gold-plated iPhone Air and iPhone 17 Pro Max cost billions of dong

|

The iPhone Air and iPhone 17 Pro Max gold plated, comprehensively personalized with limited quantity have been launched in Vietnam.

New generation AI faces challenges in accuracy and bias in search

|

International research shows that new generation AI tools are still inaccurate, providing unfavorable and baseless information, posing great challenges for practical application.

Microsoft brings AI to Office

|

Microsoft officially integrates free Copilot Chat into Word, Excel, PowerPoint, Outlook and OneNote, helping businesses increase productivity without additional costs.

The Ministry of Education proposes a direction for handling students who heads up and knock down homeroom teacher

|

The Ministry of Education and Training affirmed that the incident of students at Dai Kim Secondary School, Hanoi, who trampled and knocked down their homeroom teacher had a negative impact on the educational environment.

The new look of Ho Chi Minh Park in Lao Cai border area

|

Lao Cai - Ho Chi Minh Park has taken shape after many months of investment of tens of billions of VND to upgrade and renovate.

Miss H'Hen Nie gives birth to her first child, the photographer's husband bursts into tears of happiness

|

On the morning of September 20, Miss Universe Vietnam 2017 H'Hen Nie happily announced that she had given birth to her first daughter with her husband - photographer Tuan Khoi.

Appointment and assignment of personnel in Ho Chi Minh City, Dien Bien, Lang Son

|

From September 15-19, in the provinces/cities: Ho Chi Minh City, Khanh Hoa, Dien Bien, Lang Son... decisions on election, appointment, and appointment of personnel will be implemented.

US imposes fee of 100,000 USD for H-1B visa, opens million-dollar immigration yellow card

|

US President Donald Trump signed an executive order on September 19 to impose an H-1B visa application fee of $100,000.

The world's first gold-plated iPhone Air and iPhone 17 Pro Max cost billions of dong

NGUYỄN ĐĂNG |

The iPhone Air and iPhone 17 Pro Max gold plated, comprehensively personalized with limited quantity have been launched in Vietnam.

New generation AI faces challenges in accuracy and bias in search

Cát Tiên (THEO INDIANEXPRESS) |

International research shows that new generation AI tools are still inaccurate, providing unfavorable and baseless information, posing great challenges for practical application.

Microsoft brings AI to Office

Cát Tiên (THEO hindustantimes) |

Microsoft officially integrates free Copilot Chat into Word, Excel, PowerPoint, Outlook and OneNote, helping businesses increase productivity without additional costs.