AI is increasingly powerful but still vulnerable to security breaches

Cát Tiên |

AI is increasingly powerful but is still easily compromised by security barriers, causing many chatbots to be exploited to spread misinformation and dangerous content.

Technology companies such as OpenAI, Google or Anthropic are investing heavily in protection measures to prevent artificial intelligence (AI) from being exploited for dangerous purposes.

However, reality shows that these safety barriers are still continuously overcome in many unexpected ways.

Recently, researchers in Italy discovered that they can deceive 31 AI systems with metaphorical language and even with "poetry". Specifically, when a request is written in the form of poetry, the chatbot can ignore the control mechanism to provide instructions for making bombs or causing serious damage.

According to experts, this shows that many current protection measures operate more like "reminders" than real control barriers.

Matt Fredrikson, Professor of Computer Science at Carnegie Mellon University (USA), said that people with bad intentions often do not need too much effort to overcome the system.

The "breakdown" of AI, also known as jailbreak, usually takes place by inserting special commands into the chatbot to make the system ignore the rules that have been trained before.

Security vulnerabilities are causing concern among researchers, especially as AI is increasingly proficient in detecting software vulnerabilities, creating fake content and spreading misinformation.

According to Anthropic, the company's technology has been exploited in international cyberattacks. Meanwhile, AI models may also be forced to create fake news campaigns with images, hashtags and content specifically designed for each social media platform.

Last month, cybersecurity company LayerX said it could get Claude of Anthropic to support cyberattacks simply by saying it was conducting a "penetration test", which is an activity simulating a controlled cyberattack to check if a computer system, website or internal network has any security vulnerabilities.

This raises concerns that hackers may use AI to steal data from businesses and government agencies.

Although AI companies are constantly patching bugs and adding new protections, experts believe that this race is very difficult to stop. When a vulnerability is fixed, new barrier-breaking methods quickly appear.

The risk is even greater with open-source AI models, where users can self-edit the system and remove security restrictions. According to Noam Schwartz, CEO of AI security company Alice (headquartered in New York), removing safety barriers was once very complicated but can now even be done right on the phone.

Cát Tiên
RELATED NEWS

Nghe An brings AI into the public sector from provincial to commune levels

|

Nghe An - The conference to introduce AI applications in the public sector was held at 132 locations, aiming to promote digital transformation in the province.

Meta expands AI features for Ray-Ban Display glasses

|

Meta expanded the AI feature for Ray-Ban Display glasses with the ability to input text by hand gestures, support messaging, positioning and mixed reality recording.

Generative AI forces arXiv to apply stricter regulations

|

Generative AI has forced arXiv to tighten regulations on posting articles, after more and more studies containing fake quotes and unverified content have appeared.

Gold ring prices remain unchanged, stores do not sell separate products

|

On May 18, gold prices went sideways for 3 consecutive days, gold shops traded smoothly in both directions.

Musician with billion-view song Nguyen Van Chung: I am too used to being stolen of intellectual property

|

Musician Nguyen Van Chung once had a hit song that brought the network operator 1.7 billion VND, but he only received 30 million VND in "consolation" money.

Draft Report of the Executive Committee of the Vietnam General Confederation of Labor (XIII term) at the XIV Congress of the Vietnam Trade Union, term 2026 - 2031

|

Building a comprehensively strong Vietnam Trade Union; focusing on representing, caring for, and protecting union members and workers; promoting the pioneering role, spirit of innovation and creativity, contributing to realizing the aspiration to build a rich, prosperous, civilized, and happy country.

Nghe An brings AI into the public sector from provincial to commune levels

QUANG ĐẠI |

Nghe An - The conference to introduce AI applications in the public sector was held at 132 locations, aiming to promote digital transformation in the province.

Meta expands AI features for Ray-Ban Display glasses

Cát Tiên |

Meta expanded the AI feature for Ray-Ban Display glasses with the ability to input text by hand gestures, support messaging, positioning and mixed reality recording.

Generative AI forces arXiv to apply stricter regulations

Cát Tiên |

Generative AI has forced arXiv to tighten regulations on posting articles, after more and more studies containing fake quotes and unverified content have appeared.