Silicon Valley bets on environments to train AI agents

HẠO THIÊN (via TechCrunch)

Today's experimental AI agent products, such as ChatGPT Agent or Comet, still have many limitations.

Silicon Valley bets on environments to train AI agents. Graphic: Hao Thien

The proposed solution is to build environments: simulated spaces in which AI agents practice multi-step tasks and are trained with reinforcement learning (RL). Much as labeled data once drove the chatbot era, RL environments are becoming an important ingredient for the next generation of AI.

Venture capital funds, startups, and AI labs are all in this race. Andreessen Horowitz noted that all the major labs are building RL environments in-house while also looking for external partners.

New companies such as Mechanize and Prime Intellect have sought large investments to build environment platforms, while major data-labeling companies such as Scale AI, Surge, and Mercor have also shifted their investment toward environments to avoid being left behind.

Several figures show how hot the trend is: Anthropic is reportedly considering spending more than 1 billion USD on RL environments; Surge generated 1.2 billion USD in revenue last year from its work with OpenAI, Google, and Meta; and Mercor is valued at 10 billion USD.

At its core, an RL environment simulates how an AI agent operates software. For example, an agent is asked to make a purchase on Amazon and is scored on the outcome. The task sounds simple, but the environment must be sophisticated enough to capture unexpected agent behavior. This makes RL environments much more complex and expensive to build than static datasets.
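To make the idea concrete, here is a minimal sketch of such an environment in Python. It is an illustration only, not any lab's or vendor's actual system: the class name `ToyShoppingEnv`, the catalog, the action strings, and the scoring rule are all hypothetical. The agent sends text actions, the environment tolerates unexpected ones, and a reward is given only for the final outcome.

```python
from dataclasses import dataclass, field

# Hypothetical catalog; item names and prices are invented for illustration.
CATALOG = {"socks": 5.0, "laptop": 900.0}


@dataclass
class ToyShoppingEnv:
    """Toy stand-in for an RL environment; all names here are assumptions."""
    target_item: str = "socks"
    budget: float = 20.0
    max_steps: int = 5
    cart: list = field(default_factory=list)
    steps: int = 0

    def reset(self):
        """Start a new episode and return the first observation."""
        self.cart = []
        self.steps = 0
        return self._observe()

    def _observe(self):
        return {"task": f"buy {self.target_item}", "cart": list(self.cart)}

    def step(self, action):
        """Apply one action string; return (observation, reward, done)."""
        self.steps += 1
        verb, _, arg = action.partition(" ")
        if verb == "add" and arg in CATALOG:
            self.cart.append(arg)
        elif verb == "checkout":
            total = sum(CATALOG[item] for item in self.cart)
            success = self.cart == [self.target_item] and total <= self.budget
            # The agent is rated on the result, not on the path it took.
            return self._observe(), (1.0 if success else 0.0), True
        # Unexpected actions are tolerated: no state change, no reward.
        done = self.steps >= self.max_steps
        return self._observe(), 0.0, done


def run_episode(env, actions):
    """Replay a fixed list of actions and return (final reward, done)."""
    env.reset()
    reward, done = 0.0, False
    for action in actions:
        _, reward, done = env.step(action)
        if done:
            break
    return reward, done
```

In this sketch, `run_episode(ToyShoppingEnv(), ["add socks", "checkout"])` earns the full reward, while an agent that wanders off task or buys the wrong item earns none. A realistic environment would have to model a far larger space of states and mistakes, which is where the cost comes from.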

Although it is still debated whether RL environments can scale, Silicon Valley regards them as one of the important paths to advancing AI, hoping to repeat the data-labeling wave that led to ChatGPT.
