OpenAI is questioned about how to collect AI training data

HẠO THIÊN (THEO techcrunch) |

OpenAI is said to be requesting third-party contractors to provide actual work products that have been performed.

OpenAI bi dat cau hoi ve cach thu thap du lieu huan luyen AI. Do hoa: AI
OpenAI is questioned about how to collect AI training data. Graphics: AI

According to Techcrunch, this approach aims to create high-quality training data for AI, but also raises concerns about legal and security risks.

According to Wired's report, OpenAI and the training data company Handshake AI are asking contractors to upload actual projects they have done in the past and present. These data are used to train and evaluate new artificial intelligence models, aiming to automate many office jobs.

Internal documents show that OpenAI requires contractors to specifically describe the tasks they have undertaken in previous jobs, and provide examples of "the actual work they actually did". The files requested to be uploaded are not summaries but complete products, which may include Word, PDF, PowerPoint, Excel, image or data warehouse documents.

OpenAI guides contractors to delete proprietary information and personal identification data before sending, and suggests using the "Superstar Scrubbing" tool integrated in ChatGPT to clean data.

Intellectual property lawyer Evan Brown said that letting contractors decide for themselves what confidential information contains poses a "very high risk" for AI laboratories.

According to Techcrunch, before these concerns, OpenAI's spokesperson declined to comment.

HẠO THIÊN (THEO techcrunch)
RELATED NEWS

OpenAI and SoftBank pour 1 billion USD to expand Stargate project

|

OpenAI and SoftBank poured a total of 1 billion USD into SB Energy, expanding the electricity infrastructure and data center for the Stargate project.

OpenAI bets on sound AI, preparing for personal devices without screens

|

OpenAI is promoting a strategy to develop sound artificial intelligence, not just stopping at improving ChatGPT's conversational capabilities.

OpenAI silently creates a new AI device with Jony Ive

|

OpenAI is said to be cooperating with Jony Ive to develop mysterious AI devices in pen form, prioritizing natural interaction with ChatGPT.

Live football U23 Japan vs U23 China in the AFC U23 Championship final

|

Live broadcast of the match between U23 Japan and U23 China in the 2026 AFC U23 Championship final, taking place at 10:00 PM today (January 24).

People release buffaloes and crawl to the edge of the Vinh Hao - Phan Thiet expressway

|

Lam Dong - From the locations where the barbed wire fence was cut, people released buffaloes and cows into the corridor next to the Vinh Hao - Phan Thiet expressway.

It's a bit of a bit of a bit of a bit of a bit of a bit.

Vinh Chau purple onions in season, rural workers have more jobs

|

Can Tho - Vinh Chau is not only famous for its purple onion fields but also creates stable livelihoods for thousands of local workers.

US economy accelerates in 2025, opening a brighter growth period in 2026

|

The US economy recorded strong growth in 2025, GDP broke through in the third quarter and consumption maintained its momentum, creating a positive foundation for the outlook for 2026.

How Thanh Nhan deceived the U23 Korean goalkeeper

|

Striker Thanh Nhan took the decisive penalty to bring home a bronze medal for U23 Vietnam at the 2026 AFC U23 Championship.

OpenAI and SoftBank pour 1 billion USD to expand Stargate project

Cát Tiên |

OpenAI and SoftBank poured a total of 1 billion USD into SB Energy, expanding the electricity infrastructure and data center for the Stargate project.

OpenAI bets on sound AI, preparing for personal devices without screens

HẠO THIÊN (THEO techcrunch) |

OpenAI is promoting a strategy to develop sound artificial intelligence, not just stopping at improving ChatGPT's conversational capabilities.

OpenAI silently creates a new AI device with Jony Ive

Cát Tiên |

OpenAI is said to be cooperating with Jony Ive to develop mysterious AI devices in pen form, prioritizing natural interaction with ChatGPT.