arXiv (a famous pre-publication research archive in fields such as computer science, mathematics and physics) is strengthening measures to limit the abuse of artificial intelligence (AI) in scientific papers.
The new move comes in the context of growing concerns about the quality of research created or overly supported by the Large Language Model (LLM).
arXiv is a popular platform for scientists to publish research before it is officially reviewed, and also becomes an important data source reflecting global research trends.
According to Thomas Dietterich - President of computer science at arXiv, if an article shows that the author does not check the content created by AI, the platform will apply strict handling measures.
These evidences may include non-existent references due to AI "ghosts", conversations with chatbots inserted incorrectly into articles, or errors showing content directly copied from language models without verification.
According to the new regulations, violating authors may be banned from posting on arXiv for one year. After this time, all subsequent studies that want to appear on the platform must be accepted in advance by a reputable review forum.
However, arXiv emphasized that this is not a ban on the use of AI in scientific research. According to Dietterich, scientists can still use large language models as support tools but must be "fully responsible" for the published content, regardless of how it is created.
This means that if the author directly copies faulty paragraphs, biased content, misleading references or misleading information from AI, they still have to be responsible as with any other academic error.
Mr. Dietterich also said that before issuing a penalty, arXiv's coordinators must report the incident and the specialized presiding judge must confirm the evidence of violation. The author being handled still has the right to appeal for decision.
In recent years, the number of low-quality articles appearing on arXiv has tended to increase sharply along with the popularity of generative AI tools. To limit this situation, the platform has required first-time publishers to be certified by a reputable author in the research community.
After more than 20 years of management by Cornell University, arXiv is now also transforming into an independent non-profit organization to mobilize more resources to improve the censorship system and maintain academic quality.
Some recent review studies show that fake citations in the field of biomedicine are increasing, likely related to the abuse of AI modeling for writing. This raises concerns that AI could reduce the reliability of scientific works if not tightly controlled.