DeepSeek has worse than average accuracy after evaluation

Bùi Đức |

AI chatbot DeepSeek produced a lot of misinformation, ranking near the bottom when compared to the capabilities of Western competitors after NewsGuard's evaluation and testing.

Reuters cited a January 29 report by NewsGuard showing that China's AI chatbot DeepSeek only achieved 17% accuracy in providing information, ranking 10/11 when compared to Western competitors such as OpenAI's ChatGPT or Google's Gemini.

The report found that the chatbot repeated false information 30% of the time, and provided vague or unhelpful answers 53% of the time when responding to news-related questions, resulting in a failure rate of up to 83%.

That's worse than the 62% average of Western competitors, raising doubts about the AI ​​technology that DeepSeek claims can match or surpass OpenAI at a fraction of the cost.

NewsGuard said it used the same 300 test questions it used to evaluate other Western chatbots, including 30 questions related to 10 pieces of misinformation circulating online. Test topics included the assassination of UnitedHealthcare CEO Brian Thompson last month and the crash of Azerbaijan Airlines Flight 8243.

The test also found that in three out of 10 questions, DeepSeek automatically inserted information related to China even though it was not asked about a China-related topic.

According to NewsGuard, when asked about the Azerbaijan Airlines plane crash, a topic unrelated to China, DeepSeek still incorporated and presented information related to Beijing into its answer.

“The significance of DeepSeek’s breakthrough is not that it answers Chinese news accurately, but that it can answer any question at 1/30th the cost of comparable AI models,” said Gil Luria, an analyst at D.A. Davidson.

NewsGuard said that, like other AI models, DeepSeek is vulnerable to being exploited to spread fake news, especially when responding to questions from users who are deliberately trying to create and spread misinformation.

Bùi Đức
TIN LIÊN QUAN

Doubts surround the capabilities of DeepSeek's low-cost AI

|

A report from financial advisory firm Bernstein questions the accuracy of DeepSeek's claim that its product is on par with OpenAI.

DeepSeek creates race to develop cheap AI in China

|

The emergence of low-cost AI DeepSeek-R1 not only puts pressure on the international technology market but also creates fierce competition in China.

World's billionaires suffer painful losses due to DeepSeek earthquake

|

China's DeepSeek AI earthquake caused the world's billionaires to lose $108 billion in just one day.

Students split shifts, returning home and driving tech cars during Tet

|

Tra Vinh - Tra Vinh Student Union operates technology cars in shifts to serve people during Tet, ensuring traffic safety.

Gold price update morning of January 31: Shocking increase, breaking 3-month peak

|

Gold price update on the morning of January 31: Domestic gold price stagnates during Lunar New Year holiday. World gold price increases sharply, reaching a three-month high.

Series of hundred billion dong road projects create development momentum in Thai Nguyen

|

A series of hundred-billion-dong roads being deployed in Thai Nguyen are about to be completed, which will help improve traffic infrastructure and make it easier for people to travel.

Accident between passenger bus and electric bicycle, 1 girl died

|

On Highway 13, through Binh Duong province, an accident occurred between an electric bicycle and a passenger bus, killing one person.

Queuing up to get fortune-telling at the ancient temple in Thai Binh

|

Thai Binh - On the second day of the Lunar New Year 2025, many people lined up to wait for fortune telling at the national relic Tien La Temple (Doan Hung Commune, Hung Ha District).

Doubts surround the capabilities of DeepSeek's low-cost AI

Anh Vũ |

A report from financial advisory firm Bernstein questions the accuracy of DeepSeek's claim that its product is on par with OpenAI.

DeepSeek creates race to develop cheap AI in China

Bùi Đức |

The emergence of low-cost AI DeepSeek-R1 not only puts pressure on the international technology market but also creates fierce competition in China.

World's billionaires suffer painful losses due to DeepSeek earthquake

Ngọc Vân |

China's DeepSeek AI earthquake caused the world's billionaires to lose $108 billion in just one day.