DeepSeek Data Controversy

News

Alibaba’s Qwen 2.5 AI Faces MAth ‘Cheating’ Allegations Over Contaminated Benchmark Data

A new study alleges Alibaba's Qwen2.5 AI model cheated on key math benchmarks by memorizing test data, raising serious questions about AI benchmark integrity.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Feedback

News

Trending now