AI Watch

News (43)

  • News article

A recent JRC paper explores AI benchmarks, which are considered essential tools for evaluating the performance, capabilities, and risks of AI models. Through a comprehensive literature review, the paper identifies key shortcomings of AI benchmarking, as well as policy approaches that could mitigate them.

  • 3 min read