Tag: UC Berkeley

AI Benchmarks Reward Hacking UC Berkeley AI Safety BenchJack AI Agents

The Betrayal of AI Report Cards: The Secret of the AI That Got 'Straight A's' Without Solving a Single Problem

A UC Berkeley research team has exposed vulnerabilities in benchmarks, the key metrics for AI performance. We explore the reality of 'reward hacking,' where AI receives perfect scores without actually solving problems, and discuss countermeasures.

May 6, 2026

Keep Reading