GDPval-AA

A benchmark measuring AI performance on economically valuable knowledge work tasks.

GDPval-AA is an AI evaluation benchmark designed to measure how well models perform on tasks that have direct economic value in knowledge work contexts. Unlike academic benchmarks that test isolated capabilities, GDPval-AA attempts to capture real-world productivity impact, making it particularly relevant for enterprise AI applications.

Also known as

GDP validation benchmark