OpenAI Says GPT-5 Stacks Up To Humans In A Wide Range Of Jobs
OpenAI has introduced a new benchmark, GDPval, designed to test how well AI compares with human professionals across major industries. The company says GPT-5 already stacks up to human experts in a wide range of jobs, a result that raises big questions about the future of work and offers one of the clearer signals yet of OpenAI's progress toward artificial general intelligence (AGI).
Image Credits: sompong_tom / Getty Images
A Benchmark For Human-Level AI
The GDPval benchmark is an early attempt to measure whether AI models can deliver the same quality of work as experts. According to OpenAI, GPT-5 and Anthropic’s Claude Opus 4.1 already approach the level of seasoned professionals in some areas.
This doesn’t mean AI will instantly replace humans. OpenAI is careful to note that the test currently covers only a small slice of real-world tasks. Still, it highlights how quickly AI systems are advancing toward economically valuable work.
How The Test Works
GDPval focuses on the nine industries that contribute the most to U.S. gross domestic product, including healthcare, finance, manufacturing, and government. Within those sectors, it evaluates 44 occupations, ranging from software engineers to nurses to journalists.
In its first version, GDPval-v0, OpenAI had experienced professionals compare AI-generated reports with reports written by humans. For instance, investment bankers were asked to review competitor landscape analyses produced by both AI and humans, then pick which was stronger.
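A head-to-head comparison like this boils down to simple arithmetic: count how often graders picked the AI report and divide by the number of comparisons. The Python sketch below illustrates that calculation only. The function name, the "ai" and "human" labels, and the sample verdicts are hypothetical and not taken from OpenAI's materials, and OpenAI's actual scoring may differ.

```python
def ai_win_rate(verdicts):
    """Return the fraction of comparisons in which the grader preferred the AI report."""
    allowed = {"ai", "human"}  # hypothetical labels, assumed for this sketch
    if not verdicts:
        raise ValueError("need at least one verdict")
    bad = [v for v in verdicts if v not in allowed]
    if bad:
        raise ValueError(f"unexpected verdict labels: {bad}")
    # Each verdict records which report a grader judged to be stronger.
    return sum(v == "ai" for v in verdicts) / len(verdicts)


# Made-up verdicts for illustration only; these are not GDPval results.
sample_verdicts = ["ai", "human", "human", "ai", "human"]
print(f"AI preferred in {ai_win_rate(sample_verdicts):.0%} of comparisons")
```

A rate near 50% in a setup like this would mean graders could not reliably tell which report was stronger.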
Why It Matters
By framing GPT-5’s performance against GDP-driving industries, OpenAI is signaling that AI is inching closer to being useful at the highest levels of professional work. While some CEOs predict widespread job disruption, OpenAI stresses that these benchmarks are still narrow and experimental.
Even so, if GPT-5 can hold its own against trained experts on some of these tasks, that points to a future where AI doesn’t just assist with small tasks but plays a central role in knowledge work.
The Road To AGI
OpenAI’s mission has always been tied to developing AGI, meaning AI that can outperform humans at most economically valuable work. The GDPval benchmark offers a snapshot of progress toward that goal.
For now, GPT-5’s abilities serve more as a preview than a finished product. But if early results are any indication, AI may soon be a true collaborator across industries rather than just a tool.