Leaked benchmark results purportedly from OpenAI's upcoming GPT-5 model show performance matching or exceeding human experts on graduate-level mathematics, physics, and legal reasoning tasks, sparking intense debate about AI capabilities.

Benchmark Results

The leaked evaluation results, first reported by The Information and partially confirmed by OpenAI insiders, show dramatic improvements over GPT-4.

Industry Reaction

AI researchers are divided between those who see the results as a clear path toward artificial general intelligence and skeptics who argue benchmarks do not capture the full spectrum of human reasoning. OpenAI has declined to comment on the leak but is expected to announce GPT-5 at a May event.