GPT-5 launched yesterday. 94.6% on AIME 2025. 74.9% on SWE-bench.
As we approach the upper bounds of these benchmarks, they die.
What makes GPT-5 and the next generation of models revolutionary isn’t their knowledge. It’s knowing how to act. For GPT-5 this happens at two levels. First, deciding which model to use. But second, and more importantly, through tool calling.
We’ve been living in an era where LLMs mastered knowledge retrieval & reassembly.
Recent Stories
How to Use AI for Contract Review Successfully
Learn how to deploy AI for contract review with playbooks, security checks, and workflow integration to speed reviews without added risk.
Jan 18, 2026OpenHands: An Open Platform for AI Software Developers as Generalist Agents
Software is one of the most powerful tools that we humans have at our disposal; it allows a skilled programmer to interact with the world in complex and profound ways. At the same time, thanks to...
Jan 18, 2026Artificial Intelligence (AI) Infrastructure Spending Is Rising. This Stock Could Benefit.
Rolls-Royce is set to be a leading provider of electricity for AI data centers.