
2025 in LLMs so far, illustrated by Pelicans on Bicycles

AI progress races ahead while humans pedal to catch up

The pace of development in large language models has accelerated dramatically in early 2025, with breakthroughs arriving almost weekly that are reshaping our expectations of artificial intelligence. Simon Willison's recent talk, whimsically illustrated with pelicans on bicycles, captures this technological vertigo perfectly. As business leaders struggle to keep pace with these developments, Willison offers a clear-eyed assessment of where we are and where we're headed.

In his characteristically accessible style, Willison walks us through the state of LLMs in 2025, highlighting how much the landscape has shifted in a short time. He focuses on the capabilities of the leading models from labs such as Anthropic and OpenAI, using concrete examples that demonstrate their reasoning abilities and surprising emergent properties. The pelican metaphor runs throughout – suggesting we're all somewhat awkward creatures trying to balance on unfamiliar technology – yet the underlying message is deadly serious: AI capabilities are advancing at a pace that demands our attention.

Key developments shaping the AI landscape in 2025:

  • Tool use has become sophisticated and seamless – Modern LLMs can now identify when to use tools, select the appropriate ones, and execute complex workflows without explicit instruction, representing a massive shift from earlier models that required careful prompting.

  • Multi-modal integration has reached new heights – The latest models can reason across text, images, code, and other modalities in ways that feel increasingly natural, breaking down the artificial boundaries between different types of information.

  • Self-improvement capabilities have emerged – Models can now critique their own outputs, identify weaknesses, and iteratively improve solutions, demonstrating a form of metacognition that was barely hinted at in previous generations.

  • Reasoning abilities have deepened significantly – 2025's LLMs are markedly better at working through complex problems step by step and maintaining context, and they are less prone to the hallucinations that plagued earlier generations.
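
For readers who want to see what "tool use" means mechanically, here is a minimal sketch of the loop described in the first bullet: the model either asks for a tool or returns an answer, and the surrounding program runs the tool and feeds the result back. Everything in it is an assumption made for illustration. `fake_model`, the `TOOLS` registry, and the `result:` convention are hypothetical stand-ins, not any vendor's API and not something shown in Willison's talk.

```python
from typing import Callable

# Hypothetical tool registry; the names and functions are illustrative only.
TOOLS: dict[str, Callable[[str], str]] = {
    # Adds integers separated by "+", e.g. "2+3" -> "5".
    "calculator": lambda expr: str(sum(int(part) for part in expr.split("+"))),
}


def fake_model(prompt: str) -> dict:
    """Stand-in for an LLM call (assumption: a real model would return
    either a structured tool request or a final answer)."""
    if "+" in prompt and "result:" not in prompt:
        # The "model" decides it needs a tool and names which one.
        return {"tool": "calculator", "input": prompt}
    # Otherwise it answers, using the tool result if one is present.
    return {"answer": prompt.split("result:")[-1].strip()}


def run(prompt: str, max_steps: int = 3) -> str:
    """Loop until the model answers or the step budget runs out."""
    for _ in range(max_steps):
        reply = fake_model(prompt)
        if "answer" in reply:
            return reply["answer"]
        # Execute the requested tool and append its output to the prompt.
        output = TOOLS[reply["tool"]](reply["input"])
        prompt = f"{prompt} result: {output}"
    return "gave up"
```

Calling `run("2+3")` triggers one tool call and then an answer, while a prompt with no arithmetic is answered directly. The step budget is a design choice in this sketch: it keeps a confused model from looping forever.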

Perhaps the most profound insight from Willison's talk is how the relationship between humans and AI is fundamentally changing. We're moving beyond the era of prompt engineering where humans carefully crafted inputs to extract useful outputs. Today's models actively collaborate with users, suggesting approaches, identifying gaps in reasoning, and bringing relevant knowledge to
