The thinking machine is upon us

I’ve always been fascinated by the moment when a technology stops being just impressive and starts being truly useful. We’ve reached that inflection point with AI coding assistants, AI Agents and Anthropic’s latest model, Claude 3.7 Sonnet, represents a significant leap forward in this space.

The Power of Thinking Twice

When I write code, I often find myself working in two distinct modes. Sometimes I dash off quick solutions to simple problems, and other times I need to step back, break down complex issues, and work through them methodically. This dual approach to problem-solving seems to be exactly what inspired Claude 3.7 Sonnet’s most distinctive feature: hybrid reasoning.

Claude 3.7 Sonnet combines rapid response capabilities with an extended thinking mode, essentially giving users two AI assistants in one. It’s like having both a sprint runner and a marathon runner on your team, ready to tackle problems of varying complexity.

“It’s the difference between answering off the top of your head versus pulling out a pen and paper to work through a problem step by step,” as one developer put it to me when describing the experience.

Leading the Pack in Real-World Engineering

The numbers tell a compelling story about Claude 3.7 Sonnet’s capabilities. On SWE-bench Verified, a benchmark for software engineering tasks, Claude 3.7 Sonnet achieves 62.3% accuracy, significantly outperforming OpenAI’s o3-mini at 49.3%.

What makes this particularly impressive is that these aren’t just academic coding challenges—they’re real-world software engineering tasks that require understanding complex codebases and making precise modifications.

Think of it like the difference between a chef who can follow a recipe precisely and one who understands the underlying principles of cooking well enough to improvise. Claude 3.7 Sonnet appears to have developed a deeper understanding of software engineering principles.

The Claude code preview

Alongside Claude 3.7 Sonnet, Anthropic has introduced Claude Code, a command-line tool that takes AI coding assistance to a new level. While still in limited research preview, it represents a shift from AI as advisor to AI as actor.

Claude Code doesn’t just suggest code—it can search codebases, read documentation, edit files, write new code, run tests, and even commit changes to GitHub. It’s the difference between having a navigator in your car versus having a driver.

A senior developer might save an hour here or there with standard AI coding tools, but Claude Code aims to reduce the mechanical aspects of development by automating entire workflows.

The AI coding landscape in flux

The AI coding assistant space has become increasingly competitive. The winners in this space will gain enormous wealth and power over the future of computing. OpenAI’s o3-mini performs well in competitive programming but lags behind in software engineering tasks. Grok 3 shows promise with its Think mode but has less real-world testing data. Cursor AI offers seamless IDE integration but can struggle with more complex tasks.

Each has carved out a different niche in the ecosystem:

Claude 3.7 Sonnet: Excels in software engineering and autonomous coding
OpenAI o3-mini: Strong in competitive programming challenges
Grok 3: Shows potential in reasoning capabilities
Cursor AI: Provides a smooth in-IDE experience

Terminal vs. IDE

The contrast between Claude Code and Cursor AI illustrates two fundamentally different approaches to AI-assisted coding. Claude Code operates from the terminal, emphasizing flexibility and autonomy in managing complex codebases. It’s particularly effective for developers who prefer command-line interactions and need robust debugging features for complex projects.

Cursor AI, on the other hand, integrates with Visual Studio Code, offering a more traditional IDE experience with real-time code suggestions. It excels in speed and efficiency, making it ideal for developers working with languages like JavaScript, HTML, CSS, and Python who prioritize rapid code generation.

This difference extends to their autonomy models as well. Claude Code offers incremental permissions, allowing it to earn trust over time and perform tasks autonomously—a significant advantage for test-driven development and version control. Cursor AI typically requires manual approval for most actions, though this may evolve in future iterations.

The reality is that most developers regularly switch between terminal and IDE environments during their workflow. We use terminals for git operations, build processes, and server management, while relying on IDEs for code editing, debugging, and navigation. Having strong AI assistance in both domains isn’t about choosing one approach over the other—it’s about enhancing our entire development ecosystem. The combination of Claude Code and tools like Cursor AI represents a more complete vision of AI-assisted development that meets developers where they actually work, rather than forcing a single interaction model.

The trade-offs: speed, cost, and control

As with any technology, there are trade-offs. Claude 3.7 Sonnet’s extended thinking mode introduces some latency compared to standard responses. While pricing remains consistent with previous Claude models ($3 per million input tokens and $15 per million output tokens), the extended thinking tokens are factored into the cost. And the most powerful features aren’t available on the free tier.

It’s like choosing between a custom-tailored suit and off-the-rack clothing—you get better results with the former, but at a higher price point and with a longer wait.

The acceleration of creative AI

I’ve been a longtime fan of Claude and rely on it for most of my “vibe working” and “vibe coding” activities—those moments when I need a creative partner more than just a coding tool. With the release of Claude 3.7 Sonnet and Claude Code, I’m witnessing a direction shift in how coding and creation are evolving, and it’s happening much faster than I had anticipated.

Anthropic is advancing this field in ways that feel genuinely transformative. The leap from AI as a passive suggestion engine to an active collaborator that can think through problems methodically represents a paradigm shift in how we approach software development.

What impresses me most is not just the technical capabilities, but how natural the interaction feels. When I’m working with Claude 3.7 Sonnet, the experience is less about prompting an AI and more about collaborative problem-solving with an entity that understands both the technical and creative aspects of coding.

The future of human-AI collaboration

What makes Claude 3.7 Sonnet particularly interesting is how it points toward a future where AI systems can not only assist humans but also work alongside them as collaborators. The ability to toggle between quick responses and deep thinking creates a more natural interaction paradigm that mirrors how humans approach problem-solving.

As one of our CO/AI dev community members described it, “It’s less like a tool and more like a junior developer who can handle increasingly complex tasks with minimal supervision.”

This represents a subtle but important shift in how we think about AI in the workplace—not as something that replaces human capabilities, but as something that amplifies them, handling the tedious and mechanical aspects of coding while freeing up human developers to focus on creativity and innovation.

In the end, that may be Claude 3.7 Sonnet’s most significant contribution: showing us a glimpse of what truly effective AI-human collaboration might look like.

Menu

How Claude 3.7 Sonnet is changing the game in AI coding and AI agents

The thinking machine is upon us

The Power of Thinking Twice

Leading the Pack in Real-World Engineering

The Claude code preview

The AI coding landscape in flux

Terminal vs. IDE

The trade-offs: speed, cost, and control

The acceleration of creative AI

The future of human-AI collaboration

Recent Articles

Anthropic Claude 4 release

How Sam Altman just executed the tech industry’s most audacious talent heist

Google IO 2025 generative AI tools look impressive

Join the revolution

CO/AI

Resources

Join the revolution

Menu

Welcome

How Claude 3.7 Sonnet is changing the game in AI coding and AI agents

The thinking machine is upon us

The Power of Thinking Twice

Leading the Pack in Real-World Engineering

The Claude code preview

The AI coding landscape in flux

Terminal vs. IDE

The trade-offs: speed, cost, and control

The acceleration of creative AI

The future of human-AI collaboration

Recent Articles

Anthropic Claude 4 release

How Sam Altman just executed the tech industry’s most audacious talent heist

Google IO 2025 generative AI tools look impressive

Join the revolution

CO/AI

Resources

Join the revolution