CO/AI Subscribe
Thursday · June 18, 2026 · Issue No. 900
Video

o3 & o4-mini. Deep dive & secret abilities

Watch on YouTube

OpenAI's o3 reshapes how we search information

OpenAI's latest models o3 and o4-mini have arrived with capabilities that move beyond conventional AI chatbots. After extensive testing over several days, I've discovered these models represent a significant leap in how AI can autonomously research, analyze images, and solve complex problems—all with minimal human guidance.

Key Points

  • Agentic capabilities are the game-changer: Both models can autonomously deploy multiple research agents in parallel, allowing them to search the web, analyze images, and run code simultaneously to solve complex tasks.
  • Visual analysis reaches new heights: o3 can identify restaurants from blurry menu photos, pinpoint geographic locations from ordinary landscapes, and solve visual puzzles—all by combining image analysis with web research.
  • Performance varies by task: While o3 and o4-mini excel at reasoning and creative tasks, they still struggle with hallucinations (6.8% and 4.6% rates respectively) compared to Google's Gemini models.
  • TIFF layering and other visual abilities: Unlike other models, o3 can generate layered image files with transparent elements that can be edited afterward in design software.

The Business Game-Changer: Autonomous Research

The most impressive aspect of these models isn't just their ability to analyze information, but how they autonomously research, verify, and synthesize it. When shown a blurry photo of a yacht sailing through a harbor, o3 identified not just the vessel's name and owner, but also researched its last known location and movement patterns—all in just 35 seconds.

This represents a fundamental shift in how businesses will conduct research and analysis. Instead of employees spending hours searching through multiple sources, verifying information, and piecing together reports, these models can coordinate multiple research streams simultaneously, cross-reference findings, and deliver verified results in seconds.

In the context of business intelligence, this means the time from question to actionable insight shrinks dramatically. A financial analyst won't just receive a summary of earnings reports but a comprehensive analysis that contextualizes it against market trends, competitor performance, and historical patterns—all generated while they grab a coffee.

Beyond the Video: Where This Fits in Business Workflows

While the video demonstrates impressive consumer-facing

Share: X LinkedIn Email
Video Feed

More videos

All videos →
Claude Fable 5: When Capability Meets Economics
Video

Claude Fable 5: When Capability Meets Economics

Anthropic released Cloud Fable 5 with a paradox built in: safeguards sophisticated enough to let a mythosclass model...

Run Agentic AI Entirely on Your Mac—No Cloud, No Latency, No Privacy Tradeoffs
Video

Run Agentic AI Entirely on Your Mac—No Cloud, No Latency, No Privacy Tradeoffs

Apple’s MLX framework is mature enough now that you can run serious agentic AI workflows locally on Silicon...

Hermes Agent Master Class
Video

Hermes Agent Master Class

Welcome to the Hermes Agent Master Class — an 11-episode series taking you from zero to fully leveraging...

CONSULTING

Outsider
Labs.

A management consulting team focused on AI transformations for executives and business owners.

Work with us →