×
Everything we know about OpenAI’s new o3 AI models
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

OpenAI is developing two new advanced reasoning models that promise significant improvements in complex problem-solving capabilities, particularly in coding, mathematics, and scientific applications.

Breaking developments: OpenAI CEO Sam Altman announced two new frontier models, o3 and o3-mini, during the company’s final “12 Days of OpenAI” livestream event.

  • The announcement comes just one day after Google’s release of Gemini 2.0 Flash Thinking, intensifying competition in the AI reasoning model space
  • Initial access will be limited to selected third-party researchers for safety testing
  • O3-mini is expected to launch by January 2025, with o3 following shortly after

Performance benchmarks: The o3 model has demonstrated unprecedented capabilities across multiple technical disciplines.

  • Achieved a 22.8 percentage point improvement over its predecessor on SWE-Bench Verified coding tests
  • Scored 96.7% on the AIME 2024 mathematics exam
  • Set new records on EpochAI’s Frontier Math, solving 25.2% of problems where other models achieve less than 2%
  • Tripled the previous model’s score on the ARC-AGI test, reaching over 85% accuracy

Safety and alignment innovations: OpenAI has introduced a new approach called deliberative alignment to ensure responsible AI development.

  • The technique embeds human-written safety specifications directly into the models
  • Models can now engage in chain-of-thought reasoning about safety policies before generating responses
  • This approach improves upon previous methods like reinforcement learning from human feedback (RLHF)
  • Early results show enhanced performance on safety benchmarks and better resistance to jailbreak attempts

Access and testing program: OpenAI has opened applications for early access to researchers until January 10, 2025.

  • Applicants must provide detailed information about their research focus and experience
  • Selected researchers will help evaluate capabilities and safety implications
  • The program emphasizes testing high-risk scenarios and developing robust evaluation methods
  • Applications will be reviewed on a rolling basis

Strategic implications: The rapid advancement in AI reasoning capabilities marks a significant shift in the competitive landscape.

  • The timing of OpenAI’s announcement, following Google’s Gemini 2.0 release, highlights the intensifying race in AI development
  • The focus on reasoning models suggests a new phase in AI evolution, moving beyond language models toward more sophisticated problem-solving capabilities
  • OpenAI’s emphasis on safety testing and researcher collaboration indicates a measured approach to deploying these powerful new tools

Looking ahead: While these models represent significant technical achievements, their true impact will depend on how effectively they can be deployed while maintaining safety and reliability standards, potentially reshaping the boundaries of what AI can accomplish in scientific and technical fields.

OpenAI confirms new frontier models o3 and o3-mini

Recent News

Bill Gates thinks this is one of the most important books on AI ever written

Within five years, AI could force hundreds of millions of workers to switch careers while creating new opportunities in emerging tech roles.

Samsung unveils AI-powered interactive classroom displays

Samsung adapts popular Android phone features into a large interactive screen designed specifically for classroom teaching.

Deloitte: 74% of enterprises have already beat their generative AI goals

Early investments in generative AI are yielding positive returns for enterprises, though system integration and compliance remain key hurdles.