OpenAI’s New AI Models Claim PhD-Level Skill — Here’s Who Has Access

OpenAI unveils new AI model family: OpenAI has introduced a new series of AI models called “o1,” designed to tackle complex tasks and surpass the capabilities of its previous GPT series.

  • The o1 family currently includes two models: o1-preview and o1-mini, both available to ChatGPT Plus users with initial usage limits.
  • These models are specifically designed for reasoning through complex tasks and solving harder problems in fields like science, healthcare, and technology.
  • OpenAI cautions that the o1 models currently lack some features present in GPT-4, such as web browsing and image processing capabilities.

o1-preview: A PhD-level performer: The o1-preview model demonstrates exceptional capabilities in various academic and professional domains.

  • According to OpenAI, it performs at a level comparable to PhD students on challenging benchmark tasks in physics, chemistry, and biology.
  • The model excels in coding, ranking in the 89th percentile in Codeforces competitions.
  • On a qualifying exam for the International Mathematical Olympiad (IMO), o1-preview solved 83% of the problems, compared with 13% for GPT-4o.

o1-mini: Cost-effective alternative: OpenAI has also launched o1-mini, a more streamlined version of the o1 model family.

  • Although more streamlined, o1-mini is optimized for coding and STEM tasks and still delivers strong performance in math and programming.
  • It scored 70% on the AIME math competition, close to the 74% of the full o1 model.
  • o1-mini is 80% cheaper than o1-preview, making it an attractive option for developers and researchers with specific reasoning needs.
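
To make the price difference concrete, here is a minimal sketch of the arithmetic. The workload cost is a hypothetical placeholder, not an OpenAI price; only the 80% figure comes from the article.

```python
def discounted_price(base_price: float, discount: float) -> float:
    """Return the price after a fractional discount (0.80 means 80% lower)."""
    if not 0.0 <= discount <= 1.0:
        raise ValueError("discount must be between 0 and 1")
    return base_price * (1.0 - discount)


# Hypothetical placeholder: what some workload costs on o1-preview, in dollars.
PREVIEW_COST = 100.0
# "80% lower" as reported in the article.
MINI_DISCOUNT = 0.80

mini_cost = discounted_price(PREVIEW_COST, MINI_DISCOUNT)
print(f"o1-preview: ${PREVIEW_COST:.2f}, o1-mini: ${mini_cost:.2f}")
```

Put differently, a job that costs a given amount on o1-preview would cost about one fifth as much on o1-mini, assuming the same usage.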

Enhanced safety and security measures: Both o1 models incorporate improved safety features and align with OpenAI’s commitment to responsible AI development.

  • The models use a new safety training approach, enhancing their ability to follow safety and alignment guidelines.
  • o1-preview scored 84 on a scale of 0 to 100 on one of OpenAI’s toughest jailbreaking tests, significantly outperforming GPT-4o’s score of 22.
  • OpenAI has entered into agreements with the U.S. and U.K. AI Safety Institutes to support the evaluation and testing of future AI systems.

Future developments and implications: The introduction of the o1 model family represents a significant step forward in AI capabilities, with potential impacts across various industries.

  • OpenAI plans to regularly update and improve these models, including adding features like browsing and image processing.
  • The company will continue developing both the GPT and o1 series, potentially leading to further advancements in AI applications.
  • As these models become more widely available, they could significantly impact fields such as scientific research, healthcare, and software development.

Analyzing the broader impact: While the o1 models show impressive capabilities, their full potential and limitations remain to be seen.

  • The development of these models raises questions about the future of AI in specialized fields and the potential for AI to augment or even replace human expertise in certain areas.
  • As AI continues to advance, it will be crucial to monitor and address ethical concerns, particularly regarding data privacy, bias, and the societal implications of increasingly capable AI systems.
  • The introduction of the o1 family also highlights the rapid pace of AI development, underscoring the need for ongoing discussions about AI governance and regulation.
Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance
