×
Experts weigh in on what happens when AI models don’t keep getting better
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

The rapid advancement of artificial intelligence may be approaching a significant slowdown, particularly in the development of large language models (LLMs), as traditional training methods show signs of diminishing returns.

Current state of AI development: OpenAI’s next major model release, codenamed Orion, is demonstrating smaller performance improvements compared to previous generational leaps between models like GPT-3 and GPT-4.

  • Internal researchers at OpenAI report that Orion isn’t consistently outperforming its predecessor on various tasks
  • This plateauing effect represents a significant departure from the exponential growth in AI capabilities observed in recent years
  • The development raises questions about the sustainability of current AI training approaches

Key challenges facing LLM advancement: The AI industry is grappling with a fundamental limitation in the availability of quality training data, which has been essential for improving model performance.

  • Researchers have largely exhausted the most accessible and valuable text data from the public internet and published books
  • Epoch AI research suggests that language models will fully utilize the available stock of human-generated public text between 2026 and 2032
  • This looming data scarcity presents a significant obstacle for traditional training methods

Alternative approaches and solutions: Companies and researchers are exploring various strategies to overcome the limitations of conventional training methods.

  • OpenAI has begun experimenting with synthetic data generated by existing models, though this approach risks “model collapse” after multiple training cycles
  • Some researchers are investigating improved reasoning capabilities as an alternative to expanding training data
  • Knowledge distillation techniques are being explored to help larger “teacher” networks train more refined “student” networks

Industry perspective: Former OpenAI co-founder Ilya Sutskever acknowledges a shift away from the scaling-focused approach that characterized AI development in the 2010s.

  • Sutskever emphasizes that the industry is entering a new phase of “wonder and discovery”
  • The focus is shifting from simple scaling to finding more innovative approaches to AI development
  • The challenge lies in identifying and scaling the right elements rather than just increasing computational resources

Emerging trends: A more specialized approach to AI development may represent the next evolution in the field.

  • Microsoft has demonstrated success with smaller, task-specific language models
  • The future of AI might mirror academic specialization, with models focusing on narrower, more specific domains
  • This trend could lead to more efficient and effective AI systems for specialized applications

Future implications: The apparent plateau in traditional LLM training methods could mark a crucial turning point in AI development, potentially leading to more diverse and specialized approaches rather than the pursuit of ever-larger generalist models.

What if AI doesn’t just keep getting better forever?

Recent News

Meta’s AI training plans ignite privacy showdown with European watchdog

EU privacy watchdog noyb challenges Meta's use of personal data for AI training without explicit user consent, threatening legal action.

Zayo moves to the frontlines of AI infrastructure with major fiber expansion

Zayo's Crown Castle acquisition creates one of North America's largest fiber networks to support growing bandwidth requirements for AI compute, edge deployments, and digital transformation initiatives.

The global fight for AI dominance enters a critical phase as Europe risks falling behind

European leaders attempt to build sovereign cloud infrastructure as the continent lags behind the US and China in AI development investment, talent acquisition, and technological infrastructure.