×
Why generative AI may not actually cause a compute crunch
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

The AI computing power landscape: Debunking scarcity concerns: Despite widespread worries about generative AI (genAI) causing a shortage of computing power, a detailed analysis suggests that the long-standing regime of affordable and widely available computing resources is likely to persist.

  • Even under aggressive assumptions, genAI workloads are projected to account for only about 34% of global data centre-based AI computing supply by 2028.
  • This forecast challenges the notion of an impending scarcity crisis in AI computing resources.

Breaking down genAI computational demands: GenAI computational requirements into three distinct areas, each with varying resource intensities and usage patterns.

  • Model Training: This is the most resource-intensive phase, typically undertaken by a small number of companies with substantial computing capabilities.
  • Fine-Tuning: Less demanding than initial training, fine-tuning requires approximately 0.1-10% of the resources used in training.
  • Inference: The most common form of genAI usage, inference can often be performed on less specialized hardware, making it more accessible and less resource-intensive.

Quantitative modeling and bullish projections: A comprehensive quantitative model was developed to assess the future landscape of AI computing supply and demand, incorporating ambitious growth projections and realistic supply constraints.

  • The model assumes continued scaling of AI models, with projections reaching up to 15 trillion parameters per prompt by 2028.
  • Aggressive global adoption rates and exponential growth in utilization were factored into the demand projections.
  • On the supply side, estimates suggest AI computing power will increase by approximately 60 times by the end of 2025 compared to early 2023 levels.
  • Growth is expected to moderate to around 60% annually through 2028, still representing significant expansion.

Potential breaking points beyond hardware: While computing hardware supply appears sufficient to meet demand, other potential constraints could impact the AI computing landscape.

  • A sudden explosion in consumer demand for inference could strain available resources.
  • Supply chain disruptions could affect the production and distribution of necessary hardware components.
  • Energy constraints are identified as the most pressing concern, potentially limiting the expansion of AI computing capabilities.

Implications for businesses: Given the projected abundance of computing power, businesses should focus on leveraging this resource rather than preparing for scarcity.

  • Companies are advised to develop strategies to capitalize on the anticipated availability of AI computing resources.
  • This approach could lead to more innovative and ambitious AI projects, as concerns about resource limitations are alleviated.

Broader context and future outlook: Future analysis promises to propose a framework for leveraging the expected abundance of computing power.

  • This comprehensive approach to analyzing AI computing resources provides valuable insights for both technology strategists and policymakers.
  • The findings challenge prevailing narratives about AI resource scarcity, potentially reshaping discussions about the future of AI development and implementation.

Critical analysis and remaining questions: Several important questions remain unanswered.

  • How might geopolitical factors or unexpected technological breakthroughs affect the projected supply and demand dynamics?
  • What are the environmental implications of the massive expansion in AI computing power, particularly in light of the identified energy constraints?
  • How might the democratization of AI computing resources impact innovation and competition in the tech industry?

By addressing these questions in future research, a more complete picture of the AI computing landscape can emerge, helping stakeholders navigate the rapidly evolving world of artificial intelligence.

🔮 Will genAI cause a compute crunch? No.

Recent News

Veo 2 vs. Sora: A closer look at Google and OpenAI’s latest AI video tools

Tech companies unveil AI tools capable of generating realistic short videos from text prompts, though length and quality limitations persist as major hurdles.

7 essential ways to use ChatGPT’s new mobile search feature

OpenAI's mobile search upgrade enables business users to access current market data and news through conversational queries, marking a departure from traditional search methods.

FastVideo is an open-source framework that accelerates video diffusion models

New optimization techniques reduce the computing power needed for AI video generation from days to hours, though widespread adoption remains limited by hardware costs.