×
Why enterprises are increasingly using small language models
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

The growing prominence of smaller AI models in enterprise applications is reshaping how businesses approach artificial intelligence implementation, with a focus on efficiency and cost-effectiveness.

Key findings from industry research: Databricks’ State of Data + AI report reveals that 77% of enterprise AI implementations utilize smaller models with less than 13 billion parameters, while large models exceeding 100 billion parameters account for only 15% of deployments.

  • Enterprise buyers are increasingly scrutinizing the return on investment of larger AI models, particularly in production environments
  • The cost differential between small and large models is significant, with pricing increasing geometrically as parameter counts rise
  • This trend reflects a broader shift toward practical, cost-effective AI solutions in business settings

Performance advantages of smaller models: Recent advancements have significantly improved the capabilities of smaller AI models, making them increasingly attractive alternatives to their larger counterparts.

  • Smaller models now approach the performance levels of larger models in many applications
  • The reduced cost allows organizations to run multiple iterations for verification purposes, similar to using multiple human reviewers
  • This redundancy capability enhances accuracy and reliability while maintaining cost advantages

Latency considerations: Response time measurements reveal substantial performance advantages for smaller AI models.

  • 7 billion parameter models demonstrate an 18ms latency per token
  • 13 billion parameter models show 21ms latency per token
  • 70 billion parameter models require 47ms per token
  • Larger 405 billion parameter models range from 70-750ms per token
  • These differences significantly impact user experience, as faster response times lead to better engagement

Business implications: The combination of cost savings and performance benefits makes smaller AI models particularly attractive for enterprise deployment.

  • Organizations can achieve comparable results at a fraction of the cost of larger models
  • Reduced latency translates to improved user experiences and higher productivity
  • The efficiency gains allow for broader implementation across various business functions

Looking ahead: The trend toward smaller, more efficient AI models suggests a maturing market where practical considerations are beginning to outweigh the pursuit of ever-larger models, potentially signaling a new phase in enterprise AI adoption focused on optimization rather than scale.

Small but Mighty AI by @ttunguz

Recent News

Social network Bluesky says it won’t train AI on user posts

As social media platforms debate AI training practices, Bluesky stakes out a pro-creator stance by pledging not to use user content for generative AI.

New research explores how cutting-edge AI may advance quantum computing

AI is being leveraged to address key challenges in quantum computing, from hardware design to error correction.

Navigating the ethical minefield of AI-powered customer segmentation

AI-driven customer segmentation provides deeper insights into consumer behavior, but raises concerns about privacy and potential bias.