
Apr 12, 2025

Tech giants pour billions into AI infrastructure as data center demand surges

The AI infrastructure sector continues to see major developments as tech giants adapt their data centers for AI compute demands. Google's massive Li-ion battery deployment, Apple's potential billion-dollar AI hardware investment, and warnings of a potential data center construction bubble highlight how infrastructure requirements are evolving to meet AI's exponential growth. These developments reveal the substantial physical foundation being built to support the next generation of AI applications. The big picture: Google has deployed over 100 million lithium-ion battery cells across its global data center network, marking a significant shift from traditional lead-acid batteries to more efficient power solutions. The...

Apr 12, 2025

How telecom providers use AI to cut costs and create new revenue

Telecommunications providers are rapidly embracing artificial intelligence as a dual-purpose technology that simultaneously reduces operational costs and creates new revenue opportunities. Fujitsu Networks executive Rich Colter highlighted at Mobile World Congress that the most successful AI implementations in telecom extend beyond just radio network optimization to encompass the entire network infrastructure, including optical networks and operational workflows. This comprehensive approach allows communications service providers (CSPs) to remain competitive while extracting maximum value from their substantial network investments. The big picture: CSPs need an end-to-end AI strategy that addresses both network optimization and new business opportunities rather than implementing isolated AI...

Apr 11, 2025

Tech giants forge AI infrastructure partnerships to meet growing computing demands

Major tech partnerships are transforming the AI infrastructure landscape, with top companies focusing on digital twins, cloud integrations, and increased data center capacity in Asia. These collaborations highlight how the underlying technology powering AI systems is evolving to meet growing demands for more efficient, sustainable computing resources capable of running increasingly complex AI workloads. The big picture: Schneider Electric and ETAP have created what they claim is the first AI factory digital twin, allowing comprehensive simulation of power requirements across AI data centers. The solution runs on NVIDIA Omniverse Cloud APIs and provides real-time monitoring and predictive analytics for AI...

Apr 11, 2025

NTT Data launches agentic AI services targeting $2 billion revenue by 2027

NTT Data is making an aggressive move into the agentic AI market with a new suite of services that work across major cloud platforms. The company aims to help businesses create their own AI agents, with particular focus on manufacturing, banking, and healthcare sectors. This launch reflects the broader industry shift toward autonomous AI systems, with Gartner forecasting substantial growth in this space and predicting that AI agents will increasingly influence enterprise decision-making in the coming years. The big picture: NTT Data has launched a comprehensive agentic AI service suite designed to work with major cloud platforms, targeting multiple industries...

Apr 11, 2025

Nvidia’s new benchmarking tools help businesses measure AI infrastructure performance

Nvidia's new DGX Cloud Benchmark Recipes offer businesses unprecedented insight into AI infrastructure performance, addressing a critical need as organizations struggle to evaluate hardware capabilities for increasingly complex AI workloads. The tools allow organizations to make data-driven decisions about infrastructure investments by providing real-world performance data on today's most advanced AI models. The big picture: Nvidia has developed performance testing tools called DGX Cloud Benchmark Recipes that help organizations evaluate how their hardware and cloud infrastructure perform when running sophisticated AI models. The toolkit includes both a database of performance results across various GPU configurations and cloud providers, as well...

Apr 11, 2025

Enterprise AI in its self-driving phase: 5 trends to watch in 2025

Major technology companies are prioritizing enterprise-ready AI platforms that deliver measurable business value as they navigate the next phase of AI evolution. At Morgan Stanley's recent Technology, Media & Telecom Conference, industry leaders outlined five key trends shaping AI's trajectory in 2025—focusing on advanced reasoning capabilities, custom silicon development, cloud migration opportunities, AI evaluation methodologies, and autonomous AI systems. These developments signal a maturing AI landscape where performance optimization, profitability metrics, and security considerations are becoming central to enterprise adoption. The big picture: Tech companies are building AI platforms specifically designed for enterprise requirements, emphasizing optimized performance, profitability metrics, and...

Apr 11, 2025

Supermicro unveils petascale storage server with NVIDIA Grace CPU for AI workloads

Supermicro's new petascale storage server represents a significant advancement in AI infrastructure by combining NVIDIA's Grace CPU Superchip with high-density storage capabilities specifically designed for AI workloads. This collaboration between Supermicro, NVIDIA, and WEKA addresses the growing demand for efficient, high-performance storage solutions needed to support the massive datasets required for AI and ML operations, potentially transforming how organizations build their AI computing environments. The big picture: Supermicro has introduced a first-of-its-class petascale all-flash storage server optimized for high-performance software-defined storage workloads in AI environments. The system utilizes the NVIDIA Grace CPU Superchip with 144 Arm Neoverse V2 cores, providing...

Apr 11, 2025

WEKA’s new AI data platform cuts inference costs by 24% through NVIDIA partnership

WEKA's newest AI data platform innovations are set to reshape the technological landscape for enterprises deploying advanced AI systems. The company's expanded partnership with NVIDIA introduces capabilities specifically designed to address memory bottlenecks that have historically limited inference performance for large language models. By combining WEKA's data management expertise with NVIDIA's computing power, these advancements aim to dramatically reduce costs and improve efficiency for organizations deploying increasingly complex AI reasoning and agent-based systems. The big picture: WEKA has strengthened its NVIDIA partnership through integration with the NVIDIA AI Data Platform reference design and achieved new storage certifications for NVIDIA's cloud...

Apr 10, 2025

(Not) Just Between Us: Amazon is killing Echo’s local voice processing, forcing all commands to the cloud

Amazon is forcing a significant privacy change on Echo users by eliminating local voice processing capabilities, requiring all voice commands to be processed in the cloud starting March 28, 2025. This shift represents a growing tension between advanced AI features and user privacy controls, as companies increasingly centralize processing to support more sophisticated capabilities. The change affects even privacy-conscious users who had specifically opted to keep their voice commands processed locally on their devices. The big picture: Amazon is removing the "Do Not Send Voice Recordings" option from Echo devices, forcing all user voice commands to be processed in the...

Apr 8, 2025

CoreWeave’s IPO filing shows $1.9B revenue surge as AI cloud demand explodes

CoreWeave's surge to nearly $2 billion in revenue showcases the exploding demand for specialized AI cloud infrastructure. The Nvidia-backed startup's IPO filing reveals an eightfold revenue increase in 2024, positioning it as the first major AI infrastructure provider to test public markets this year. The offering could value CoreWeave above $35 billion, reflecting investors' appetite for companies enabling the AI revolution despite substantial operating losses. The big picture: CoreWeave's IPO filing reveals spectacular revenue growth to $1.92 billion in 2024, up from $228.9 million the previous year, highlighting the enormous demand for specialized AI computing resources. The New Jersey-based cloud...

Apr 8, 2025

Huawei Cloud expands AI-native strategy with 50% overseas growth at MWC 2025

Huawei Cloud is aggressively expanding its AI-native approach with a comprehensive set of cloud services designed to drive intelligent transformation across industries. At MWC 2025, the company unveiled new solutions that reflect its two-pronged strategy: enhancing cloud services with AI and building specialized platforms for AI development. This strategic focus comes at a time when the company is experiencing over 50% overseas growth and gaining traction with carriers, financial institutions, and other sectors worldwide. The big picture: Huawei Cloud is positioning itself at the intersection of AI and cloud computing with its "Accelerate Intelligence, Amplify Success" strategy unveiled at MWC...

Apr 8, 2025

Amazon forms dedicated agentic AI group as next “multi-billion business” for AWS

Amazon's new focus on agentic AI represents a significant strategic shift that could reshape how the company delivers automated services to customers. This development, led by key AWS executives, aims to create AI systems that can independently perform complex tasks without continuous user prompting—potentially transforming everything from business operations to consumer interactions with Amazon's ecosystem. The big picture: Amazon has established a dedicated group focused on agentic artificial intelligence under the leadership of AWS executive Swami Sivasubramanian, who will report directly to AWS CEO Matt Garman. According to an internal email viewed by Reuters, Garman believes "agentic AI has the...

Apr 7, 2025

Inside hyperscale AI data centers: How tech giants power the AI revolution

Hyperscale AI data centers represent the backbone infrastructure powering the artificial intelligence revolution, providing the massive computational resources needed for today's most advanced AI applications. These specialized facilities differ significantly from traditional data centers, incorporating specialized hardware, advanced cooling systems, and optimized architectures specifically designed to handle the unique demands of AI workloads like machine learning and deep learning. As major tech companies including AWS, Google Cloud, Microsoft Azure, and NVIDIA continue expanding their hyperscale facilities, these data centers are becoming increasingly critical to enabling the next generation of AI innovations. The big picture: Hyperscale AI data centers are purpose-built...

Apr 7, 2025

Tech giants expand AI infrastructure with new partnerships and data centers

Tech giants are rapidly expanding AI infrastructure to meet the growing demands of this computing-intensive technology. Recent collaborations between major companies like Schneider Electric, Oracle, NVIDIA, and Digital Realty highlight the industry's push toward more sophisticated simulation technologies, cloud integration, and global data center expansion. These developments point to a strategic shift in how companies are building the physical and digital foundations needed to support increasingly complex AI workloads. The big picture: Major tech and infrastructure companies are forming strategic partnerships to build more advanced AI infrastructure systems that can optimize power usage, accelerate workloads, and expand global data center...

Apr 7, 2025

CoreWeave seeks $32 billion valuation in pivotal AI infrastructure IPO test

CoreWeave's upcoming IPO represents a significant test for both the U.S. public offering market and investor confidence in AI infrastructure providers. The Nvidia-backed cloud services company is seeking a valuation of up to $32 billion on a fully diluted basis, positioning itself as a specialized player in the booming generative AI sector that has driven record market gains over the past two years. The big picture: CoreWeave and selling shareholders aim to raise up to $2.7 billion by offering 49 million shares priced between $47 and $55 each in what could be one of the year's most closely watched tech...

Apr 7, 2025

Oracle’s new AI Agent Studio brings custom agent creation to enterprise cloud

Oracle's expansion into AI agent capabilities marks a significant evolution in enterprise cloud offerings, positioning 2025 as a potential watershed moment for enterprise AI integration. Following the introduction and rapid adoption of generative AI in 2023-2024, major technology providers like Oracle are now embedding sophisticated AI functionality directly into their core enterprise platforms, creating purpose-built solutions that leverage decades of industry expertise and infrastructure. The big picture: Oracle has unveiled Oracle AI Agent Studio for Fusion Applications, a comprehensive toolkit that allows enterprises to build, customize, and orchestrate AI agents within their existing Oracle environments. This launch follows Oracle's earlier...

Apr 7, 2025

Satellite internet expansion becomes critical backbone for AI’s always-on future

The race for satellite internet connectivity is accelerating as a fundamental enabler for AI integration across devices and services. Satellite internet providers are rapidly expanding through partnerships with nations, phone manufacturers, airlines, and other industries, creating always-on connectivity that AI systems require to function optimally. This technological convergence is particularly critical for AI agents and IoT devices, which need constant connections to receive model updates and effectively automate various aspects of daily life. The big picture: Satellite internet is experiencing tremendous growth as providers forge partnerships across multiple sectors, with AI capabilities emerging as a significant driver for further expansion....

Apr 7, 2025

Oracle adds free AI Agent Studio to Fusion Cloud as enterprise agents surge

Oracle is making a strategic move in the enterprise AI race by adding an AI Agent Studio to its Fusion Cloud business applications at no extra cost. This addition comes as AI agents rapidly transition from novelty to necessity in enterprise software, with Gartner predicting a jump from less than 1% adoption in 2024 to 33% by 2028. The global AI agents market, already valued at $5.4 billion in 2024, is projected to grow at nearly 46% annually through 2030 as organizations seek to automate workflows and enhance customer experiences. The big picture: Oracle is working to retain enterprise customers...

Apr 7, 2025

7 ways network security is transforming in 2025 as traditional perimeters fade

Network security is undergoing a fundamental shift in 2025, driven by increasingly distributed work environments, cloud adoption, and IoT proliferation. Traditional security perimeters are dissolving as organizations seek comprehensive protection across dispersed networks. Emerging technologies like SD-WAN and Secure Access Service Edge (SASE) are responding to these challenges, helping companies maintain security despite growing complexity. Understanding these shifts is critical for organizations to develop effective protection strategies in an interconnected digital landscape where threats continue to evolve in sophistication. 1. Holistic security becomes imperative: The expanding attack surface and sophisticated cyber threats are making integrated security platforms essential. Organizations struggle to...

Apr 7, 2025

Nvidia rebrands as an “AI factory” as GPU sales triple to cloud providers

Nvidia's transformation from a chip manufacturer to an "AI factory" represents a strategic pivot reflecting the massive scale of AI infrastructure investments in today's tech landscape. CEO Jensen Huang's recent remarks signal how the company now sees itself primarily as an infrastructure provider enabling customer revenue streams rather than simply selling GPU hardware, marking a fundamental shift in Nvidia's business model and its relationship with enterprise customers. The big picture: Nvidia CEO Jensen Huang has publicly redefined the company as "an AI infrastructure company" rather than just a chip seller, signaling a dramatic evolution in its business model. During the...

Apr 6, 2025

Google’s TPUs are changing the game for AI processing speed and efficiency

Tensor Processing Units (TPUs) represent a significant advancement in specialized hardware for AI applications, offering performance capabilities that traditional processors cannot match. These purpose-built chips, developed by Google in 2016, have become foundational infrastructure for modern AI systems, enabling faster model training and deployment while reducing energy consumption and operational costs. Understanding TPU technology is increasingly important as AI applications become more prevalent across industries and computational demands continue to grow. What TPUs are: Tensor Processing Units are specialized chips designed specifically to accelerate AI and machine learning workloads through optimized tensor computation processing. Unlike general-purpose CPUs or even graphics...

Apr 4, 2025

Why AI FinOps is becoming essential for controlling generative AI costs

The rapid adoption of artificial intelligence, particularly generative AI, is reshaping enterprise operations while introducing significant financial challenges. As AI services become essential business tools, organizations face complex cost structures across cloud platforms that demand strategic management. Financial Operations (FinOps) practices are emerging as a critical framework for maintaining cost efficiency while maximizing AI's business value. The big picture: The resource-intensive nature of AI services requires organizations to develop comprehensive FinOps strategies to prevent runaway costs while still leveraging AI's transformative potential. Cloud providers like AWS, Azure, and Google Cloud offer extensive AI capabilities that consume substantial CPU/GPU resources and...

Apr 4, 2025

IBM to enterprise AI: $5 billion to beam up

IBM is transforming the enterprise AI landscape with a multi-pronged strategy that combines proprietary models, Red Hat hybrid cloud integration, and global consulting capabilities. The tech giant's pragmatic approach has already generated $5 billion in AI-related business in under two years, with 80% coming from consulting engagements and the remainder from software subscriptions. This enterprise-first strategy particularly targets regulated industries like financial services and healthcare, where security, governance, and compliance concerns dominate decision-making. The big picture: IBM's AI strategy centers on smaller, specialized models deployed across hybrid cloud environments rather than massive general-purpose models, positioning the company as a trusted...

Apr 1, 2025

Here are the 5 power constraints emerging as the biggest bottlenecks in AI datacenter expansion

The growing power demands of AI infrastructure are creating significant bottlenecks in datacenter construction and expansion. Powerful AI systems require unprecedented levels of electricity—far beyond what traditional computing infrastructure needs—creating a complex set of challenges for companies racing to build the computational foundation for artificial intelligence. As AI adoption accelerates, resolving these power-related constraints will determine which organizations can effectively scale their AI capabilities. 1. Power availability – the fundamental constraint: AI datacenters require massive amounts of energy to power their computational workloads, especially for training large language models, creating demand that often exceeds what existing electrical grids can supply....
