News/Computing
AI powers Chinese tech giant ZTE’s Q1 results, company reports
Chinese tech giant ZTE is leveraging artificial intelligence across its business segments to drive financial growth and expand its market position. The company's Q1 2025 results demonstrate how strategic AI integration is helping technology providers diversify revenue streams beyond traditional telecommunications infrastructure, particularly as global demand for AI-powered solutions accelerates across enterprise and consumer markets. The big picture: ZTE reported CNY 32.97 billion ($4.50 billion) in Q1 2025 revenue, representing a 7.8% year-over-year increase as the company successfully capitalized on AI-driven market opportunities. Net profit reached CNY 2.45 billion, reflecting the company's successful execution of its AI strategy across multiple...
Apr 24, 2025
AI supercomputers are a US first, China second phenomenon. And growing rapidly.
AI supercomputers are scaling at an exponential rate, with performance doubling every nine months while power requirements and costs double annually. This unprecedented growth, detailed in a comprehensive study of 500 AI systems from 2019-2025, reveals a dramatic shift toward private ownership of computing resources, with industry now controlling 80% of global AI compute power. Understanding these trends is crucial as we approach a future where leading AI systems could require power equivalent to multiple cities and hardware investments in the hundreds of billions. The big picture: AI supercomputers have experienced explosive growth in computational performance, increasing 2.5x annually through...
Apr 23, 2025
Docker simplifies AI model deployment with new container workflow
Docker is applying container technology principles to artificial intelligence development, creating a unified system for deploying AI components within enterprise environments. Their new tools – MCP Catalog, MCP Toolkit, and Model Runner – standardize how developers work with AI models and external tools using familiar Docker workflows. This approach addresses critical challenges in AI implementation by bringing Docker's established containerization benefits of portability, security, and consistency to the rapidly evolving AI development landscape. The big picture: Docker has launched three new tools that bring container technology principles to artificial intelligence development workflows, allowing organizations to manage AI components with the...
Apr 23, 2025
AI model aims to advance multiple scientific fields
Los Alamos National Laboratory's ambitious "General Scientific AI" initiative represents a paradigm shift in how artificial intelligence can accelerate scientific discovery across diverse fields. By developing a unified AI model capable of working within any scientific domain—from nuclear physics to climate science—LANL is pioneering an approach that could fundamentally transform how research is conducted at national laboratories and beyond. This effort demonstrates the evolving role of AI from narrow applications to becoming a versatile scientific partner with the potential to drive breakthrough discoveries. The big picture: Los Alamos National Laboratory is developing a "General Scientific AI" capable of working across...
Apr 21, 2025
Huawei prepares new AI chip to rival Nvidia in China
Huawei's upcoming 910C AI chip represents a critical development in China's semiconductor industry at a time of increasing U.S. export restrictions on advanced technology. The mass shipment of these chips, beginning as early as next month, arrives just as Chinese AI companies face new limitations on accessing Nvidia's H20 chips, creating an urgent market opportunity for domestic alternatives in China's rapidly evolving AI landscape. The big picture: Huawei plans to begin mass shipments of its advanced 910C AI chip to Chinese customers as early as May 2025, with some deliveries already underway. The timing coincides with the Trump administration's recent decision...
Apr 18, 2025
AI grapples with data scarcity, once thought to be no problem at all
AI models are facing a surprising challenge: they're running out of data to train on despite years of discussion about data abundance. This shortage could hamper AI advancement as soon as 2026, with overtraining exacerbating the problem by requiring ever-larger datasets. The situation creates a paradox where AI systems increasingly rely on synthetic data they've created themselves, potentially leading to less diverse outputs and amplified biases. The big picture: AI's hunger for data is outpacing supply, with models like ChatGPT requiring hundreds of billions of words and newer systems like Databricks' DBRX consuming trillions of data points. Reading a novel...
Apr 18, 2025
Foxconn launches FoxBrain AI to give manufacturing efficiency a boost
Foxconn's entry into the AI large language model space represents a significant shift for the world's largest electronics manufacturer, as it leverages artificial intelligence to transform its core manufacturing operations. The Taiwan-based company's new FoxBrain model, built on Meta's Llama 3.1 architecture, signals how traditional manufacturing giants are increasingly developing proprietary AI systems to optimize their industrial processes and supply chains rather than merely integrating third-party solutions. The big picture: Foxconn has developed FoxBrain, its first large language model with 70 billion parameters and a 128k-token context window, designed to enhance manufacturing and supply chain operations. The model was trained...
Apr 16, 2025
Kubernetes proves crucial in the AI era, boosting a private cloud resurgence
Private cloud technology is experiencing a significant revival, driven by artificial intelligence and digital sovereignty needs. At the recent KubeCon + CloudNativeCon Europe 2025 in London, attendance soared to 12,000 participants, demonstrating that Kubernetes and cloud-native technologies remain essential infrastructure components in the AI era. This renewed interest marks a shift from the hybrid cloud focus of recent years, as companies seek greater control over their data while building AI capabilities. The big picture: Kubernetes and cloud-native technologies are proving remarkably resilient against AI displacement while simultaneously becoming foundational for AI infrastructure development. The technologies that underpin modern cloud deployments...
Apr 15, 2025
KubeCon 2025 draws 12,000 as Kubernetes thrives alongside AI workloads
Kubernetes and cloud-native technologies continue to thrive despite the rise of AI, as demonstrated by the unexpectedly successful KubeCon + CloudNativeCon Europe 2025 in London. With over 12,000 attendees packing the London ExCel center, the 11-year-old conference showcased the enduring importance of core infrastructure technologies that power modern applications, including AI systems themselves. The event revealed how Kubernetes is proving resistant to AI-based development approaches while simultaneously becoming essential infrastructure for AI workloads. Why cloud-native technologies remain robust: Kubernetes and related technologies demonstrate surprising resilience against AI-driven development trends that are impacting other areas of software engineering. Infrastructure tasks require...
Apr 15, 2025
MediaTek’s new chip brings agentic AI to smartphones for autonomous decision-making
MediaTek's new Dimensity 9400+ chipset signals a significant shift in mobile computing as the industry moves toward devices capable of performing complex AI tasks independently. The chip represents a pivotal advancement in on-device artificial intelligence, incorporating agentic AI capabilities that allow smartphones to autonomously make decisions and execute multi-step tasks without continuous human direction, fundamentally changing how users interact with their devices. The big picture: MediaTek's Dimensity 9400+ introduces autonomous decision-making capabilities to mobile devices, moving beyond simple response-based AI to systems that can anticipate needs and execute complex tasks independently. The new system on chip (SoC) features a powerful...
Apr 14, 2025
Far from benched: Nvidia GPUs maintain benchmark top spot in generative AI performance tests
Nvidia's GPU dominance in generative AI benchmarks underscores the company's continued leadership position in the artificial intelligence hardware market. The latest MLPerf benchmark results reveal Nvidia's commanding performance across multiple generative AI tests, with only limited competition from rivals AMD and Google. This benchmark serves as a critical industry measure, offering insights into which chips can best handle the computationally intensive demands of today's most advanced AI applications. The big picture: Nvidia's general-purpose GPU chips have maintained their leadership position in the latest MLPerf benchmark tests, which now include specific measurements for generative AI applications such as large language models....
Apr 12, 2025
AI distillation makes powerful models smaller and more accessible
AI distillation bridges the gap between massive foundation models and practical applications by creating smaller, more efficient AI systems. This approach has become a cornerstone of accessible AI technology, allowing powerful machine learning capabilities to run on everyday devices rather than requiring enormous data centers. Understanding distillation reveals how AI is becoming more democratic and accessible while maintaining much of the quality of larger systems. The big picture: Distillation transfers knowledge from large, complex "teacher" AI models to smaller, more efficient "student" models while preserving much of the original performance capability. The technique was first introduced by Geoffrey Hinton, often...
Apr 11, 2025
Edge AI market set to grow 18x to $182B by 2032 in industrial settings
Edge AI is emerging as a transformative force in industrial applications where real-time decision making is critical, operating far from the spotlight of consumer AI applications. This technology—processing data locally rather than in the cloud—is revolutionizing high-stakes environments from oil rigs to mining operations where latency isn't just inconvenient but potentially dangerous. With the edge AI market projected to grow from $10.11 billion in 2023 to nearly $182 billion by 2032, this lesser-known branch of artificial intelligence is quietly addressing challenges in environments where connectivity, speed, and reliability are non-negotiable requirements. The big picture: Edge AI processes data locally at...
Apr 11, 2025
Tech giants forge AI infrastructure partnerships to meet growing computing demands
Major tech partnerships are transforming the AI infrastructure landscape, with top companies focusing on digital twins, cloud integrations, and increased data center capacity in Asia. These collaborations highlight how the underlying technology powering AI systems is evolving to meet growing demands for more efficient, sustainable computing resources capable of running increasingly complex AI workloads. The big picture: Schneider Electric and ETAP have created what they claim is the first AI factory digital twin, allowing comprehensive simulation of power requirements across AI data centers. The solution runs on NVIDIA Omniverse Cloud APIs and provides real-time monitoring and predictive analytics for AI...
Apr 11, 2025
Dell’s AI server business roars despite whimpering stock price
Dell Technologies is experiencing a significant strategic shift as an AI infrastructure provider despite recent market challenges. While known primarily for its PC business, Dell has positioned itself as a crucial hardware supplier in the AI ecosystem, particularly as a provider of servers that house Nvidia's high-performance chips. The company's remarkable growth in AI-related backlog—doubling from $4.5 billion to $9 billion since October—signals strong enterprise demand for its AI infrastructure products despite the stock price struggling to regain momentum. The big picture: Dell has transformed from an AI market darling to a value opportunity as its stock trades at approximately...
Apr 11, 2025
Google’s Pixel 9A offers same chip but stripped-down AI due to memory limits
Google's Pixel 9A brings the same Tensor G4 chip as its premium siblings but includes significant limitations to its AI capabilities due to hardware constraints. The midrange device comes with a stripped-down version of Google's Gemini AI that only processes text, highlighting how memory constraints increasingly determine AI feature availability in smartphones, even when devices share the same processor. The big picture: The Pixel 9A features a text-only version of Gemini AI due to its more limited 8GB of RAM, compared to the 12GB in the Pixel 9 and 16GB in the Pro models. The device runs an even smaller...
Apr 11, 2025
Forecast in a flash: Cambridge AI researchers assess weather in 1 second
AI weather modeling takes a quantum leap forward as Cambridge researchers demonstrate a system that can match traditional forecasting accuracy in just one second on a desktop computer, compared to hours or days on supercomputers. This breakthrough, named Aardvark Weather, represents a significant shift in the weather prediction landscape by fully replacing both the computationally intensive initialization and forecasting stages that have defined meteorological science since the 1950s. The big picture: A new AI system can produce weather forecasts in a single second on a desktop computer that rival the accuracy of traditional numerical weather prediction (NWP) models requiring massive...
Apr 11, 2025
Samsung plays catch-up, races to reclaim AI memory chip dominance with accelerated HBM timeline
Samsung's push to reclaim leadership in AI memory chips comes at a critical juncture in the semiconductor industry, where high-bandwidth memory has become essential infrastructure for artificial intelligence applications. With growing competition in this lucrative market segment, Samsung's aggressive timeline for introducing next-generation HBM technology signals the company's determination to reassert its traditional dominance after falling behind competitors in capitalizing on the AI computing boom. The big picture: Samsung Electronics plans to strengthen its position in the high-bandwidth memory chip market this year, responding to shareholder criticism over lost opportunities in AI-related semiconductor business. Key details: Jun Young-hyun, who leads...
Apr 11, 2025
Supermicro unveils petascale storage server with NVIDIA Grace CPU for AI workloads
Supermicro's new petascale storage server represents a significant advancement in AI infrastructure by combining NVIDIA's Grace CPU Superchip with high-density storage capabilities specifically designed for AI workloads. This collaboration between Supermicro, NVIDIA, and WEKA addresses the growing demand for efficient, high-performance storage solutions needed to support the massive datasets required for AI and ML operations, potentially transforming how organizations build their AI computing environments. The big picture: Supermicro has introduced a first-of-its-class petascale all-flash storage server optimized for high-performance software-defined storage workloads in AI environments. The system utilizes the NVIDIA Grace CPU Superchip with 144 Arm Neoverse V2 cores, providing...
Apr 11, 2025
New bio-computer combines living neurons with silicon chips for AI breakthrough
A groundbreaking bio-computer merging living neurons with silicon chips has emerged as a potential milestone in AI and neuromorphic computing. Developed by Australia's Cortical Labs, the CL1 bio-computer combines synthetic living brain neurons with artificial neural networks, creating a novel approach that could transform our understanding of both biological and artificial intelligence while raising profound ethical questions about the boundary between machine cognition and living systems. The big picture: The CL1 bio-computer from Cortical Labs represents a significant advancement in neuromorphic computing by integrating lab-grown living neurons with traditional silicon chips for $35,000. The system employs a Biological Intelligence Operating...
Apr 11, 2025
WEKA’s new AI data platform cuts inference costs by 24% through NVIDIA partnership
WEKA's newest AI data platform innovations are set to reshape the technological landscape for enterprises deploying advanced AI systems. The company's expanded partnership with NVIDIA introduces capabilities specifically designed to address memory bottlenecks that have historically limited inference performance for large language models. By combining WEKA's data management expertise with NVIDIA's computing power, these advancements aim to dramatically reduce costs and improve efficiency for organizations deploying increasingly complex AI reasoning and agent-based systems. The big picture: WEKA has strengthened its NVIDIA partnership through integration with the NVIDIA AI Data Platform reference design and achieved new storage certifications for NVIDIA's cloud...
Apr 11, 2025
Supermicro unveils Blackwell Ultra-powered AI systems with 288GB GPU memory at GTC 2025
Supermicro is expanding its AI infrastructure offerings with powerful new systems built on NVIDIA's Blackwell Ultra platform, designed to handle the escalating computational demands of advanced AI workloads. These next-generation systems feature dramatically increased memory capacity and enhanced networking capabilities, positioning Supermicro at the forefront of hardware solutions for training and running sophisticated AI models as organizations scale their AI initiatives. The big picture: Supermicro has unveiled new AI computing systems powered by NVIDIA's Blackwell Ultra platform at GTC 2025, featuring the NVIDIA HGX B300 NVL16 and GB300 NVL72 configurations for high-performance AI workloads. Key details: The new systems leverage...
Apr 10, 2025
AI factories emerge as the backbone of the next industrial transformation, minus the smokestacks
AI factories are emerging as the engine of the next industrial revolution, transforming how organizations generate value from artificial intelligence. Unlike traditional data centers that primarily store and process information, these specialized facilities manufacture intelligence at scale by orchestrating the entire AI lifecycle. This shift is enabling enterprises to convert massive data investments into immediate competitive advantages rather than waiting for long-term returns, with token throughput—the ability to generate real-time AI predictions—becoming the critical measure of performance. The big picture: AI factories represent a fundamental reimagining of computing infrastructure, optimized specifically for AI production rather than general-purpose computing. These specialized...
Apr 10, 2025
Thinker vs. Tinker: 76% of AI researchers doubt scaling alone will achieve AGI, despite Big Tech buildout
AI researchers overwhelmingly reject the tech industry's scaling strategy for achieving artificial general intelligence (AGI), with 76 percent believing that simply throwing more computing power at existing models is unlikely to succeed. This skepticism comes as companies continue pouring billions into AI infrastructure, highlighting a critical disconnect between research consensus and industry investment strategies that could reshape the future direction of AI development. The big picture: A new survey of 475 AI researchers by the Association for the Advancement of Artificial Intelligence reveals that the dominant industry approach of scaling up current models is widely considered insufficient for reaching human-level...