News/Computing

May 19, 2025

Nvidia software gains full adoption across Taiwan

Nvidia is transforming Taiwan into a massive hub for its AI software ecosystem, exemplifying CEO Jensen Huang's vision of "sovereign AI" where nations build local computing infrastructure to maintain control over artificial intelligence development. At the annual Computex conference in Taipei, Nvidia unveiled several partnerships that position Taiwan as a crucial center for AI innovation and semiconductor manufacturing, leveraging relationships with the island's technology giants to strengthen its dominant position in the global AI infrastructure market. The big picture: Nvidia is systematically embedding its software across Taiwan's technology landscape, creating a comprehensive AI infrastructure throughout the island nation. Taiwan Semiconductor...

read
May 19, 2025

AI chips evolve as imec lab pursues programmable designs

The rapid evolution of artificial intelligence algorithms is creating an unexpected challenge for chipmakers, with traditional design approaches potentially becoming a bottleneck for future AI development. According to Imec CEO Luc Van den hove, the semiconductor industry must pivot toward more flexible, reconfigurable architectures to keep pace with AI's swift advancement, or risk creating costly hardware that becomes obsolete before it's even deployed. The big picture: One of the world's leading semiconductor R&D organizations warns that current chip design strategies focusing on raw computing power are creating unsustainable energy consumption, rising costs, and slow hardware development cycles. "There is a...

read
May 19, 2025

Dover bolsters data center offerings with Germany’s Sikora purchase

Dover is expanding its data center technology portfolio through a strategic acquisition targeting AI computing infrastructure growth. The company's $622 million purchase of Germany's Sikora adds specialized capabilities in measuring cable parameters for data centers—a market experiencing sustained demand despite economic uncertainty. This acquisition aligns with industry trends showing continued investment in AI computing infrastructure, as evidenced by other players like Eaton maintaining strong growth projections in this sector. The big picture: Dover is strengthening its position in the data center infrastructure market by acquiring Sikora for $622 million in cash, adding specialized measurement systems for data center cables. Sikora...

read
May 19, 2025

Nvidia eyes global AI reach with new tools and ecosystem updates

Nvidia's latest technological announcements at Taiwan's Computex event signal its strategic push to maintain dominance in the AI computing boom. CEO Jensen Huang unveiled new products and ecosystem updates specifically designed to expand Nvidia's market reach while eliminating barriers to AI adoption across industries and geographies. This aggressive expansion reinforces Nvidia's determination to remain the central player in AI acceleration as the technology continues to reshape global industries. The big picture: Nvidia CEO Jensen Huang used Asia's largest electronics forum to introduce new technologies aimed at sustaining the company's central position in the AI computing revolution. Key details: Huang unveiled...

read
May 17, 2025

AI models evolve: Understanding Mixture of Experts architecture

Mixture of Experts (MoE) architecture represents a fundamental shift in AI model design, offering substantial improvements in performance while potentially reducing computational costs. Initially conceptualized by AI pioneer Geoffrey Hinton in 1991, this approach has gained renewed attention with implementations from companies like Deepseek demonstrating impressive efficiency gains. MoE's growing adoption signals an important evolution in making powerful AI more accessible and cost-effective by dividing processing tasks among specialized neural networks rather than relying on monolithic models. How it works: MoE architecture distributes processing across multiple smaller neural networks rather than using one massive model for all tasks. A "gatekeeper"...

read
May 16, 2025

Computex 2025 showcases Asia’s latest tech innovations

Computex 2025 is poised to reveal the roadmap for PC computing's next phase with its early-scheduled May 19-23 run in Taipei. While CES claims the spotlight for broad consumer tech launches, Computex's strategic proximity to Asian manufacturing hubs positions it as the more consequential venue for core PC technology developments. This year's show promises critical hardware announcements with Nvidia's Jensen Huang headlining a major press event that signals the significance of the developments to come. The big picture: Computex 2025 will showcase the next generation of computing hardware with particular emphasis on budget components, AI processing capabilities, and specialized enterprise...

read
May 16, 2025

Google AI chatbot solves advanced math and science problems

Google DeepMind's AlphaEvolve represents a significant advancement in applying large language models to solve complex problems in mathematics and computer science. By combining an LLM's creative capabilities with rigorous evaluation algorithms, this general-purpose AI system has already tackled longstanding mathematical challenges and delivered practical efficiencies for Google's computing infrastructure. Unlike previous AI scientific tools that were custom-built for specific tasks, AlphaEvolve's general-purpose design signals a potential shift toward more versatile AI systems that can generate novel solutions across multiple domains. The big picture: Google DeepMind has created AlphaEvolve, an AI system that uses chatbot models to solve complex problems in...

read
May 14, 2025

Vegas-based TensorWave secures $100M to boost AI infrastructure development

TensorWave's massive $100 million Series A funding round signals growing investor confidence in AI infrastructure as a critical growth sector. The Las Vegas-based startup has positioned itself strategically in a market projected to exceed $400 billion by 2027, offering essential tools and infrastructure for training AI models and optimizing workloads. This investment comes during a transformative period where access to specialized computing resources has become essential for organizations developing and deploying sophisticated AI systems. The big picture: TensorWave secured a $100 million Series A funding round led by Magnetar and AMD Ventures, with participation from existing and new investors including...

read
May 14, 2025

AI agent AlphaEvolve creates algorithms surpassing human expertise

Google DeepMind's AlphaEvolve represents a significant leap in AI's ability to create novel algorithmic solutions rather than simply remixing existing knowledge. By combining Gemini's coding capabilities with evolutionary design methods, this system has created provably new algorithms that outperform human-designed approaches that have remained unchallenged for decades. This breakthrough demonstrates AI's emerging capacity to generate genuinely innovative solutions to computational problems, particularly those relevant to advancing AI itself. The big picture: Google DeepMind has developed AlphaEvolve, an AI system that designs algorithms that surpass human expertise in specific computational domains, including improvements to a matrix calculation method that has remained...

read
May 12, 2025

AI researchers test LLM capabilities using dinner plate-sized chips

The Cerebras WSE processor is revolutionizing AI capabilities with unprecedented computing power and speed. This dinner plate-sized chip represents a significant departure from traditional processors, offering hundreds of thousands of cores and remarkable context capabilities that are transforming how industries handle large language models and complex data processing tasks. Understanding these hardware advances is crucial as organizations seek competitive advantages through faster and more powerful AI implementations. The big picture: The Cerebras Wafer Scale Engine (WSE) represents a breakthrough in AI computing hardware, with its massive size and processing power enabling previously impossible AI capabilities. At 8.5 x 8.5 inches—roughly...

read
May 12, 2025

INTELLECT-2 launches 32B parameter AI model with global training

Prime Intellect has achieved a significant milestone in AI development with INTELLECT-2, pioneering a novel approach to training large language models through distributed computing. This 32B parameter model represents the first of its kind to utilize globally distributed reinforcement learning across a network of decentralized contributors, potentially democratizing the resource-intensive process of AI model training and opening new pathways for collaborative AI development outside traditional centralized infrastructure. The big picture: Prime Intellect has released INTELLECT-2, a groundbreaking 32B parameter language model that employs globally distributed reinforcement learning across a decentralized network of compute contributors. The model is the first of...

read
May 8, 2025

NVIDIA Blackwell powers Cadence’s AI-driven engineering design

Cadence's new Millennium M2000 Supercomputer represents a significant leap in computational power for engineering and life sciences, featuring NVIDIA's Blackwell architecture to deliver up to 80x performance gains over CPU-based predecessors. This collaboration between NVIDIA and Cadence aims to accelerate breakthrough development in autonomous machines, drug discovery, semiconductor design, and data center optimization through massive parallel computing capabilities and specialized software optimizations. The big picture: Cadence is launching a new supercomputer powered by NVIDIA's Blackwell architecture that dramatically accelerates computational workflows for engineering and scientific applications. The Millennium M2000 Supercomputer integrates NVIDIA HGX B200 systems and RTX PRO 6000 Blackwell...

read
May 6, 2025

Emperor’s New Groove: IBM unveils AI-powered LinuxONE Emperor 5 mainframe

IBM's new LinuxONE Emperor 5 mainframe represents a significant fusion of traditional enterprise computing with modern AI capabilities, signaling a strategic shift in how massive computing resources can be deployed for mission-critical AI workloads. This fifth-generation system introduces unprecedented levels of processing power, security features, and cost efficiency while addressing the growing enterprise demand for dedicated, high-performance AI infrastructure that doesn't sacrifice the reliability mainframes are known for. The big picture: IBM's LinuxONE Emperor 5 transforms the concept of mainframes from legacy technology into a cutting-edge AI powerhouse tailored for enterprise-grade computing needs. The system features the new IBM Telum...

read
May 1, 2025

Alibaba’s Qwen releases AI model for consumer devices

Alibaba's new Qwen2.5-Omni-3B model represents a significant advancement in making multimodal AI accessible on consumer-grade hardware. This lightweight variant maintains impressive capabilities across text, audio, image, and video processing while dramatically reducing resource requirements. The development highlights the industry's growing focus on efficient AI systems that can operate outside of enterprise environments, potentially bringing sophisticated multimodal capabilities to a much wider range of applications and devices. The big picture: Alibaba's Qwen team has released Qwen2.5-Omni-3B, a compact 3-billion-parameter multimodal AI model that retains over 90% of the performance of its larger 7B counterpart while cutting GPU memory requirements by more...

read
May 1, 2025

AI models on Linux made easy with new user-friendly app

GPT4ALL simplifies running local AI models on Linux, offering users both privacy and a robust feature set. This open-source application joins the growing ecosystem of desktop AI tools that allow users to interact with large language models without sending queries to cloud services. While many AI tools require web access, desktop applications like GPT4ALL enable completely private AI interactions by running models locally on personal hardware. Installation steps for running GPT4ALL on Ubuntu-based Linux distributions 1. Download the installer Navigate to the GPT4ALL website and download the Linux installer file gpt4all-installer-linux.run to your Downloads folder. The application supports multiple operating...

read
Apr 30, 2025

Spreading out: Startups build cutting-edge AI models without data centers

A new approach to AI model training could disrupt the centralized power structure that has dominated artificial intelligence development. By using distributed computing across regular GPUs connected via the internet, two startups have demonstrated an alternative path that might challenge the resource-intensive model building methods that currently give tech giants their competitive edge in AI development. The big picture: Researchers from Flower AI and Vana have successfully trained a language model called Collective-1 using GPUs spread across the globe rather than concentrated in datacenters. This distributed approach allowed them to incorporate both public and private data sources, including messages from...

read
Apr 28, 2025

CUDA engineers can now use RightNow AI’s vibe coding in V2.0

RightNow AI has launched version 2.0 of its CUDA optimization tool, climbing to the top of Product Hunt rankings with its innovative approach to GPU programming. The platform allows developers to automatically profile, detect bottlenecks, and optimize CUDA kernels without writing code (vibe coding), potentially democratizing high-performance computing optimization that typically requires specialized expertise. Its impressive 4.93/5 rating and rapid climb to #1 daily rank suggest the tool is addressing a significant pain point in GPU development. The big picture: RightNow AI's V2.0 platform promises to simplify CUDA kernel optimization through an automated, no-code approach to GPU performance tuning. The...

read
Apr 26, 2025

NVIDIA Blackwell platform slashes water use by 300x

Liquid cooling technology is emerging as a critical solution for managing the intense heat generated by AI computing systems, addressing a growing challenge in data center operations. As AI models continue to scale in size and complexity, traditional air cooling methods are becoming inadequate for dissipating heat from high-density server racks that now support over 135 kW compared to just 20 kW in previous generations. This shift toward liquid cooling represents not only a technical necessity but a strategic approach to improving energy efficiency and performance as AI infrastructure demands intensify. The big picture: NVIDIA's liquid-cooled systems are transforming data...

read
Apr 26, 2025

Data analytics acceleration solves AI’s hidden bottleneck

The untold analytics bottleneck is slowing down enterprise AI adoption despite the industry's obsession with larger models and faster inference chips. While executives tout their generative AI implementations, engineers face growing data preparation challenges that consume up to 80% of data scientists' time and over 30% of the AI pipeline. This hidden infrastructure problem threatens to widen the gap between AI investments and actual returns as traditional CPU-bound architectures struggle to efficiently process the massive datasets needed for modern AI applications. The big picture: While the AI industry focuses on model size and training capabilities, data preparation has emerged as...

read
Apr 25, 2025

Neuromorphic computing mimics human brain for smarter AI

Neuromorphic computing is emerging as a transformative technology that mimics the human brain's architecture to create more efficient computing systems. With the global market projected to reach $1.81 billion by 2025 and growing at a remarkable 25.7% CAGR according to The Business Research Company, this field represents a significant shift in computational approaches. The technology's ability to emulate the adaptability and learning capacity of the human brain is creating new possibilities for IoT applications and opening career opportunities for professionals with specialized skills. The big picture: Neuromorphic computing systems are designed to work like the human brain rather than traditional...

read
Apr 25, 2025

Intel develops in-house AI chips to compete with Nvidia

Intel's new CEO Lip-Bu Tan faces a daunting challenge in taking on Nvidia's AI chip dominance, confronting years of missteps that left the company trailing in the most lucrative tech market of the decade. Unlike previous strategies that relied heavily on acquisitions, Tan is pivoting to an in-house development approach focused on optimizing Intel's existing portfolio for emerging AI applications like robotics and intelligent agents, while acknowledging there's "no quick fix" to the company's competitive positioning. The big picture: Intel's new CEO is implementing a comprehensive strategy overhaul to challenge Nvidia in the AI chip market, where Intel has repeatedly...

read
Apr 25, 2025

AI optimizes complex coordinated systems in groundbreaking approach

MIT researchers have developed a revolutionary diagram-based approach to optimizing complex interactive systems, particularly deep learning algorithms. Their new method simplifies the optimization of AI models to the point where improvements that previously took years to develop can now be sketched "on a napkin." This breakthrough addresses a critical gap in the field of deep learning optimization, potentially transforming how engineers design and improve AI systems by making complex operations more transparent and efficient. The big picture: MIT researchers have created a new diagram-based "language" rooted in category theory that dramatically simplifies the optimization of complex interactive systems and deep...

read
Apr 25, 2025

Hardcore hardware: Gigabyte unveils new GeForce RTX 5060 GPU lineup with 16GB option

Gigabyte launches its RTX 5060 and 5060 Ti GPU lineup based on NVIDIA's new Blackwell architecture, targeting gamers, creators, and AI developers who need reliable performance at 1080p and 2K resolutions. The series features six distinct design families, from the premium Aorus Elite with advanced cooling systems to the compact low-profile option for small form factor builds. These cards represent NVIDIA's latest push to make AI capabilities more accessible to mainstream users through NIM microservices support for building personalized AI assistants. The big picture: The new RTX 5060 series cards bridge the gap between gaming performance and everyday AI capabilities,...

read
Apr 24, 2025

Rust gets multi-platform compute boost with CubeCL

CubeCL represents a significant advancement in GPU programming, offering Rust developers a native way to write high-performance compute kernels across multiple hardware platforms. This open-source language extension aims to simplify GPU programming while maintaining Rust's safety guarantees and performance benefits, potentially transforming how developers approach hardware-accelerated computing tasks from machine learning to scientific computing. The big picture: CubeCL provides a Rust-based solution for GPU programming that works across multiple hardware platforms while leveraging Rust's strengths in safety and performance. The project allows developers to write GPU code directly in Rust using familiar syntax and zero-cost abstractions rather than learning separate...

read
Load More