News/Computing
Qualcomm’s new chip enhances CPU and GPU for midrange phones
It can be a great thing, being mid. The mobile processor market is seeing significant advancement in budget segment capabilities with Qualcomm's latest announcement. The Snapdragon 6 Gen 4 represents a substantial upgrade for affordable, midrange Android phones, bringing features previously reserved for higher-end devices. Core improvements: The new Snapdragon 6 Gen 4 processor delivers an 11% CPU performance boost and 29% GPU enhancement compared to its predecessor. The chip features a modern octa-core CPU configuration with Cortex-A720 and Cortex-A520 cores, replacing the older Cortex-A78 and Cortex-A55 architecture Gaming capabilities are enhanced through features like Game Super Resolution for visual...
read Feb 11, 2025Baidu CEO thinks more AI investment is needed, despite success of DeepSeek
The rise of artificial intelligence has sparked intense competition among tech companies in China and globally to develop powerful language models. Chinese tech giant Baidu has been at the forefront of this race, despite facing challenges from emerging competitors and U.S. chip sanctions. Current landscape: Baidu CEO Robin Li emphasizes the continued importance of investing in cloud infrastructure and computing power for developing advanced AI models, even as new approaches challenge traditional assumptions. Speaking at the World Government Summit in Dubai, Li stressed that superior AI models still require substantial computing resources Computing power in this context refers to the...
read Feb 10, 2025NXP acquires edge AI chipmaker Kinara for $307M
The semiconductor industry is seeing increased consolidation in the edge AI space as companies position themselves for the growing demand in industrial IoT and automotive markets. NXP Semiconductors' acquisition of Kinara represents a strategic move to strengthen its AI processing capabilities at the network edge. Deal Overview: NXP Semiconductors has announced plans to acquire US-based edge AI chipmaker Kinara for $307 million, with the transaction expected to close in the first half of 2025. The acquisition will integrate Kinara's neural processing units (NPUs) and AI software into NXP's industrial and IoT processor portfolio The two companies already have an existing...
read Feb 10, 2025Pro-tip: Key steps to choosing the right AI agent platform
The rise of AI agent platforms has created new challenges for CIOs and IT leaders who must carefully evaluate these tools before implementation. Selecting the right AI agent builder platform requires assessing multiple technical and operational factors to ensure successful deployment and long-term value. Initial evaluation criteria: Before selecting an AI agent platform, organizations must first examine the core building environment and development tools to ensure they align with team capabilities and project requirements. The platform should provide an intuitive interface for testing and deploying agents while incorporating essential features like memory management and responsible AI safeguards Usage tracking and...
read Feb 10, 2025How Georgia Tech is preparing its students for an AI-driven future
The advancement of artificial intelligence has created an urgent need for universities to prepare students for an AI-driven workforce. Georgia Tech has emerged as a leader in AI education by implementing comprehensive programs and resources that make advanced computing accessible to students across all disciplines. Pioneering AI Infrastructure: Georgia Tech and NVIDIA have established the first AI supercomputer specifically designed for student use at a U.S. university, marking a significant shift in democratizing access to advanced computing resources. The AI Makerspace, powered by this supercomputer, is available to all students regardless of their field of study A team of 60...
read Feb 10, 2025AI progress breaks Moore’s Law as AGI looms, says Altman
The development of artificial intelligence capabilities is progressing at a pace far exceeding Moore's Law, with token costs dropping approximately 150x between early 2023 and mid-2024. OpenAI CEO Sam Altman's recent analysis suggests this acceleration signals artificial general intelligence (AGI) - AI systems that match or exceed human intelligence - is approaching faster than previously anticipated. Key metrics and trends: Token costs for AI systems are declining at a rate of approximately 10x every 12 months, dramatically outpacing the traditional semiconductor industry's growth curve defined by Moore's Law. ChatGPT's operational costs have plummeted, with token prices falling about 150x in...
read Feb 8, 2025OpenAI evaluating these 3 states for its massive Stargate AI facility
Latest developments: OpenAI is evaluating locations in Pennsylvania, Oregon, and Wisconsin for its ambitious Stargate project, a massive AI infrastructure initiative that could represent a $500 billion investment. OpenAI has already established a pilot site in Texas and plans to expand with additional data centers The company is currently seeking proposals from top firms to design and build these facilities More specific details about Pennsylvania projects are expected to be announced on Thursday Infrastructure requirements: The project aims to develop advanced computing capabilities necessary for next-generation AI systems that can tackle complex problems in science and medicine. According to OpenAI's...
read Feb 7, 2025The AI memory wall: How Micron’s next-gen hardware is unlocking edge AI’s full potential
The rapid evolution of edge AI is reshaping how we think about artificial intelligence, moving from cloud-dependent processing to powerful on-device capabilities that demand unprecedented memory performance. As the industry grapples with the growing "memory wall" challenge—where AI processors outpace memory systems' ability to supply data—innovative solutions like Processing-in-Memory (PIM) technology are emerging as potential game-changers for the future of edge computing. The evolution of edge AI: Edge computing, where AI processing occurs directly on devices rather than in the cloud, is becoming increasingly critical for real-world AI applications and user experiences. Industry leaders now widely accept that AI processing...
read Feb 7, 2025Amazon is priming to invest $100B in AI infrastructure in 2025
Amazon plans to invest approximately $100 billion in AI infrastructure during 2025, with CEO Andy Jassy highlighting unprecedented opportunities in the AI space. Core announcement: Amazon's Q4 capital expenditure of $26.3 billion represents the expected quarterly spending rate for 2025, with the majority allocated to AI infrastructure for AWS. The investment supports AWS's AI business, which is growing at triple-digit rates year-over-year AWS generated Q4 net sales of $28.8 billion, a 19% increase from the previous year Operating income for AWS reached $10.6 billion in Q4, up from $7.2 billion in Q4 2023 Technical infrastructure details: Amazon is developing a...
read Feb 5, 2025RAND Corporation on what DeepSeek means for AI competition
Breaking developments in AI competition: Chinese tech company DeepSeek has released two AI models that match the capabilities of leading US models while reportedly requiring significantly less computational resources. Key achievements: DeepSeek's new models represent a significant advancement in AI efficiency and accessibility, with potentially major implications for the global AI landscape. The V3 model has achieved performance parity with GPT-4 DeepSeek's R1 reasoning model matches OpenAI's o1 while requiring only about 4% of the computational resources The company claims to have trained V3 for approximately $5.6 million, though this figure may not reflect total development costs Technical context: While...
read Feb 5, 2025DeepSeek’s clever efficiency upends the global AI race
DeepSeek, a Chinese AI company, has released a new AI model that operates at significantly lower costs while maintaining competitive performance capabilities. Core innovation: DeepSeek-R1 represents a major advancement in AI efficiency, operating at up to 50 times lower cost than comparable U.S. models while being capable of running on standard laptop hardware rather than specialized chips. The model was reportedly developed for just $6 million, though this figure excludes significant operational and infrastructure costs DeepSeek achieved this efficiency through advanced techniques including a "mixture of experts" architecture that selectively activates only relevant parts of the model Additional optimization methods...
read Feb 1, 2025What the headlines on DeepSeek are missing, according to RAND
The DeepSeek AI company has achieved significant technical progress while operating under U.S. export controls on advanced AI chips to China, demonstrating both efficiency gains and limitations in the current regulatory landscape. Key developments: DeepSeek has managed to train advanced AI models using Nvidia H800 chips, which were specifically designed to comply with initial U.S. export controls. The company trained its V3 model using 2,000 H800 chips, showing impressive efficiency DeepSeek previously operated Asia's first 10,000 Nvidia A100 cluster and reportedly maintains 50,000 "Hopper" chips The timing of their R1 model release coincided with President Trump's inauguration, potentially for strategic...
read Jan 31, 2025Canadian bitcoin miner Bitfarms considers pivot to AI data centers
Bitfarms, a Canadian bitcoin mining company, is exploring the possibility of converting some of its facilities into AI data centers, joining other crypto miners in diversifying their operations. Strategic pivot details; Bitfarms has hired Appleby Strategy Group and World Wide Technology to assess its North American facilities for potential AI data center conversion. The consultants will analyze site capabilities and develop computing and AI strategies They will also help market these facilities to potential customers This move follows similar industry trends, with competitor Riot Platforms recently announcing a review of AI computing possibilities at its Texas facility Infrastructure advantages; Bitcoin...
read Jan 31, 2025Los Alamos Lab teams with OpenAI to boost national security
Los Alamos National Laboratory has formed a groundbreaking partnership with OpenAI to deploy advanced AI reasoning models on the Lab's Venado supercomputer for national security research. Partnership overview: Los Alamos National Laboratory will integrate OpenAI's o-series models into its Venado supercomputer, equipped with NVIDIA GH200 Grace Hopper Superchips, to address complex scientific and security challenges. The Venado supercomputer will be transferred to a secure, classified network for shared use among researchers from Los Alamos, Lawrence Livermore, and Sandia national labs This marks the first implementation of OpenAI's latest reasoning models for energy and national security applications Los Alamos has previously...
read Jan 30, 2025DeepSeek launches compact AI models for edge computing
DeepSeek has released new compact language models that can operate directly on edge devices, marking a significant advancement in edge computing and artificial intelligence for IT operations (AIOps). Key innovation; DeepSeek's R1 model enables large language models (LLMs) to run on local devices like laptops while maintaining high performance and providing transparent explanations for its outputs. The model claims performance comparable to top-tier alternatives while requiring fewer computational resources A key differentiator is the model's ability to explain its decision-making process by default The development leveraged synthetic data for training, helping overcome traditional data limitations Edge computing implications; The ability...
read Jan 29, 2025SoftBank teams with Quantinuum to advance quantum computing applications
SoftBank and Quantinuum have formed a strategic partnership to advance quantum computing applications, aiming to overcome current AI limitations through hybrid computing solutions that combine traditional and quantum processing capabilities. Partnership overview: The collaboration, announced during the International Year of Quantum Science and Technology 2025, focuses on developing practical quantum computing solutions for real-world applications. The partnership seeks to address complex optimization problems, causal relationship analysis, and high-precision simulations that current AI technologies struggle to solve Both companies will explore hybrid computing systems that integrate CPUs, GPUs, and QPUs (Quantum Processing Units) to enhance computational capabilities The initiative aims to...
read Jan 29, 2025OpenAI CEO vows to outperform DeepSeek, doubling down on costly computing strategy
The artificial intelligence industry is experiencing a pivotal moment as OpenAI CEO Sam Altman grapples with a direct challenge to his company's resource-intensive development strategy, following Chinese startup DeepSeek's demonstration that superior AI models can be built with significantly less computing power. The situation has forced Altman to defend OpenAI's approach while acknowledging DeepSeek's achievements, highlighting a growing tension between traditional high-compute methods and emerging efficient alternatives that could reshape the future of AI development. Market disruption: DeepSeek's R1 AI model has demonstrated superior performance compared to established players while using significantly less computing power, triggering a trillion-dollar decline in...
read Jan 28, 2025Cool new AI tech is coming to the rescue of old and strained power grids
The rapid growth of AI computing and data centers is creating unprecedented electricity demand, prompting innovation in power grid technology. The core challenge: Power utilities face mounting pressure to deliver more electricity to data centers and other growing sources of demand, with distribution rather than generation emerging as the primary constraint. Data center operators and AI companies cite electricity access as their top operational concern Traditional grid infrastructure expansion is slow and expensive, creating opportunities for technological alternatives Growing adoption of electric vehicles and heat pumps adds to the strain on power grids Innovative solution: Veir, Inc. is developing superconducting...
read Jan 24, 2025Billionaire Mukesh Ambani to build world’s largest data center in India
Mukesh Ambani's Reliance Group has begun construction on what could become the world's largest data center by capacity in India, positioning the company to compete in the rapidly growing artificial intelligence services market. Project scope and significance; The ambitious data center project represents a major infrastructure investment by India's largest company, aiming to establish unprecedented data processing capacity in the region. The facility, once completed, is projected to surpass current global leaders in terms of data storage and processing capabilities The project aligns with India's growing role as a major technology hub and its increasing demand for advanced computing infrastructure...
read Jan 24, 2025Meta aims to double its GPU count to 1.3M for AI development
Meta CEO Mark Zuckerberg plans to significantly expand the company's AI computing capabilities by doubling its GPU count to 1.3 million units by the end of the year. Strategic objectives: Meta aims to develop cutting-edge AI assistants and launch its Llama 4 model to compete with industry leaders ChatGPT and Google Gemini. Zuckerberg expects Meta AI to serve over 1 billion users in 2025 The company plans to create an "AI engineer" capable of contributing to Meta's R&D efforts This expansion follows recent layoffs of 3,600 Meta employees Infrastructure investment: Meta is planning a massive data center expansion that will...
read Jan 24, 2025Google DeepMind CEO offers inside look into company’s AI innovations
Google DeepMind has developed more cost-effective AI processing methods that could provide a significant advantage in the ongoing competition among major tech companies. Breakthrough in AI Processing; Google has innovated a more efficient approach to running AI models through new "light chips" that could dramatically reduce operational costs. The new processors build upon Google's decade-long development of Tensor Processing Units, maintaining the same fundamental architecture These chips specifically address the growing computational demands of "inference" - the process of AI models executing tasks, similar to human thinking The innovation comes at a crucial time when running advanced AI models has...
read Jan 22, 2025Microsoft and OpenAI reshape partnership allowing rival cloud services for AI model training and deployment
Microsoft and OpenAI's latest strategic moves herald significant changes in their exclusive partnership, with Microsoft taking a more flexible approach to OpenAI's cloud computing relationships. Key developments: Microsoft has adjusted its stance on OpenAI's cloud computing partnerships, allowing the AI company to work with other providers while maintaining its substantial investment position. Microsoft's $14 billion investment in OpenAI remains intact, but the company is now permitting OpenAI to utilize rival cloud services for AI model training and deployment The shift was revealed through a Microsoft blog post that outlined the modified terms of their previously exclusive arrangement OpenAI has joined...
read Jan 20, 2025Microsoft unveils Windows 11: a redesign for productivity, gaming, and security
Microsoft has made its first major update to the Windows operating system since Windows 10, introducing Windows 11 with a redesigned interface and new features focused on productivity and security. Key Features and Design Changes: Windows 11 represents a significant visual overhaul, featuring a centered Start menu and taskbar, rounded corners, and a more modern aesthetic approach. The new Start menu abandons the live tiles from Windows 10 in favor of a simpler grid of pinned apps and recommended files Snap Layouts and Snap Groups provide enhanced window management capabilities, allowing users to organize multiple windows in pre-configured arrangements The taskbar...
read Jan 19, 2025To the leader of the AI compute race will go the spoils
The global race for AI computing power is intensifying as nations and corporations compete for technological dominance, with investments in data centers and computing infrastructure becoming increasingly critical for economic and military supremacy. The competitive landscape: The United States and China have emerged as the primary contenders in the battle for AI computing supremacy, with Saudi Arabia rising as a notable challenger. U.S. private companies are investing $30 billion annually in data centers, more than double the previous year's spending China has invested $6.12 billion in AI data centers and has made artificial intelligence a national priority Saudi Arabia is...
read