News/Computing
Forward-deployed engineers become the hottest AI role as companies race to implement
Forward-deployed engineers are emerging as one of the most crucial roles in AI, with companies scrambling to find talent that can bridge the gap between cutting-edge research and real-world implementation. Unlike traditional software engineers who build products for mass use, these specialists embed within individual companies to identify automation opportunities and customize AI solutions, making them essential for turning AI breakthroughs into practical business value. What you should know: The role was popularized by Palantir, a data analytics company, and has become a hot topic among AI startup founders seeking to scale their technologies effectively. Forward-deployed engineers work directly inside...
Jul 10, 2025
AWS upgrades SageMaker with observability tools to boost AI development
AWS has unveiled significant upgrades to SageMaker, its machine learning and AI model training platform, adding new observability capabilities, connected coding environments, and GPU cluster performance management. These enhancements aim to solidify AWS's position as the infrastructure backbone for enterprise AI development, even as competition intensifies from Google and Microsoft in the AI acceleration space. What you should know: The SageMaker updates directly address customer pain points in AI model development and deployment. SageMaker HyperPod observability enables engineers to examine various layers of the stack, including compute and networking layers, with real-time alerts and dashboard metrics when performance issues arise....
Jul 7, 2025
German firm makes DeepSeek AI 200% faster with 90% of original performance
German AI consulting firm TNG Technology Consulting GmbH has released DeepSeek-TNG R1T2 Chimera, a significantly faster variant of DeepSeek's popular open-source reasoning model R1-0528. The new model delivers 90% of the original's intelligence while generating responses with 60% fewer tokens, translating to 200% faster inference and dramatically lower compute costs for enterprises. What you should know: R1T2 represents a breakthrough in AI model efficiency through TNG's Assembly-of-Experts (AoE) methodology, which merges multiple pre-trained models without additional training. The model combines three parent models: DeepSeek-R1-0528, DeepSeek-R1, and DeepSeek-V3-0324, creating what TNG calls a "Tri-Mind" configuration. Unlike traditional training approaches, AoE selectively...
Jun 24, 2025
Deal with this: Dell’s 32GB Copilot+ laptop hits lowest price at $799!
Best Buy is offering the Dell 16 Plus Copilot+ PC with 32GB RAM and 1TB SSD for $799.99, down from its regular price of $1,250. The 36% discount represents the lowest price tracked to date for this touchscreen laptop powered by AMD's Ryzen AI 7 350 processor, delivering serious value for users seeking AI-enhanced computing capabilities. Key specifications: The Dell 16 Plus features AMD's Ryzen AI 7 350 chip with an octa-core CPU and Radeon 860M integrated GPU, paired with 32GB of RAM and 1TB SSD storage. The laptop includes a 16-inch FHD+ (1,920 x 1,200) touchscreen display with a...
Jun 24, 2025
Only 32 countries have the computing power to build advanced AI as Africa hopes to catch up
New research reveals that artificial intelligence is creating a stark global digital divide, with just 32 countries possessing the computing power necessary to build cutting-edge AI systems. The uneven distribution of AI infrastructure is fracturing the world between nations with advanced data centers and those forced to rely on remote access, fundamentally reshaping global power dynamics in the digital age. The big picture: The European Union leads with 28 AI-capable data centers, followed by the US with 26 and China with 22, collectively housing more than half of the world's most powerful facilities mapped in the Oxford University study. The...
Jun 24, 2025
Think locally, scrape globally: Web scraping gets easier with Oxylabs’ 175M proxy pool and AI tools
ZDNET's David Gewirtz tested Oxylabs, a web scraping proxy service that offers over 175 million residential proxies across 195 countries, significantly outpacing competitors like IPRoyal (32 million) and MarsProxies (1 million). The platform combines extensive proxy infrastructure with AI-powered data parsing tools, positioning it as a comprehensive solution for enterprise-scale web scraping operations. What you should know: Oxylabs distinguishes itself through both scale and ethical sourcing practices in the competitive proxy services market. The company maintains 175 million residential proxies across 195 countries, dwarfing competitors' offerings. Residential proxy access is obtained ethically by paying computer owners small amounts for bandwidth...
Jun 23, 2025
Google’s ChromeOS adds practical AI features that outshine competitors
Google has quietly unleashed a suite of artificial intelligence features for ChromeOS that could reshape how business users interact with their computers. Unlike flashy AI chatbots that require separate applications, these tools integrate directly into everyday workflows, offering contextual assistance without interrupting productivity. The standout feature, Text Capture, demonstrates Google's strategic approach to practical AI implementation. Rather than forcing users to learn new interfaces, it works seamlessly within existing applications, analyzing on-screen content and suggesting relevant actions. This represents a significant shift toward ambient computing, where AI assistance becomes invisible infrastructure rather than a separate tool. For business users evaluating...
Jun 23, 2025
AI computing divide leaves 150+ nations without critical infrastructure
Artificial intelligence computing power is creating a stark global divide, with only 32 countries hosting AI-specialized data centers while more than 150 nations have no such infrastructure. This digital gap is reshaping geopolitics and economics, as nations with advanced AI capabilities gain significant advantages in scientific research, business automation, and technological sovereignty, while those without face mounting challenges in talent retention and economic development. The big picture: The United States, China, and the European Union dominate the AI computing landscape, hosting more than half of the world's most powerful data centers used for developing complex AI systems. American companies operate...
Jun 20, 2025
AI and SaaS markets converge into $939B opportunity by 2025
Enterprise technology spending is undergoing its most dramatic transformation since the cloud migration began 15 years ago. Two massive markets are reshaping how businesses allocate their technology budgets: artificial intelligence spending projected to reach $644 billion in 2025, growing at a staggering 76.4% year-over-year, while Software-as-a-Service (SaaS) is expected to hit $295 billion with steady 18.4% compound annual growth rate (CAGR). However, these aren't competing markets—they're converging. The companies that understand this convergence, rather than viewing it as a zero-sum competition, will capture the lion's share of this combined $939 billion opportunity. The tale of two markets SaaS has earned...
Jun 18, 2025
SandboxAQ releases 5.2M synthetic molecules to accelerate AI drug discovery
SandboxAQ, an AI startup spun out of Google and backed by Nvidia, has released a massive dataset of 5.2 million synthetic molecular structures designed to accelerate drug discovery by predicting how pharmaceutical compounds bind to proteins. This computational approach could dramatically reduce the time and cost of identifying promising drug candidates by using AI to simulate what traditionally required extensive laboratory experiments. What you should know: The dataset represents a breakthrough in computational drug discovery, combining traditional scientific computing with modern AI capabilities. SandboxAQ generated the synthetic molecules using Nvidia's chips and existing experimental data, creating three-dimensional molecular structures that...
Jun 17, 2025
Groq challenges AWS with faster AI and Hugging Face integration
Groq has launched two major initiatives targeting established cloud providers like AWS and Google: supporting Alibaba's Qwen3 32B language model with its full 131,000-token context window and becoming an official inference provider on Hugging Face's platform. These moves position the AI inference startup to challenge tech giants by offering faster processing speeds and broader developer access, potentially reshaping how millions of developers access high-performance AI models. What you should know: Groq claims to be the only fast inference provider capable of supporting Qwen3 32B's complete 131,000-token context window, a technical capability that enables processing of lengthy documents and complex reasoning...
Jun 11, 2025
Jupiter supercomputer puts Europe in global AI computing race
Nvidia and the Jülich Supercomputing Centre have launched Jupiter, Europe's fastest supercomputer, powered by nearly 24,000 Nvidia GH200 Grace Hopper Superchips. The system marks Europe's entry into exascale computing, delivering over one quintillion calculations per second while positioning the continent as a major player in AI research and scientific discovery. What you should know: Jupiter represents a significant leap forward in European high-performance computing capabilities and energy efficiency. The supercomputer delivers more than twice the performance of Europe's next-fastest system and ranks among the top five globally on the latest TOP500 list. Built on Nvidia's GH200 Grace Hopper platform and...
Jun 9, 2025
Amazon invests $20B in Pennsylvania to expand AI cloud infrastructure
Amazon has announced a $20 billion investment in Pennsylvania to expand its cloud infrastructure, marking the latest in a series of massive capital commitments by the tech giant. This investment underscores the intense competition among Big Tech companies to build the infrastructure necessary to support the rapidly growing demand for AI services and cloud computing capabilities. The big picture: Amazon's Pennsylvania investment is part of a broader $35 billion spending spree across multiple states in recent weeks, including $10 billion in North Carolina and over $5 billion in Taiwan for cloud infrastructure. Key details: The Pennsylvania investment will create significant...
Jun 3, 2025
Walk it Back: AI researchers cut energy use with backward computation
Reversible computing is emerging as a promising solution to the energy efficiency crisis facing AI and computing at large. As traditional computing approaches physical limitations on chip miniaturization, researchers are turning to reversible computing—a technique that avoids energy waste by allowing computations to run backward as well as forward. This approach could potentially save orders of magnitude in power consumption, making it particularly valuable for energy-intensive AI applications where efficiency constraints threaten to limit further advancement. The big picture: Researchers are reviving interest in reversible computing as a way to dramatically reduce energy consumption in computation, particularly for power-hungry AI...
May 23, 2025
Intel’s 48GB dual-chip GPU beast may arrive soon
Intel's potential dual-GPU graphics card represents a notable revival of multi-chip consumer hardware, pointing to a possible new direction for AI development hardware that sits between consumer and professional-grade equipment. The reported 48GB memory configuration suggests Intel and its partners may be exploring ways to deliver higher memory capacity in more affordable packages for AI workloads and content creation, though important questions about performance, pricing, and market positioning remain unanswered. The big picture: A mysterious Intel graphics card combining two Arc B580 GPUs with a substantial 48GB of memory has reportedly entered development through an unnamed board partner. This design...
May 22, 2025
Dell aims to dominate enterprise AI infrastructure market
Dell Technologies is aggressively positioning itself as the comprehensive provider for enterprise AI infrastructure, with CEO Michael Dell outlining a vision of "decentralized, low-latency, and hyper-efficient" artificial intelligence that follows data rather than forcing data migration. The company's strategy addresses the growing complexity of on-premise generative AI deployments by offering end-to-end solutions with 24/7 support and monitoring, capitalizing on enterprise customers' desire for simplified AI implementation in their own environments. The big picture: Dell is launching a suite of hardware, software, and managed services designed to make on-premise AI more accessible and cost-effective than cloud alternatives. The new Dell AI...
May 22, 2025
Google developers blog unveils latest web, mobile, AI and cloud updates
Google's new on-device AI capabilities mark a significant expansion in edge computing possibilities, bringing powerful language models directly to mobile devices and web applications. The introduction of Gemma 3n as Google's first multimodal small language model, combined with new RAG and Function Calling libraries, provides developers with comprehensive tools to build sophisticated AI features that operate entirely on local devices without requiring cloud connectivity or compromising user privacy. The big picture: Google AI Edge is dramatically expanding its on-device small language model (SLM) ecosystem with over a dozen new models, including the new Gemma 3 and multimodal Gemma 3n models,...
May 21, 2025
Diffusers’ quantization backends boost AI model efficiency
Quantization techniques are transforming how resource-intensive diffusion models can be deployed, making state-of-the-art AI image generation more accessible. By reducing precision requirements without significantly sacrificing quality, these approaches are democratizing access to powerful models like Flux that would otherwise require substantial computational resources. Understanding the trade-offs between different quantization backends is becoming essential knowledge for AI practitioners looking to optimize their deployment strategies. The big picture: Hugging Face Diffusers now supports multiple quantization backends that can significantly reduce the memory footprint of large diffusion models like Flux. These techniques compress models by using lower precision representations of weights and activations,...
May 21, 2025
LLM runs on Commodore 64 in impressive display of 80s tech staying power
The 42-year-old Commodore 64 just became the oldest computer capable of running a large language model, showcasing the remarkable versatility of early computing hardware in the age of AI. While modern AI companies race to optimize their models for efficiency on contemporary devices, developer Maciej Witkowiak has taken a dramatically different approach by successfully porting a simplified LLM to run on 1982 technology, demonstrating how even the most basic computing platforms can participate in today's AI revolution. The big picture: Developer Maciej Witkowiak has successfully ported a simplified version of Llama 2 to run on a Commodore 64 computer from...
May 20, 2025
Nvidia and Foxconn build AI supercomputer to power Taiwan’s tech future
Nvidia and Foxconn's new AI supercomputer collaboration marks a significant advancement in Taiwan's technological infrastructure. The partnership, which includes the Taiwan government and TSMC, represents a strategic investment to position Taiwan as an AI innovation hub in the semiconductor industry. This development highlights the growing importance of specialized AI computing resources in driving national technological competitiveness and industrial transformation. The big picture: Nvidia and Foxconn are developing a massive AI factory supercomputer in Taiwan equipped with 10,000 Nvidia Blackwell GPUs, creating foundational AI infrastructure for researchers, startups, and industries throughout the region. The project represents an expansion of the companies'...
May 20, 2025
Data center future trends unveiled at Eaton’s seaside conference
Eaton's Solutions By The Sea Data Center Conference highlights the critical intersection of generative AI, energy production, and data center evolution. The conference showcases how infrastructure providers are preparing for AI's intense energy demands while addressing sustainability concerns. Understanding these developments is crucial as data centers rapidly transform to accommodate AI workloads that require significantly more power, cooling capacity, and innovative rack designs. The big picture: Eaton's conference focused on how generative AI is transforming data center infrastructure requirements and energy management strategies across the industry. Sessions covered critical topics including chip evolution impacts on rack design, handling volatile GenAI...
May 20, 2025
Qualcomm targets data centers with new Nvidia-compatible processors
Qualcomm's entry into the data center AI processor market signals a significant strategic pivot for the mobile chip giant, as it positions itself to compete in Nvidia's GPU-dominated territory. By designing CPUs specifically compatible with Nvidia's GPUs and software ecosystem, Qualcomm is acknowledging Nvidia's market leadership while carving out its own complementary role in the AI infrastructure landscape. This diversification effort represents the company's latest attempt to expand beyond its smartphone chip stronghold under CEO Cristiano Amon's leadership. The big picture: Qualcomm announced plans to launch data center processors designed to connect with Nvidia's GPUs, marking the company's re-entry into...
May 19, 2025
Nvidia unveils tech to maintain AI development dominance
Nvidia's strategic expansion with NVLink Fusion represents a significant pivot in the AI chip ecosystem, allowing non-Nvidia processors to interconnect with their technology for the first time. This move positions the company to maintain its central role in AI development even as competitors build custom solutions, effectively acknowledging the hybrid future of AI infrastructure while ensuring Nvidia remains the gravitational center of the industry. The big picture: Nvidia CEO Jensen Huang announced "NVLink Fusion," opening the company's previously closed interconnect technology to competitors' CPUs and custom chips for the first time. The program allows customers to build "semi-custom AI infrastructure"...
May 19, 2025
Super Micro’s powers weaken as revenue forecast slashed, shares tumble
Super Micro Computer's revenue forecast reduction signals mounting concerns for the AI server maker amid internal challenges that threaten its market standing. The company's significant downward adjustment from its previous guidance comes at a critical time when investor scrutiny is intensifying around AI infrastructure providers. This development raises questions about both Super Micro's operational issues and potential shifts in the broader AI hardware market. The big picture: Super Micro Computer has cut its fiscal 2025 revenue forecast, reinforcing concerns about the server maker's position in the competitive AI infrastructure market. Key details: The company now expects 2025 revenue between $21.8...