News/Open-source
Cognizant’s AI Lab reaches 59 U.S. patents with neural network breakthroughs
Cognizant's AI Lab has secured its 59th U.S. patent, marking a significant milestone in the company's artificial intelligence research efforts. The achievement reflects Cognizant's accelerating innovation pace, with two new patents granted in the first half of 2025 alone, plus an additional 23 patents pending approval. What you should know: The latest patents demonstrate Cognizant's focus on solving core AI challenges around neural network optimization and training efficiency.
• U.S. Patent No. 12,282,845 covers Multi-objective Coevolution of Deep Neural Network Architectures, designed to improve model performance and resource efficiency across applications from medical imaging to natural language processing.
• U.S. Patent No....
Jul 24, 2025
Trump’s AI Action Plan prioritizes open-source models, reshapes government tech
President Donald Trump signed the AI Action Plan, establishing an "open-weight first" approach to artificial intelligence development that explicitly supports open-source AI models and removes regulatory barriers for government AI adoption. The plan signals a fundamental shift from the previous administration's cautious regulatory stance, potentially accelerating enterprise AI deployment while creating new compliance challenges for businesses working with federal agencies. What you should know: The Action Plan restructures how government agencies can contract with AI providers and sets clear priorities for American AI leadership. The plan removes references to misinformation and diversity, equity and inclusion from National Institute of Standards...
Jul 23, 2025
Proton launches Lumo, a privacy-first AI chatbot with end-to-end encryption
Proton has launched Lumo, a privacy-focused AI chatbot that promises end-to-end encryption and zero data logging to compete with ChatGPT and Google Gemini. The service positions itself as the first major AI assistant that prioritizes user privacy by storing conversations locally on devices rather than on company servers, appealing to users concerned about data security in AI interactions. What you should know: Lumo operates under Proton's established privacy framework, ensuring conversations remain encrypted and inaccessible to the company or third parties. Chats are stored locally on user devices and don't sync across multiple devices, even for logged-in users. All queries...
Jul 23, 2025
Google’s Aeneas AI helps historians decode ancient Latin inscriptions
Google DeepMind has launched Aeneas, an AI system designed to help historians decode and contextualize ancient Latin inscriptions carved in stone. The tool analyzes weathered engravings to determine when and where they were originally created, while also providing researchers with historical parallels from a database of nearly 150,000 catalogued inscriptions spanning from modern-day Britain to Iraq. How it works: Aeneas processes partial transcriptions alongside scanned images of inscriptions to reconstruct missing text and provide historical context. The system can fill in damaged portions of text—for example, completing "...us populusque Romanus" by suggesting "Senat" to form "Senatus populusque Romanus" ("The Senate...
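Aeneas does this restoration with neural models trained on the full epigraphic corpus; the task itself can be illustrated with a toy sketch that proposes a missing prefix by matching a fragment's surviving tail against known Latin formulae. The corpus and scoring here are invented for illustration only.

```python
# Toy restoration sketch (not Aeneas itself): match the surviving tail of a
# damaged inscription against a small corpus of stock Latin formulae and
# propose the missing prefix.

CORPUS = [
    "Senatus populusque Romanus",
    "Imperator Caesar Augustus",
    "Dis Manibus sacrum",
]

def restore(damaged: str):
    """Given a fragment like '...us populusque Romanus', return the missing
    prefix from the first corpus entry whose ending matches."""
    fragment = damaged.lstrip(".")
    for text in CORPUS:
        if text.endswith(fragment):
            return text[: len(text) - len(fragment)]  # the restored prefix
    return None

print(restore("...us populusque Romanus"))  # suggests "Senat"
```

The real system scores completions probabilistically and also retrieves historical parallels, rather than requiring an exact string match.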
Jul 23, 2025
Trump unveils AI strategy targeting China while promoting open-source models
The White House has unveiled a new artificial intelligence policy plan aimed at achieving "global dominance" in the AI sector, with President Donald Trump calling for open-source AI models to be freely available worldwide. The plan also directs the Commerce Department to investigate Chinese AI models for potential alignment with Communist Party messaging and restricts federal AI funding to states with burdensome regulations. Key policy priorities: The administration's strategy centers on promoting open AI development while scrutinizing foreign competitors. Open-source and open-weight AI models should be made freely available by developers for global download and modification. The Commerce Department will...
Jul 22, 2025
New method tracks how AI models actually make predictions after scaling
AI researcher Patrick O'Donnell has introduced "landed writes," a new method for understanding how large language models make predictions by tracking how internal components actually influence outputs after normalization scaling. The approach addresses a critical gap in current AI interpretability tools, which measure what model components intend to write rather than what actually affects the final answer after the model's internal scaling processes. The core problem: Most AI interpretability tools completely miss how transformer models internally reshape component contributions through RMSNorm scaling, which can amplify early-layer writes by up to 176× while compressing late-layer contributions. When a neuron writes +0.001...
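The gap between an "intended" and a "landed" write can be seen in a minimal NumPy sketch: a component adds a small vector to the residual stream, but because RMSNorm rescales the whole stream, the contribution that reaches the output differs from what was written. The vector sizes and values below are made up for illustration; this is not O'Donnell's implementation.

```python
import numpy as np

# Hedged sketch: contrast a component's raw residual-stream write with its
# "landed" size after RMSNorm rescales the stream.

def rms_norm(x, gain=1.0, eps=1e-6):
    return gain * x / np.sqrt(np.mean(x ** 2) + eps)

residual = np.full(64, 0.01)             # residual stream before the write
write = np.zeros(64)
write[0] = 0.001                         # the component's intended write

before = rms_norm(residual)
after = rms_norm(residual + write)
landed = after - before                  # what actually reaches the output

print(write[0], landed[0])               # landed effect differs from +0.001
```

In this toy setup the +0.001 write lands much larger than written because normalization divides by the stream's (small) RMS — the same mechanism by which real models can amplify early-layer writes while compressing late-layer ones.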
Jul 22, 2025
Alibaba’s Qwen3 model outperforms rivals while cutting hardware costs by 70%
Alibaba has released Qwen3-235B-A22B-2507-Instruct, an open-source large language model that outperforms rival Chinese AI startup Moonshot's Kimi-2 and Claude Opus 4's non-thinking version on key benchmarks. The model comes with an FP8 version that dramatically reduces compute requirements, allowing enterprises to run powerful AI capabilities on smaller, less expensive hardware while maintaining performance quality. What you should know: The new Qwen3 model delivers substantial improvements across reasoning, coding, and multilingual tasks compared to its predecessor. MMLU-Pro scores jumped from 75.2 to 83.0, showing stronger general knowledge performance. GPQA and SuperGPQA benchmarks improved by 15-20 percentage points for better factual accuracy....
Jul 16, 2025
Rakuten builds memory-enhanced Japanese AI model with government backing
Rakuten has been selected for the third phase of Japan's government-backed Generative AI Accelerator Challenge (GENIAC), a program supported by the Ministry of Economy, Trade and Industry and the New Energy and Industrial Technology Development Organization (NEDO). The Japanese tech giant plans to develop an open-weight Japanese language model with enhanced memory capabilities, positioning itself to create more personalized AI applications across its business ecosystem while contributing to Japan's domestic AI research capacity. What you should know: Rakuten will focus on creating a lightweight, memory-augmented Japanese language model using a Mixture of Experts architecture starting in August 2025. The model...
Jul 16, 2025
Mistral releases Voxtral, an open-source voice AI that challenges paid alternatives
Mistral has released Voxtral, an open-source voice AI model that goes beyond basic transcription to offer summarization and speech-triggered functions, challenging paid alternatives from companies like ElevenLabs and Hume AI. The Apache 2.0-licensed model comes in 24B and 3B parameter versions, with Mistral claiming it bridges the gap between proprietary speech recognition systems and existing open-source alternatives that often lack semantic understanding. What you should know: Voxtral offers comprehensive voice processing capabilities that extend far beyond traditional transcription services. The model can process up to 30 minutes of audio for transcription or 40 minutes for audio understanding with a 32K...
Jul 15, 2025
Mira Murati’s AI startup raises $2B at $12B valuation
Mira Murati's AI startup Thinking Machines has raised approximately $2 billion at a $12 billion valuation in an early-stage funding round led by Andreessen Horowitz, a prominent venture capital firm. The massive investment in a company launched just five months ago with no revenue or products yet highlights the intense competition for top AI talent and the continued investor enthusiasm for AI startups from former OpenAI executives. The big picture: This funding round exemplifies the extraordinary valuations being assigned to AI startups led by high-profile executives, even in pre-revenue stages. The company was founded in February 2025 by Murati after...
Jul 13, 2025
Open-source Kimi K2 outperforms GPT-4 on coding and math benchmarks
Moonshot AI has released Kimi K2, an open-source language model that outperforms GPT-4 on key benchmarks including coding and mathematical reasoning while being available for free. The Chinese startup's trillion-parameter model achieved 65.8% accuracy on SWE-bench Verified and 97.4% on MATH-500, surpassing OpenAI's GPT-4.1 at 92.4%, signaling a potential shift in AI market dynamics where open-source models finally match proprietary alternatives. What you should know: Kimi K2 features 1 trillion total parameters with 32 billion activated parameters in a mixture-of-experts architecture, optimized specifically for autonomous agent capabilities. The model comes in two versions: a foundation model for researchers and developers,...
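The 1-trillion-total / 32-billion-activated split comes from mixture-of-experts routing: a gate scores all experts per token and only the top-k actually run. The toy NumPy router below uses tiny stand-in sizes and invented weights; it shows the mechanism, not Kimi K2's architecture.

```python
import numpy as np

# Toy mixture-of-experts router: a gate scores every expert, but only the
# top-k are computed, so most parameters stay inactive per token.

rng = np.random.default_rng(0)
n_experts, top_k, d = 8, 2, 4

gate_w = rng.normal(size=(d, n_experts))
experts = rng.normal(size=(n_experts, d, d))   # one weight matrix per expert

def moe_forward(x):
    scores = x @ gate_w                        # gate score per expert
    chosen = np.argsort(scores)[-top_k:]       # only top-k experts activate
    weights = np.exp(scores[chosen])
    weights /= weights.sum()                   # softmax over the chosen few
    out = sum(w * (x @ experts[i]) for w, i in zip(weights, chosen))
    return out, chosen

x = rng.normal(size=d)
out, chosen = moe_forward(x)
print(f"activated {len(chosen)}/{n_experts} experts")
```

Scaling the same idea up, a model can store a huge total parameter count while each token pays only for the experts its router selects.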
Jul 11, 2025
Swiss universities to release 70B parameter open-source LLM in 2025
ETH Zurich and EPFL will release a fully open-source large language model in late summer 2025, trained on the "Alps" supercomputer at the Swiss National Supercomputing Centre. The model represents a significant milestone in open AI development, offering multilingual fluency in over 1,000 languages and positioning European institutions as credible alternatives to closed commercial systems from the US and China. What you should know: The LLM will be completely transparent, with source code, weights, and training data publicly available under the Apache 2.0 License. Unlike commercial models developed behind closed doors, this approach enables high-trust applications and supports regulatory compliance...
Jun 25, 2025
Google launches free Gemini CLI tool for developers across platforms
Google has launched Gemini CLI, a free command-line tool that brings its Gemini 2.5 Pro AI model directly to developers' terminals across Mac, Windows, and Linux. The open-source tool provides lightweight access to Google's AI agent with capabilities extending beyond code understanding to include file manipulation, command execution, and troubleshooting, positioning Google to capture developer mindshare in the competitive AI tooling landscape. What you should know: Gemini CLI transforms the traditional command-line experience by integrating AI assistance directly into developers' existing workflows. The tool offers code writing and explanation, debugging assistance, project management, and natural-language documentation queries. Beyond coding tasks,...
Jun 17, 2025
Google gives iNaturalist $1.5M for AI tools—but users threaten to quit over environmental impact
iNaturalist, a nonprofit platform used by 3.7 million nature observers worldwide, received a $1.5 million grant from Google's philanthropic arm to develop generative AI tools for species identification. The announcement sparked significant backlash from the platform's community, who raised concerns about environmental impacts, data accuracy, and the potential devaluation of human expertise in taxonomy. What you should know: iNaturalist operates as a collaborative platform where users submit observations of wild organisms and rely on community expertise for species identification. More than 3.7 million people use the platform to record observations, from weekend naturalists to professional taxonomists. The community has logged...
Jun 11, 2025
Developer proposes new Unix philosophy for the AI era
A developer known as Looper has proposed a reimagined Unix philosophy for the Post-AI Era, arguing that traditional software development principles need updating for a world dominated by machine learning, neural networks, and intelligent systems. The essay challenges developers to rethink fundamental concepts like modularity, simplicity, and tool design in an age where AI agents and adaptive systems are replacing static programs. The core shift: Traditional Unix philosophy emphasized writing programs that "do one thing and do it well," but AI systems work with patterns, not programs. The proposed new mantra becomes "Build systems that are pattern-aware and failure-resilient" rather...
Jun 3, 2025
Google launches AI Edge Gallery for offline Android AI
Google's experimental AI Edge Gallery app brings advanced artificial intelligence capabilities directly to Android smartphones without requiring internet connectivity, representing a significant advancement in edge computing and on-device AI processing. This approach addresses growing privacy concerns by keeping sensitive data locally processed rather than sending it to cloud servers, while simultaneously making sophisticated AI models more accessible to mobile users through an open-source framework. The big picture: Google has released an experimental Android application enabling users to run sophisticated AI models directly on their smartphones without an internet connection. The app, called AI Edge Gallery, allows users to download and...
Jun 2, 2025
DeepSeek update challenges OpenAI and Google dominance
DeepSeek is emerging as a formidable challenger in the global AI landscape with its latest release demonstrating significant performance improvements while maintaining an open-source approach. The Chinese startup's new DeepSeek-R1-0528 model showcases remarkable gains in complex reasoning and coding capabilities, areas where even industry leaders struggle. What makes DeepSeek particularly noteworthy is its combination of competitive performance, open licensing, and cost-efficient development—a strategy that could reshape who controls and benefits from advanced AI technology. The big picture: DeepSeek's latest AI model, DeepSeek-R1-0528, is challenging Western AI giants like OpenAI and Google with significant performance improvements in reasoning, coding, and logic....
Jun 2, 2025
AI integration layer Model Context Protocol gains traction
Anthropic's Model Context Protocol aims to solve the complex integration problem plaguing AI systems by establishing a standardized way for large language models to interact with external tools. As enterprise AI systems grow more sophisticated in their ability to generate content and take actions, the current landscape of proprietary interfaces has created an integration bottleneck that costs organizations significant time and resources. MCP represents a promising step toward an industry standard that could dramatically reduce this "integration tax" through consistent interfaces. The big picture: Anthropic's Model Context Protocol (MCP) offers a standardized framework for connecting AI models with external tools,...
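MCP builds on JSON-RPC 2.0: servers advertise their tools (via a `tools/list` request) and hosts invoke them with a uniform `tools/call` message, so swapping tool providers doesn't require rewriting integration code. The sketch below constructs a request of that general shape; the tool name and arguments are hypothetical, and real clients use an MCP SDK rather than hand-built dictionaries.

```python
import json

# Sketch of the JSON-RPC 2.0 envelope MCP uses to invoke a tool.
# "get_weather" and its arguments are invented for illustration.

request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
        "name": "get_weather",             # hypothetical tool name
        "arguments": {"city": "Zurich"},
    },
}

wire = json.dumps(request)                 # what goes over the transport
print(wire)
```

Because every server speaks the same envelope, the per-provider "integration tax" collapses into implementing the protocol once.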
May 23, 2025
Python agents in 70 lines: Building with MCP
Hugging Face's new MCP-powered Python framework allows developers to create sophisticated AI agents with minimal code, streamlining how language models interact with external tools. This implementation simplifies agent development by standardizing tool interactions through the Model Context Protocol (MCP), enabling Python developers to rapidly build conversational AI systems that can leverage a growing ecosystem of capabilities without complex custom integrations. The big picture: Hugging Face has extended its hub client SDK to support the Model Context Protocol (MCP), allowing Python developers to create AI agents in roughly 70 lines of code. MCP standardizes how large language models interact with external...
May 21, 2025
Simplest PyTorch repository for training vision language models
Hugging Face has introduced nanoVLM, a lightweight and accessible toolkit that simplifies the complex process of training Vision Language Models with minimal code requirements. This project follows in the footsteps of Andrej Karpathy's nanoGPT by prioritizing readability and simplicity, potentially democratizing VLM development for researchers and beginners alike. The toolkit's focus on pure PyTorch implementation and compatibility with free-tier computing resources represents a significant step toward making multimodal AI development more approachable. The big picture: nanoVLM provides a streamlined way to build models that process both images and text without requiring extensive technical expertise or computational resources. The toolkit enables...
May 21, 2025
Diffusers’ quantization backends boost AI model efficiency
Quantization techniques are transforming how resource-intensive diffusion models can be deployed, making state-of-the-art AI image generation more accessible. By reducing precision requirements without significantly sacrificing quality, these approaches are democratizing access to powerful models like Flux that would otherwise require substantial computational resources. Understanding the trade-offs between different quantization backends is becoming essential knowledge for AI practitioners looking to optimize their deployment strategies. The big picture: Hugging Face Diffusers now supports multiple quantization backends that can significantly reduce the memory footprint of large diffusion models like Flux. These techniques compress models by using lower precision representations of weights and activations,...
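The core trade-off all of these backends exploit can be shown in a few lines: store weights as low-precision integers plus a scale factor, then dequantize on the fly. The NumPy sketch below is a generic illustration of 8-bit quantization, not how any particular Diffusers backend is implemented.

```python
import numpy as np

# Minimal illustration of weight quantization: int8 storage plus a
# per-tensor scale. Real backends (bitsandbytes, GGUF, ...) are far more
# sophisticated; this only shows the memory/precision trade-off.

rng = np.random.default_rng(0)
w = rng.normal(scale=0.02, size=4096).astype(np.float32)

scale = np.abs(w).max() / 127.0
q = np.round(w / scale).astype(np.int8)    # 1 byte/weight vs 4 for fp32
w_hat = q.astype(np.float32) * scale       # dequantized for compute

err = np.abs(w - w_hat).max()
print(f"memory: {q.nbytes}B vs {w.nbytes}B, max error {err:.5f}")
```

The stored tensor shrinks 4x while the worst-case rounding error stays within half a quantization step — the kind of precision loss that diffusion models like Flux can typically absorb with little visible quality change.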
May 21, 2025
Devstral launches AI-powered software development platform
Mistral AI and All Hands AI have released Devstral, a groundbreaking open-source AI model specifically designed for software engineering that outperforms existing options for coding assistance. This new lightweight yet powerful large language model (LLM) achieves significantly better results on real-world programming tasks than both open and some closed-source alternatives, while being accessible enough to run on consumer hardware. The Apache 2.0 license makes it freely available for both individual developers and enterprises needing secure, compliant AI coding assistance. The big picture: Devstral represents a significant advancement in AI-powered software development by tackling real-world coding challenges rather than just simpler,...
May 20, 2025
AI browser assistant BrowserBee launches on GitHub
BrowserBee represents a significant advancement in browser automation by bringing natural language control directly to Chrome users in a privacy-focused package. Unlike traditional automation tools that require coding knowledge, this open-source extension enables users to accomplish complex web tasks through conversational commands while maintaining security for sensitive accounts and personal information. The big picture: BrowserBee combines large language model (LLM) capabilities with Playwright's browser automation toolkit to create a seamless interface between natural language instructions and browser actions. The extension operates primarily within the user's browser, only connecting externally for LLM processing, which preserves security when accessing logged-in websites and...
May 19, 2025
LegoGPT model creates custom Lego sets for free in novel form of AI “buildout”
Carnegie Mellon researchers have developed LegoGPT, an innovative AI tool that transforms simple text descriptions into physics-tested, buildable Lego designs. This free, open-source system represents a significant advancement in AI-generated physical objects, offering step-by-step brick-by-brick instructions that bridge the gap between creative imagination and real-world construction. By combining generative AI with physics simulations, LegoGPT demonstrates how artificial intelligence can create designs that aren't just visually appealing but structurally sound and physically buildable. How it works: LegoGPT converts natural language descriptions into complete Lego building instructions that can be physically constructed using real bricks. The system employs physics simulations to test...
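LegoGPT's physics pass rejects designs that look plausible but cannot stand; the flavor of such a check can be reduced to one toy rule: every brick must rest on the ground or overlap a brick one level below. The brick encoding and the single-rule check below are invented for illustration — the real system runs proper physics simulation.

```python
# Toy buildability check: bricks per layer are (x, width) intervals,
# layers listed bottom-up. A brick above ground level must overlap at
# least one brick in the layer beneath it.

def supported(layers):
    """Return True if no brick floats without support below it."""
    for k in range(1, len(layers)):
        for x, w in layers[k]:
            below = layers[k - 1]
            if not any(bx < x + w and x < bx + bw for bx, bw in below):
                return False            # floating brick: unbuildable
    return True

tower = [[(0, 4)], [(1, 2)]]            # top brick rests on the bottom one
floating = [[(0, 2)], [(5, 2)]]         # top brick has nothing under it
print(supported(tower), supported(floating))  # True False
```

A generator can loop: propose a design, run the check, and resample any layer that fails — a crude analogue of generating "step-by-step, brick-by-brick" instructions that are guaranteed constructible.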