News/Open-source
Cohere’s smallest, fastest R-series AI model excels at RAG and foreign languages
Cohere's latest AI model release demonstrates the company's growing focus on practical, efficient enterprise solutions that balance performance with resource optimization.

Key Model Features and Capabilities: Command R7B represents Cohere's smallest and fastest offering in its R series, designed to provide efficient AI capabilities without requiring extensive computational resources.
- The model features a 128K context length and supports 23 languages, making it versatile for various enterprise applications
- Built with retrieval-augmented generation (RAG) technology, which enhances accuracy by grounding responses in external data
- Specifically optimized for tasks including math, reasoning, code, and translation

Performance Benchmarks: Command R7B has demonstrated superior...
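The grounding idea behind RAG can be sketched in a few lines: retrieve the documents most relevant to a query, then build a prompt that forces the model to answer from that context. This is a generic illustration with a toy word-overlap retriever and made-up document strings, not Cohere's actual API:

```python
# Minimal retrieval-augmented generation (RAG) sketch: retrieve relevant
# documents, then ground the model's prompt in them. The scoring function
# is a toy stand-in; real systems use embedding similarity.
import re

def retrieve(query, documents, k=2):
    """Rank documents by naive word overlap with the query."""
    q_words = set(re.findall(r"\w+", query.lower()))
    scored = sorted(
        documents,
        key=lambda d: len(q_words & set(re.findall(r"\w+", d.lower()))),
        reverse=True,
    )
    return scored[:k]

def build_grounded_prompt(query, documents):
    """Assemble a prompt that grounds the answer in retrieved context."""
    context = "\n".join(f"- {d}" for d in retrieve(query, documents))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

docs = [
    "Command R7B supports a 128K context length.",
    "The Eiffel Tower is in Paris.",
    "Command R7B supports 23 languages.",
]
print(build_grounded_prompt("How many languages does Command R7B support?", docs))
```

Because the answer is drawn from retrieved text rather than the model's parameters alone, a grounded prompt like this is what lets RAG systems cite and stay consistent with external data.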
Dec 19, 2024
Why this uncensored AI video model from China may spark an AI hobbyist movement

The emergence of open-source AI video generation models marks a significant shift in the accessibility and capabilities of video synthesis technology, with Tencent's HunyuanVideo leading the way as a freely available, uncensored option.

Recent developments in AI video: The AI video generation landscape has experienced rapid advancement in late 2024, with multiple major releases from industry leaders.
- OpenAI's Sora, Pika AI's Pika 2, Google's Veo 2, and Minimax's video-01-live have all launched or been announced recently
- Tencent's HunyuanVideo distinguishes itself by making its neural network weights openly available, enabling local execution on suitable hardware
- The model can be fine-tuned and...
Dec 18, 2024
Apple and NVIDIA partner on open initiative to boost LLM performance

Apple and NVIDIA have joined forces in a new partnership to enhance the performance of large language models (LLMs).

Key development: Apple's recently open-sourced Recurrent Drafter (ReDrafter) technology is being integrated into NVIDIA's TensorRT-LLM platform to accelerate text generation capabilities.
- ReDrafter is a speculative decoding approach that significantly improves text-generation speed for large language models
- The integration combines Apple's methodology with NVIDIA's GPU infrastructure
- This collaboration marks a notable instance of cross-company cooperation in advancing AI technology

Technical achievements: The partnership has yielded substantial performance improvements in LLM operations through strategic hardware and software optimization. Benchmarking...
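ReDrafter's specific recurrent drafting scheme is not detailed in the summary, but the general draft-then-verify idea behind speculative decoding can be sketched as follows. The "models" here are toy lookup functions over a fixed phrase, not Apple's or NVIDIA's code:

```python
# Generic speculative-decoding loop in the spirit of draft-then-verify
# approaches like ReDrafter: a cheap draft model proposes several tokens,
# the expensive target model verifies them, and the accepted prefix is
# kept in a single step instead of one token per step.

PHRASE = ["the", "quick", "brown", "fox", "jumps"]

def draft_tokens(prefix, n=4):
    """Cheap drafter: guess the next n tokens (toy: continues a fixed phrase)."""
    return [PHRASE[(len(prefix) + i) % len(PHRASE)] for i in range(n)]

def target_next(prefix):
    """Expensive target model: the ground-truth next token (same toy phrase)."""
    return PHRASE[len(prefix) % len(PHRASE)]

def speculative_step(prefix):
    """Accept drafted tokens while they match the target model's choice."""
    accepted = []
    for tok in draft_tokens(prefix):
        if tok == target_next(prefix + accepted):
            accepted.append(tok)
        else:
            break
    # Always emit at least one token from the target model so decoding progresses.
    if not accepted:
        accepted.append(target_next(prefix))
    return prefix + accepted

print(speculative_step(["the"]))  # -> ['the', 'quick', 'brown', 'fox', 'jumps']
```

The speedup comes from the verification step: when the drafter agrees with the target model, several tokens are committed per target-model pass rather than one.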
Dec 18, 2024
Databricks founder offers $1M to solve AI coding challenges

The artificial intelligence community has a new challenge with significant implications for the future of coding, as Databricks and Perplexity co-founder Andy Konwinski announces a major prize for advancing AI coding capabilities.

Prize details and objectives: Konwinski is offering a $1 million reward to researchers who can achieve a 90% score on a new AI coding benchmark, with the competition specifically targeting open-source language models.
- The contest, named K Prize, aims to encourage smaller, independent researchers to develop innovative approaches to AI model training
- A minimum prize of $50,000 is guaranteed for the top submission, even if it falls short...
Dec 17, 2024
UAE’s Falcon 3 competes with top open-source AI models

The UAE's Technology Innovation Institute has entered the competitive small language model space with Falcon 3, a new family of efficient AI models designed for deployment on single GPU systems.

Core technology and specifications: Falcon 3 represents a significant advancement in small language model development with its focus on efficiency and performance.
- The family includes four model sizes (1B, 3B, 7B, and 10B parameters) with both base and instruct variants
- The models were trained on 14 trillion tokens, more than double the training data of their predecessor
- Built using a decoder-only architecture with grouped query attention, the models optimize memory...
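Grouped query attention, mentioned above, reduces memory use by letting several query heads share one key/value head, which shrinks the KV cache. A minimal sketch with toy dimensions (random weights standing in for learned parameters; not Falcon 3's real configuration):

```python
# Sketch of grouped-query attention (GQA): n_q_heads query heads attend
# using only n_kv_heads shared key/value heads, so the KV cache is
# n_q_heads / n_kv_heads times smaller than in standard multi-head attention.
import numpy as np

def gqa(x, n_q_heads=8, n_kv_heads=2, d_head=4):
    seq, d_model = x.shape
    rng = np.random.default_rng(0)
    # Random stand-ins for learned projection weights.
    wq = rng.standard_normal((d_model, n_q_heads * d_head))
    wk = rng.standard_normal((d_model, n_kv_heads * d_head))
    wv = rng.standard_normal((d_model, n_kv_heads * d_head))
    q = (x @ wq).reshape(seq, n_q_heads, d_head)
    k = (x @ wk).reshape(seq, n_kv_heads, d_head)
    v = (x @ wv).reshape(seq, n_kv_heads, d_head)
    group = n_q_heads // n_kv_heads  # query heads per shared KV head
    out = np.empty_like(q)
    for h in range(n_q_heads):
        kv = h // group  # map each query head onto its shared KV head
        scores = q[:, h] @ k[:, kv].T / np.sqrt(d_head)
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        out[:, h] = weights @ v[:, kv]
    return out.reshape(seq, n_q_heads * d_head)

x = np.random.default_rng(1).standard_normal((5, 16))
print(gqa(x).shape)  # (5, 32)
```

With 8 query heads over 2 KV heads, the cached K and V tensors are a quarter the size of the full multi-head equivalent, which is the memory saving the summary alludes to.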
Dec 17, 2024
MIT’s new Boltz-1 AI model is an open-source rival to AlphaFold3

The development of Boltz-1, a new open-source AI model from MIT researchers, marks a significant advancement in biomolecular structure prediction, offering an alternative to Google DeepMind's restricted AlphaFold3 for both academic and commercial applications.

Project overview and significance: MIT's Jameel Clinic for Machine Learning in Health has created a groundbreaking AI model that matches the capabilities of AlphaFold3 while remaining fully open-source.
- Graduate students Jeremy Wohlwend and Gabriele Corso led the development, working alongside researcher Saro Passaro and professors Regina Barzilay and Tommi Jaakkola
- The model aims to democratize access to advanced protein structure prediction tools
- The team completed the...
Dec 17, 2024
Abu Dhabi’s TII launches Falcon 3 AI models with open-source access

The Technology Innovation Institute (TII) in Abu Dhabi has unveiled Falcon3, a new family of open-source large language models designed to advance AI capabilities while maintaining accessibility and efficiency.

The big picture: The Falcon3 release introduces five base models ranging from 1 billion to 10 billion parameters, with a particular focus on enhancing performance in science, mathematics, and coding applications.
- The family includes Falcon3-1B-Base, Falcon3-3B-Base, Falcon3-Mamba-7B-Base, Falcon3-7B-Base, and Falcon3-10B-Base
- All models are released under the Falcon LLM license to promote AI accessibility and collaboration
- The models support context lengths up to 32,000 tokens (except for the 1B model, which supports...
Dec 15, 2024
Meta FAIR unveils new AI research, models and datasets in latest announcement

Meta FAIR, Meta's AI research lab, has announced several developments in artificial intelligence, including new models for controlling virtual agents and watermarking videos, along with advances in AI architecture and safety.

Key announcements and strategic focus: Meta FAIR's latest release encompasses nine major projects aimed at advancing machine intelligence while promoting open collaboration in AI development.
- Meta Motivo, a foundation model for controlling virtual embodied agents, represents a significant step forward in agent-based AI systems
- Meta Video Seal builds on last year's Audio Seal technology to provide open-source video watermarking
- Additional releases include innovations in...
Dec 15, 2024
Harvard releases copyright-free AI training data to democratize AI development

The growth of open-source AI training datasets marks a significant shift in how artificial intelligence models access and learn from literary works, with Harvard University taking a leading role through a major public domain book release.

Project overview: Harvard's Institutional Data Initiative (IDI) has launched an unprecedented effort to democratize AI development by releasing nearly one million public domain books for AI training purposes.
- The dataset represents a five-fold increase compared to the Books3 dataset, previously one of the largest open collections used for AI training
- Microsoft and OpenAI have provided funding support for this initiative, highlighting major tech companies'...
Dec 11, 2024
How OpenAI may lose ground to open-source models

The rise of large reasoning models (LRMs) marks a significant evolution in AI technology, with OpenAI's o1 model leading the way while facing growing competition from open-source alternatives.

Model capabilities and innovation: OpenAI's o1 represents a new class of AI models that employ additional computational power to review and refine their responses, particularly excelling in complex tasks like coding and mathematics.
- The model uses extra inference-time compute cycles to "think" through problems, enabling more sophisticated problem-solving than traditional large language models (LLMs)
- Developers have reported mixed experiences with o1's latest update, with some showcasing impressive achievements while others...
Dec 11, 2024
New open-source AI image model creates images on your phone in real time

The development of real-time, on-device AI image generation marks a significant advancement in making artificial intelligence more accessible and responsive to everyday users.

Breakthrough technology: The University of Surrey has developed NitroDiffusion, a new AI image generation model that runs directly on local devices without requiring cloud computing resources.
- The model can generate images instantaneously as users type their prompts, a significant departure from traditional AI image generators that require longer processing times
- Users can create complex images in real time by typing prompts like "a photograph of a meerkat floating in space, wearing sunglasses"
- The technology operates entirely on...
Dec 10, 2024
ServiceNow open-sources Fast-LLM to boost enterprise AI model training

ServiceNow has released Fast-LLM as an open-source technology that promises to accelerate enterprise AI model training by 20%, potentially saving significant time, money, and computational resources.

Core Innovation: ServiceNow's Fast-LLM introduces improvements in AI training efficiency through advanced data parallelism and memory management techniques.
- The technology has already proven successful in training ServiceNow's StarCoder 2 LLM and handling large-scale, trillion-token continuous pre-training
- Fast-LLM is designed as a drop-in replacement for existing AI training pipelines, requiring minimal configuration changes
- The framework competes with established AI training tools like PyTorch while offering unique optimization features

Technical Breakthroughs: Two key innovations distinguish...
Dec 9, 2024
Meta’s new Llama 3.3 70B model just dropped and it’s a big deal

Meta's newest open-source AI language model, Llama 3.3 70B, marks another significant advancement in making powerful AI more accessible and efficient, while challenging proprietary models from major tech companies.

Key developments: The 70B-parameter Llama 3.3 demonstrates performance that rivals larger models, including OpenAI's GPT-4o and Google's Gemini Pro 1.5, on certain benchmarks.
- The model is freely available for download through Ollama, Hugging Face, and Meta's official Llama site
- Despite its smaller size, the model's performance approaches that of the larger Llama 3.1 405B
- The release supports eight languages, including Spanish, Hindi, and Thai, with built-in...
Dec 8, 2024
Cryptomining malware infects thousands via hijacked AI model

The popular AI development company Ultralytics experienced a significant security breach when threat actors compromised its YOLO11 model to deploy cryptocurrency mining malware through the Python Package Index (PyPI).

The incident overview: Ultralytics' YOLO (You Only Look Once) AI model, a widely used open-source computer vision system for real-time object detection, was targeted in a supply chain attack affecting versions 8.3.41 and 8.3.42.
- The compromised software has been downloaded over 260,000 times in the past 24 hours from PyPI alone
- The project maintains significant popularity in the developer community, with 33,600 GitHub stars and 6,500 forks
- The attack impacted multiple downstream...
Dec 6, 2024
Meta unveils new budget-friendly AI model for businesses

Key announcement: Meta has unveiled Llama 3.3 70B, a new AI language model that achieves performance parity with larger models while requiring significantly fewer computational resources.
- The new 70B-parameter model matches the capabilities of Meta's larger 405B-parameter version while being more cost-effective and computationally efficient
- Meta claims the model outperforms competing offerings from Google, OpenAI, and Amazon on key benchmarks, including the MMLU (Massive Multitask Language Understanding) test

Competitive landscape: The announcement comes during a week of intense AI-related activity from major technology companies.
- Google, Microsoft, OpenAI, and xAI have all made significant AI announcements this week
- The...
Dec 5, 2024
Meta’s next AI might allow you to type without using your hands

Surface electromyography (sEMG) technology is advancing as a means of translating muscle activity at the wrist into digital commands, with potential applications ranging from augmented reality control to keyboardless typing.

Major breakthrough: Meta is releasing two groundbreaking datasets and benchmarks for sEMG-based typing and pose estimation as part of NeurIPS 2024, representing the largest open-source sEMG datasets ever compiled.
- The datasets include 716 hours of sEMG recordings from 301 consenting participants
- Each dataset contains 10 times more data than previous single-task, single-device collections
- State-of-the-art models for typing and pose estimation are being released alongside the datasets

Technical innovation: Surface electromyography...
Dec 5, 2024
This open-source dataset may lead to more fuel-efficient, AI-designed cars

Global efforts to create more sustainable and efficient vehicles have received a significant boost from a groundbreaking database of car designs and their aerodynamic properties developed by MIT engineers.

Project overview: DrivAerNet++, a comprehensive open-source dataset, contains over 8,000 3D car designs with detailed aerodynamic simulations, representing a significant advancement in automotive design resources.
- The database encompasses multiple car types, including fastback, notchback, and estateback designs
- Each design includes various representations such as mesh models, point clouds, and parametric specifications
- The project required more than 3 million CPU-hours of processing time and generated 39 terabytes of data

Technical foundation:...
Dec 4, 2024
Tencent’s AI video model Hunyuan is the latest challenger to Runway and Sora

Hunyuan Video, a new open-source AI video generation model from Chinese tech giant Tencent, marks another significant development in the rapidly evolving field of AI-generated video content.

Model specifications and capabilities: Hunyuan represents a substantial technical achievement in the AI video generation space, utilizing a 13-billion-parameter diffusion transformer architecture.
- The model can generate 5-second high-resolution videos from text prompts, though generation times currently extend to about 15 minutes
- Implementation requires significant computational resources, with a minimum of 60GB of GPU memory on hardware like Nvidia H800/H20 GPUs
- The system produces photorealistic videos featuring natural-looking human and animal movements

Accessibility...
Dec 2, 2024
The EU AI Act from an open-source developer’s perspective

The European Union's AI Act represents the world's first comprehensive artificial intelligence legislation, establishing a risk-based framework that affects developers, deployers, and users of AI systems, including the open source community.

Key regulatory framework: The EU AI Act creates a tiered system of regulation based on the potential risks posed by different AI applications, from unacceptable to minimal risk.
- The legislation applies to any AI systems or models that affect EU residents, regardless of where the developers are located
- The Act distinguishes between AI models (like large language models) and AI systems (like chatbots or applications that use these models)...
Nov 28, 2024
Alibaba unveils Marco-o1 AI model with advanced reasoning

The emergence of large reasoning models (LRMs) marks a significant advancement in artificial intelligence, with new developments focusing on enhanced problem-solving capabilities beyond traditional language processing tasks.

Key innovation: Alibaba researchers have developed Marco-o1, a new language model that builds on ideas from OpenAI's o1 to tackle complex problems lacking clear solutions or quantifiable metrics.
- The model is based on Alibaba's Qwen2-7B-Instruct and incorporates techniques like chain-of-thought fine-tuning and Monte Carlo Tree Search (MCTS)
- Marco-o1 uses "inference-time scaling," which gives the model more computational time to generate and review responses
- A built-in reflection mechanism prompts the model to periodically review...
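Inference-time scaling is the idea that spending more compute at answer time, not just at training time, improves results. Marco-o1 reportedly does this with MCTS and reflection; the simpler best-of-N variant below is a toy analogue with a noisy stand-in "model" and a self-check scorer, not Alibaba's actual method:

```python
# Simplified illustration of inference-time scaling: sample several
# candidate answers and keep the one that scores best under a self-check.
import random

def candidate_answer(question, rng):
    """Toy 'model': a noisy guess at sqrt(question)."""
    return question ** 0.5 + rng.uniform(-1.0, 1.0)

def score(question, answer):
    """Self-check: how close is answer**2 to the question? Higher is better."""
    return -abs(answer * answer - question)

def best_of_n(question, n, seed=0):
    """More samples = more inference-time compute = better expected answer."""
    rng = random.Random(seed)
    candidates = [candidate_answer(question, rng) for _ in range(n)]
    return max(candidates, key=lambda a: score(question, a))

# Error shrinks (or at worst stays equal) as the sample budget grows.
for n in (1, 4, 32):
    print(n, round(abs(best_of_n(2.0, n) - 2.0 ** 0.5), 3))
```

MCTS generalizes this by searching over partial reasoning steps instead of whole answers, but the trade-off is the same: extra inference compute is exchanged for answer quality.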
Nov 28, 2024
Chinese AI models are closing the AI leadership gap

The artificial intelligence landscape is experiencing rapid evolution as Chinese developers and open-source initiatives challenge OpenAI's leadership position in advanced reasoning models.

Recent developments: Three new Chinese AI models have emerged to compete with OpenAI's o1-preview, showcasing the accelerating pace of innovation in the field.
- DeepSeek R1 (from DeepSeek, backed by High-Flyer Capital Management), Marco-o1 from Alibaba, and OpenMMLab's hybrid model are demonstrating competitive performance
- These models challenge the benchmark standards OpenAI established with its o1-preview model, released in mid-September
- OpenAI is expected to announce its next release as soon as next week, facing pressure to maintain its technological edge

Market...
Nov 27, 2024
HuggingFace claims its new AI model SmolVLM will slash business AI costs

Hugging Face's release of SmolVLM represents a significant advancement in making vision-language AI more accessible and cost-effective for businesses, offering comparable performance to larger models while requiring substantially less computing power.

Key innovation details: SmolVLM is a compact vision-language model that can process both images and text while using significantly fewer computational resources than existing alternatives.
- The model requires only 5.02 GB of GPU RAM, compared to competitors Qwen2-VL 2B and InternVL2 2B, which need 13.70 GB and 10.52 GB respectively
- SmolVLM uses 81 visual tokens to encode image patches of size 384×384, enabling efficient processing of visual information
- The...
Nov 27, 2024
AMD launches ROCm 6.3, an open-source platform to reduce compute costs

The AMD ROCm 6.3 release marks a significant advancement in open-source software for AI, machine learning, and high-performance computing on AMD Instinct GPU accelerators.

Major updates and core features: ROCm 6.3 introduces several key improvements aimed at enhancing developer productivity and computational performance across various sectors.
- SGLang integration enables up to 6X higher performance for large language model (LLM) inferencing
- A re-engineered FlashAttention-2 implementation provides up to 3X faster processing for AI model training
- New multi-node Fast Fourier Transform (FFT) capabilities support distributed computing applications
- Enhanced computer vision libraries include support for the AV1 codec and improved JPEG processing

AI...
Nov 27, 2024
Mochi-1 lets users train their own AI video models with minimal footage

The race to develop accessible, high-quality AI video generation tools has intensified with Genmo's latest advancement in personalized video model training.

Major breakthrough: San Francisco-based Genmo has unveiled a fine-tuning tool for its Mochi-1 video generation model that allows users to customize video output using a small set of training clips.
- The new feature leverages LoRA (Low-Rank Adaptation) technology, previously used in image-model fine-tuning, to help users personalize their video generations
- Users can theoretically achieve customized results with as few as twelve video clips
- The technology could enable specific use cases, like automatically incorporating brand logos into generated videos...
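LoRA is why fine-tuning on as few as twelve clips is plausible: instead of updating a large weight matrix W, you train a small low-rank update B @ A on top of the frozen W, so there are far fewer parameters to fit. A minimal sketch with toy shapes (not Mochi-1's actual layer sizes):

```python
# Sketch of LoRA (Low-Rank Adaptation): the pretrained weight W stays
# frozen; only the small matrices A and B are trained, and the adapter
# output B @ (A @ x) is added to the frozen path.
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, rank = 64, 64, 4

W = rng.standard_normal((d_out, d_in))        # frozen pretrained weight
A = rng.standard_normal((rank, d_in)) * 0.01  # trainable, rank x d_in
B = np.zeros((d_out, rank))                   # trainable, initialized to zero

def lora_forward(x):
    """Output = frozen path + low-rank adapter path."""
    return W @ x + B @ (A @ x)

x = rng.standard_normal(d_in)
# With B at zero, the adapter is a no-op: the model starts unchanged.
print(np.allclose(lora_forward(x), W @ x))  # True

# Trainable parameters: rank * (d_in + d_out) instead of d_in * d_out.
print(A.size + B.size, W.size)  # 512 4096
```

Here the adapter trains 512 parameters against the layer's 4,096, an 8x reduction that grows with layer size, which is what makes few-shot customization of a large video model tractable.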