AI Models – Page 64

News/AI Models

Sep 2, 2024

AI Engineering Breakthrough Slashes Project Failure Rates

Engineered Intelligence: A New Approach to AI Implementation: The concept of "engineered intelligence" is emerging as a potential solution to the high failure rate of AI projects and the looming threat of another AI winter. The current approach to AI implementation often involves data scientists attempting to engineer real-world solutions, resulting in an 87% failure rate for AI projects. Engineered intelligence aims to create a distinct discipline for applied artificial intelligence, similar to how other scientific breakthroughs are handed off to specialized engineers for practical application. The Problem with Current AI Implementation: The lack of a dedicated field for applying...

read Sep 2, 2024

Claude’s AI Boosts Code Editing with New Inpainting Feature

AI-powered code editing gets a boost: Claude, an AI assistant developed by Anthropic, has received a significant update to its Artifacts feature, allowing users to highlight and edit specific lines of code within generated content. The new functionality brings inpainting capabilities, commonly used in AI image generation, to code editing within Claude's interface. Users can now select specific portions of generated code and request changes or explanations, streamlining the process of refining AI-generated content. This update addresses previous limitations where users had to reply to entire threads or manually copy and paste code sections to make targeted changes. Expanding Artifacts'...

read Aug 30, 2024

ChatGPT Reaches 200M Users as AI Adoption Soars

ChatGPT's explosive growth: OpenAI's ChatGPT has reached a milestone of over 200 million weekly active users, doubling its user base since November 2023. This significant growth comes amid increasing competition in the AI chatbot market from tech giants like Meta, Google, and emerging players like Anthropic. The rapid adoption of ChatGPT demonstrates the growing mainstream acceptance and integration of AI language models in various sectors. Enterprise adoption and API usage: OpenAI's products have gained substantial traction in the corporate world, with widespread implementation across Fortune 500 companies. An impressive 92% of Fortune 500 companies are now utilizing OpenAI's products, showcasing...

read Aug 30, 2024

Japan’s Record $59B Defense Budget Targets AI and Unmanned Systems

Japan's defense budget request reflects growing regional tensions: Japan's Defense Ministry has sought a record 8.5 trillion yen ($59 billion) budget for 2025, marking the third year of a rapid five-year military buildup plan amid increasing threats from China. Key objectives of the budget request: The budget aims to fortify deterrence on southwestern islands against China's growing military presence in the region It focuses on developing unmanned weapons and AI systems to compensate for the declining number of servicemembers due to Japan's shrinking population The request is part of Japan's plan to double its annual military spending to around 10...

read Aug 30, 2024

NSF Invests $20M in AI to Transform Geosciences Research

NSF's $20 million investment in AI for geosciences: The U.S. National Science Foundation (NSF) has announced funding for 25 projects totaling over $20 million through the Collaborations in Artificial Intelligence and Geosciences (CAIG) program, aiming to advance AI techniques in geosciences research. The CAIG program seeks to foster transdisciplinary partnerships between geoscientists, computer scientists, mathematicians, and other experts to drive innovative discoveries and solutions in Earth sciences. This investment will support the development and implementation of cutting-edge AI techniques while expanding access to education and training opportunities for using AI in geosciences research. The funded projects align with key technology...

read Aug 30, 2024

AI Adoption Hits Crossroads as Companies Struggle to Scale

Generative AI adoption reaches critical juncture: A new Deloitte report reveals that while two-thirds of companies are increasing their investment in generative AI, most efforts are still in early stages. Key findings on adoption and expectations: The survey of 2,770 director to C-suite level respondents across 14 countries shows a strong focus on efficiency and productivity improvements, but challenges in implementation and measuring impact persist. 54% of organizations are seeking efficiency and productivity improvements through generative AI However, only 38% are actively tracking changes in employee productivity 68% of respondents said their organization has moved 30% or fewer of their...

read Aug 29, 2024

ChatGPT Unminifies JavaScript Code, Unveiling AI’s Development Potential

AI-powered code unminification reveals surprising capabilities: OpenAI's ChatGPT demonstrates an impressive ability to decipher and reconstruct minified JavaScript code, offering developers a powerful tool for code analysis and learning. The challenge of minified code: Frank Fiegel, while exploring an interesting component with running ASCII art, encountered minified code that was difficult to understand at first glance. Minified code is compressed to reduce file size, making it challenging for humans to read and comprehend. Traditionally, developers would either struggle through reading the minified code or search for source maps to restore the original version. ChatGPT's unexpected prowess: Fiegel decided to experiment...

read Aug 29, 2024

Code-Trained AI Models Outperform in Non-Coding Tasks

The power of code in LLM training: New research from Cohere reveals that including code in the pre-training data of large language models (LLMs) significantly improves their performance on non-coding tasks. Researchers systematically investigated the impact of code data in LLM pre-training on general performance beyond coding tasks. The study used a two-phase training process: continued pre-training and a cooldown phase, testing various ratios of text and code in the training data. Models were evaluated at different scales, from 470 million to 2.8 billion parameters, using benchmarks for world knowledge, natural language reasoning, and code performance. Key findings: The inclusion...

read Aug 29, 2024

AI Powers Reliance’s $8.5B Disney Merger and Entertainment Push

A landmark partnership in Indian entertainment: Mukesh Ambani, chair of Reliance Industries Limited (RIL), announced a transformative merger with Disney and unveiled ambitious AI plans at the company's annual general meeting. The $8.5 billion merger between RIL and Disney's key entertainment assets in India received approval from the Competition Commission, subject to voluntary modifications. Ambani welcomed Disney to the "Reliance family," describing the deal as the beginning of a new era in India's entertainment industry. The strategy focuses on combining content creation with digital streaming to deliver affordable content across various consumer preferences. Media and entertainment division performance: RIL's media...

read Aug 28, 2024

AI Slashes 1,800 Jobs at Klarna as Fintech Giant Automates

AI-driven workforce transformation at Klarna: Swedish fintech giant Klarna is embarking on a significant restructuring effort, leveraging artificial intelligence to streamline operations and reduce its workforce by nearly half over the coming years. Klarna plans to decrease its employee count from 3,800 to approximately 2,000, primarily by implementing AI solutions in marketing and customer service departments. This move follows a previous downsizing initiative that saw the company's workforce shrink from 5,000 to 3,800 employees. CEO Sebastian Siemiatkowski positions the job cuts as an opportunity to increase compensation for remaining staff members. AI implementation and early results: Klarna's foray into AI-powered...

read Aug 28, 2024

Experts Debate the Future of LLMs — Optimization or Radical Transformation?

The future of large language models: The debate surrounding the evolution of large language models (LLMs) is centered on whether they will undergo significant transformations or maintain their current capabilities while becoming more accessible and efficient. Two contrasting perspectives have emerged within the AI community: one anticipating dramatic changes in LLMs within months, and another suggesting that improvements in compute power and data have reached a plateau. Skeptics predict that while LLMs may not experience substantial intelligence gains, they are likely to become considerably more cost-effective and faster to use. Current state of LLM development: Analysis of LLM reasoning capabilities...

read Aug 27, 2024

Tech Giants Are Shifting to Paid Models for Advanced AI Services

AI's transition to paid models: The era of free AI services is coming to an end as major tech companies move towards subscription-based models for their advanced AI offerings. OpenAI and other AI companies are increasingly brokering deals with media outlets to address content scraping and training concerns, but the landscape remains largely unregulated. Visual artists and content creators continue to face challenges as AI companies use their work for training without compensation, raising concerns about intellectual property rights. The high costs associated with developing and training large language models are likely to be passed on to consumers through subscription...

read Aug 27, 2024

Anthropic Has Published Its System Prompts, Marking Milestone for AI Transparency

Anthropic's release of AI model system prompts marks a significant step towards transparency in the rapidly evolving generative AI industry. Unveiling the operating instructions: Anthropic has publicly disclosed the system prompts for its Claude family of AI models, including Claude 3.5 Sonnet, Claude 3 Haiku, and Claude 3 Opus. System prompts act as operating instructions for large language models (LLMs), guiding their behavior and interactions with users. The release includes details about each model's capabilities, knowledge cut-off dates, and specific behavioral guidelines. Anthropic has committed to regularly updating the public about changes to its default system prompts. Insights into Claude...

read Aug 27, 2024

OpenAI Reportedly Set to Release Powerful ‘Strawberry’ AI Model This Fall

OpenAI's Strawberry AI: A potential game-changer for ChatGPT and GPT-4: OpenAI is preparing to launch a new AI technology, codenamed Strawberry, that could significantly enhance the capabilities of its chatbots and large language models (LLMs) this fall. Key features of Strawberry: The new AI system boasts advanced mathematical reasoning and programming skills, setting it apart from current LLMs in the market. Strawberry, previously known as Q* (Q Star), was initially developed by OpenAI's former chief scientist Ilya Sutskever and later improved by researchers Jakub Pachocki and Szymon Sidor. The AI demonstrates the ability to solve novel math problems and answer...

read Aug 27, 2024

Anthropic’s ‘Artifacts’ Feature Now Generally Available and on Mobile

Anthropic enhances Claude AI with Artifacts feature: Anthropic, a leading AI startup, has made its innovative Artifacts feature generally available across all user tiers and mobile platforms, marking a significant advancement in AI interaction and productivity. Key feature expansion: Artifacts, previously a manual opt-in feature, is now standard across Anthropic's Free, Pro, and Team tiers, as well as on Claude's official iOS and Android mobile apps. The feature allows users to run code snippets and full programs generated by Claude alongside their chat interface. Users can create interactive visualizations, charts, and even playable games directly within their browser. Mobile integration...

read Aug 27, 2024

MIT Researchers Are Developing AI Models for Dance Choreography

AI meets dance: A fusion of technology and human movement: MIT researchers are exploring how artificial intelligence can interact with and enhance traditional dance practices, leading to innovative projects that blend cultural heritage with cutting-edge technology. MIT's 'dance with AI' project, led by researchers like Pat Pataranutaporn, aims to develop "choreography intelligence" by deconstructing traditional dances from around the world into teachable events for AI systems. The project seeks to create new choreography through AI-human collaboration, potentially breathing new life into cultural traditions and creating new forms of cultural heritage. Researchers are working on developing AI models that can interpret...

read Aug 27, 2024

Nous Research Launches Tool to Train AI Models Across Distributed Networks

Revolutionary AI training breakthrough: Nous Research has unveiled DisTrO, a new optimizer that dramatically increases the efficiency of training powerful AI models across distributed networks. Key innovation: DisTrO significantly reduces the amount of information that must be transmitted between GPUs during AI model training, enabling large-scale models to be trained over consumer-grade internet connections. The optimizer achieves an 857 times efficiency increase compared to the popular All-Reduce algorithm It reduces the amount of information transmitted during each training step from 74.4 gigabytes to 86.8 megabytes DisTrO maintains comparable training performance to conventional methods while drastically reducing communication overhead Implications for...

read Aug 27, 2024

NVIDIA Launches AI Microservices for Japan and Taiwan Markets

AI localization advances: NVIDIA has launched four new NIM microservices to accelerate the deployment of sovereign AI applications with enhanced cultural and language fluency in Japan and Taiwan. The new microservices support popular community models tailored to meet regional needs, improving user interactions through more accurate understanding and responses based on local languages and cultural heritage. This initiative aligns with the growing global trend of nations pursuing sovereign AI to ensure AI systems are in harmony with local values, laws, and interests. ABI Research projects that generative AI software revenue in the Asia-Pacific region alone is expected to surge from...

read Aug 27, 2024

AI Startup Aleph Alpha Launches Open-Source Language Models

Aleph Alpha's release of open-source AI models signals a shift towards transparent and EU-compliant machine learning, potentially reshaping the landscape of AI development. Breakthrough in open-source AI: German startup Aleph Alpha has unveiled two new large language models (LLMs) under an open license, challenging the closed-source approach of many tech giants. The models, Pharia-1-LLM-7B-control and Pharia-1-LLM-7B-control-aligned, each have 7 billion parameters and are designed to deliver concise, length-controlled responses in multiple European languages. Aleph Alpha claims their performance matches leading open-source models in the 7-8 billion parameter range. The company has also open-sourced its training codebase, called "Scaling," allowing researchers...

read Aug 27, 2024

‘Model Collapse’ Has Experts Questioning Inevitability of AI Model Performance

AI performance decline raises concerns: Recent observations suggest that popular AI models like ChatGPT and Claude are experiencing a noticeable decrease in performance and accuracy, challenging the expectation of continuous improvement in AI technology. Steven Vaughan-Nichols, in a Computerworld opinion piece, highlights the erratic and often inaccurate responses from major AI platforms. Users on the OpenAI developer forum have reported a significant decline in accuracy following the release of the latest GPT version. One user expressed disappointment, stating that the AI's performance fell short of the surrounding hype. Potential causes of AI degradation: Several factors may contribute to the perceived...

read Aug 27, 2024

DeepMind, Berkeley Show How to Make AI Models Better, Not Bigger

Optimizing LLM performance through inference-time compute: Researchers from DeepMind and UC Berkeley have explored innovative ways to enhance large language model (LLM) performance by strategically allocating compute resources during inference, potentially reducing the need for larger models or extensive pre-training. The study investigates how to maximize LLM performance using a fixed amount of inference-time compute, comparing different methods and their effectiveness against larger pre-trained models. This approach aims to enable the deployment of smaller LLMs while achieving comparable performance to larger, more computationally expensive models. Key strategies for inference-time compute optimization: The researchers focused on two main approaches to improve...

read Aug 27, 2024

AI Predicts 70% of Earthquakes in 7-Month China Trial

Groundbreaking AI predicts earthquakes with unprecedented accuracy: A new artificial intelligence algorithm developed by researchers at the University of Texas at Austin has demonstrated remarkable success in predicting earthquakes, potentially revolutionizing earthquake preparedness and risk management. The AI system successfully predicted 70% of earthquakes during a seven-month trial in China, forecasting them a week in advance. The algorithm correctly predicted 14 earthquakes within approximately 200 miles of their estimated location and at almost exactly the calculated strength. It missed only one earthquake and gave eight false warnings, showcasing its high level of accuracy. Competition success and global implications: The University...

read Aug 26, 2024

AI21 Debuts Jamba 1.5 With An Eye on Agentic AI

AI21 launches Jamba 1.5: AI21 has unveiled new versions of its Jamba model, combining transformer and Structured State Space (SSM) approaches to enhance AI capabilities. The Jamba 1.5 series includes mini and large versions, building upon the innovations introduced in Jamba 1.0 released in March. Jamba utilizes an SSM approach known as Mamba, aiming to leverage the strengths of both transformers and SSM for improved performance and accuracy. The name Jamba is an acronym for Joint Attention and Mamba architecture, reflecting its hybrid nature. Key features and enhancements: Jamba 1.5 introduces several new capabilities designed to facilitate the development of...

read Aug 26, 2024

Stanford, CMU and Georgia Tech Develop AI Model for Mental Health Support

AI-powered peer counselor training: A collaborative effort between Stanford, Carnegie Mellon, and Georgia Tech has developed an AI model to provide feedback and improve the skills of novice peer counselors in emotional support conversations. The project, presented in a working paper accepted for the 2024 Association for Computational Linguistics conference, aims to address the growing demand for mental health support and the challenges in preparing peer counselors for their roles. Interdisciplinary collaboration between computer scientists and psychologists was crucial in developing this AI-assisted training model, combining expertise in both AI and counseling intervention skills. Developing a feedback framework: The research...

read