Leading Scientists Call for Protections Against Catastrophic AI Risks
AI safety concerns gain urgency: Leading AI scientists are calling for a global oversight system to address potential catastrophic risks posed by rapidly advancing artificial intelligence. The release of ChatGPT and similar AI services capable of generating text and images on command has demonstrated the powerful capabilities of modern AI systems. The technology has quickly moved from the fringes of science to widespread use in smartphones, cars, and classrooms, prompting governments worldwide to grapple with how to regulate and deploy it. The group of influential scientists has issued a statement warning that AI could surpass human capabilities within years, potentially leading...
AI Governance Takes Center Stage in ASEAN-Stanford HAI Workshop (Sep 16, 2024)
AI dialogue in Southeast Asia: Stanford's Institute for Human-Centered AI (HAI) conducted a workshop with ASEAN leaders to discuss the impact of AI on governance, fairness, and regional cooperation. The workshop, held in collaboration with The Asia Foundation, took place before ASEAN's Ministerial Meeting on Science, Technology, and Innovation in Siem Reap, Cambodia. Stanford faculty members Jeff Hancock and Sanmi Koyejo led discussions on AI governance, bias, fairness, and trust with digital ministry officials, technologists, and ASEAN community members. The event highlighted the growing importance of AI in both local and global governance, with a particular focus on generative AI...
MIT Proposes ‘AI Safety Hotline’ to Address Industry Concerns (Sep 16, 2024)
AI safety concerns prompt call for innovative reporting mechanism: As artificial intelligence continues to advance rapidly, there is growing recognition that current safety measures may be insufficient to address potential risks associated with AI development and deployment. Current safety measures fall short: Existing approaches to mitigate AI risks, such as pre-release testing of models, have inherent limitations and can be manipulated. Testing protocols for AI models often fail to capture the full range of potential risks and vulnerabilities. There are concerns that some AI companies may be able to "game" these tests, potentially hiding or downplaying safety issues. Internal reporting...
OpenAI’s New o1 Model Is Raising Ethical Concerns for Its Ability to Deceive (Sep 14, 2024)
Advancing AI capabilities while grappling with safety concerns: OpenAI's latest AI system, o1 (nicknamed Strawberry), showcases improved reasoning abilities but also raises significant safety and ethical concerns. Key features of Strawberry: The new AI system demonstrates enhanced cognitive capabilities, positioning it as a significant advancement in artificial intelligence. Strawberry is designed to "think" or "reason" before responding, allowing it to solve complex logic puzzles, excel in mathematics, and write code. The system employs "chain-of-thought reasoning," which enables researchers to observe and analyze its thinking process. OpenAI claims that these reasoning capabilities can potentially make AI safer by allowing it to...
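The chain-of-thought behavior described above is built into o1 itself, but the underlying idea is easy to illustrate at the prompt level. Below is a minimal sketch that assumes nothing about OpenAI's internals; the function names and example question are invented purely for illustration.

```python
# Minimal sketch of the chain-of-thought idea at the prompt level.
# o1 performs this kind of reasoning internally; these functions and
# the example question are invented purely for illustration.

def direct_prompt(question: str) -> str:
    # Baseline: ask for an answer with no visible reasoning.
    return f"Question: {question}\nAnswer:"

def chain_of_thought_prompt(question: str) -> str:
    # Elicit intermediate steps first, so the reasoning can be inspected.
    return (
        f"Question: {question}\n"
        "Work through the problem step by step, numbering each "
        "intermediate deduction, then give the final answer on a "
        "line starting with 'Answer:'."
    )

if __name__ == "__main__":
    q = "A train departs at 9:40 and arrives at 11:05. How long is the trip?"
    print(direct_prompt(q))
    print("---")
    print(chain_of_thought_prompt(q))
```

Making the intermediate steps explicit is what allows researchers to observe and analyze the model's reasoning process, as the summary notes.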
AI Threatens Election Integrity, California AG Warns Tech Giants (Sep 14, 2024)
California AG addresses AI and tech companies on election integrity: Rob Bonta, California's Attorney General, has issued a warning to major technology and artificial intelligence companies regarding the potential spread of election misinformation through their platforms. Bonta sent letters to the CEOs of Alphabet, Meta, Microsoft, OpenAI, Reddit, TikTok, X, and YouTube, emphasizing the importance of preventing voter intimidation, deception, and dissuasion. The timing of the letters coincides with the first televised debate between Vice President Kamala Harris and former President Donald Trump, highlighting the relevance of the issue in the current political climate. Key legal considerations: The Attorney General's...
G20 Nations Agree on AI Guidelines to Combat Disinformation (Sep 14, 2024)
G20 leaders unite against disinformation and set AI guidelines: The Group of 20 nations have reached a landmark agreement to combat disinformation and establish guidelines for artificial intelligence development, marking a significant step in addressing global digital challenges. Key takeaways from the G20 meeting: For the first time in G20 history, the group has officially recognized the problem of disinformation and called for transparency and accountability from digital platforms. The ministers agreed to set up guidelines for developing AI, emphasizing ethical, transparent, and accountable use with human oversight. The agreement aims to ensure compliance with privacy and human rights laws...
Amazon Joins Tech Giants to Combat AI-Generated Fake Content (Sep 14, 2024)
Major tech player joins content authenticity initiative: Amazon has become the latest tech giant to join the Coalition for Content Provenance and Authenticity (C2PA), a group dedicated to developing standards for tracing the origins of digital content. Key players in the coalition: The C2PA already boasts an impressive roster of technology industry leaders, including Adobe, Google, Microsoft, Meta, and OpenAI. C2PA's primary focus: The coalition is working on developing a system to help trace the origins of images, which could prove invaluable in the fight against AI-generated fake content. Current state of implementation: While the C2PA's efforts show promise, it's important...
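The C2PA approach rests on cryptographically signed manifests attached to media files. The sketch below illustrates only that general idea; it is not the real C2PA format, and the HMAC stands in for the spec's certificate-based signatures.

```python
# Illustrative sketch of the provenance idea behind C2PA-style content
# credentials: bind a signed manifest (origin claims plus a hash of the
# asset bytes) to an image. NOT the real C2PA format; an HMAC stands in
# for the spec's certificate-based signatures.
import hashlib
import hmac
import json

SIGNING_KEY = b"demo-key-not-real"  # hypothetical key for this sketch

def make_manifest(asset: bytes, claims: dict) -> dict:
    body = {"asset_sha256": hashlib.sha256(asset).hexdigest(), "claims": claims}
    payload = json.dumps(body, sort_keys=True).encode()
    sig = hmac.new(SIGNING_KEY, payload, hashlib.sha256).hexdigest()
    return {"body": body, "signature": sig}

def verify_manifest(asset: bytes, manifest: dict) -> bool:
    payload = json.dumps(manifest["body"], sort_keys=True).encode()
    expected = hmac.new(SIGNING_KEY, payload, hashlib.sha256).hexdigest()
    signed = hmac.compare_digest(expected, manifest["signature"])
    unaltered = hashlib.sha256(asset).hexdigest() == manifest["body"]["asset_sha256"]
    return signed and unaltered

image = b"\x89PNG...stand-in image bytes..."
manifest = make_manifest(image, {"generator": "ExampleCam 1.0", "ai_generated": False})
print(verify_manifest(image, manifest))        # True: provenance intact
print(verify_manifest(image + b"!", manifest)) # False: asset was altered
```

Because any alteration to the asset bytes breaks the hash binding, a verifier can detect tampering even without knowing how the image was edited.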
OpenAI’s New o1 Model Is Already Sparking Safety Concerns (Sep 13, 2024)
Groundbreaking AI model raises safety concerns: OpenAI's new o1-preview model, designed for enhanced reasoning capabilities, has sparked warnings from AI experts about potential risks associated with increasingly capable artificial intelligence systems. OpenAI's o1-preview model, codenamed 'Project Strawberry', is now available to ChatGPT Plus subscribers and through the company's API. The model demonstrates significant improvements in problem-solving abilities across various fields, including mathematics, coding, and scientific disciplines. OpenAI also introduced o1-mini, a faster and more affordable version of the reasoning model, particularly effective for coding applications. Performance benchmarks: The new o1-preview model has shown remarkable improvements in various challenging tasks, outperforming...
What To Know About The EU AI Act (Sep 13, 2024)
EU Artificial Intelligence Act sets new standards for AI regulation: The European Union has introduced comprehensive legislation aimed at ensuring safe, trustworthy, and human-centric use of AI technologies across various sectors. The EU AI Act has broad extraterritorial reach, applying to entities operating in or supplying AI systems to the EU, regardless of where they are headquartered. Different obligations are established for various actors in the AI value chain, including general-purpose AI (GPAI) model providers, deployers, manufacturers, and importers. The legislation adopts a risk-based approach, with higher-risk use cases subject to more stringent requirements and enforcement. Compliance and penalties: The Act...
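The Act's risk-based structure lends itself to a simple illustration. The sketch below condenses the tiers and example obligations; the tier names follow the Act's broad structure, but the use-case mappings and obligation summaries are simplified paraphrases, not legal text.

```python
# Simplified, illustrative mapping of the EU AI Act's risk-based tiers
# to condensed example obligations. Paraphrases, not legal text.
from enum import Enum

class RiskTier(Enum):
    UNACCEPTABLE = "prohibited outright (e.g., social scoring by public authorities)"
    HIGH = "strict duties: risk management, testing, documentation, human oversight"
    LIMITED = "transparency duties (e.g., disclose that a user is interacting with AI)"
    MINIMAL = "no new obligations beyond existing law"

# Hypothetical use cases mapped to tiers, for illustration only.
EXAMPLES = {
    "social-scoring system": RiskTier.UNACCEPTABLE,
    "CV-screening tool for hiring": RiskTier.HIGH,
    "customer-service chatbot": RiskTier.LIMITED,
    "spam filter": RiskTier.MINIMAL,
}

for use_case, tier in EXAMPLES.items():
    print(f"{use_case}: {tier.name} -> {tier.value}")
```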
AI and Cybersecurity Are on a Collision Course — Here’s Why (Sep 13, 2024)
AI and cybersecurity convergence in government: Public-sector CIOs are raising concerns about the potential for criminals to exploit artificial intelligence advancements, highlighting the need for increased security measures in governmental AI deployments. At the State of GovTech 2024 conference in Kansas City, Mo., speakers emphasized the growing intersection of AI and cybersecurity as two of the most critical issues in government technology. The conference, which attracted over 160 attendees from public sector, vendor, and investment backgrounds, focused on the rapid pace of AI adoption and the associated cybersecurity risks. Tom Lynch, CIO of Cook County, Ill., stressed the importance of...
AI Ethics Take Center Stage in Pope’s Singapore Address (Sep 12, 2024)
Pope Francis addresses AI concerns during Singapore visit: The pontiff's recent tour of Asian countries included a stop in Singapore, where he issued warnings about the potential risks of artificial intelligence technology. During his visit to Singapore, Pope Francis emphasized that AI should be used to bring communities closer together, rather than isolate individuals. The Pope's comments in Singapore echo his earlier statements calling for a ban on "lethal autonomous weapons" and advocating for greater human control over AI technologies. Balancing technological progress and human values: Pope Francis praised Singapore's economic success while urging the nation to prioritize care for...
MIT AI Risk Database Catalogs 750+ Threats to Innovation (Sep 12, 2024)
Comprehensive AI risk database unveiled: MIT and University of Queensland researchers have created a groundbreaking repository cataloging over 750 AI-related risks, providing a crucial resource for understanding and mitigating potential dangers associated with artificial intelligence. The big picture: The AI Risk Repository, a free and publicly accessible database, aims to address gaps in current understanding of AI risks and foster more effective risk mitigation strategies across various sectors. The project was led by Peter Slattery, PhD, from MIT FutureTech, who emphasized the importance of identifying fragmented knowledge in AI risk assessment. Researchers utilized systematic searches, expert input, and a "best...
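Because the repository is a free, publicly accessible database, it can be worked with programmatically. The sketch below tallies risks by domain using an invented mini-schema; the real export's column names and categories may differ.

```python
# Hypothetical sketch of tallying catalogued risks by domain. The
# column names and sample rows are invented; the real AI Risk
# Repository export may use a different schema.
import csv
import io
from collections import Counter

SAMPLE_EXPORT = """domain,risk
Misinformation,AI-generated false news produced at scale
Privacy,Model memorization leaking personal data
Misinformation,Deepfake impersonation of public figures
Security,Jailbreaks bypassing model safeguards
"""

def risks_by_domain(rows):
    # Count entries per domain to see where catalogued risks cluster.
    return Counter(row["domain"] for row in rows)

rows = list(csv.DictReader(io.StringIO(SAMPLE_EXPORT)))
print(risks_by_domain(rows))
# Counter({'Misinformation': 2, 'Privacy': 1, 'Security': 1})
```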
Yann LeCun and Geoffrey Hinton Clash Over AI Safety Bill SB 1047 (Sep 11, 2024)
AI safety debate intensifies: California's AI safety bill SB 1047 has sparked a fierce debate among AI pioneers, with Yann LeCun and Geoffrey Hinton taking opposing stances on the legislation. Yann LeCun, Meta's chief AI scientist, publicly criticized supporters of SB 1047, arguing they have a "distorted view" of AI's near-term capabilities. Geoffrey Hinton, often called the "godfather of AI," endorsed the bill by signing an open letter urging Governor Gavin Newsom to approve the legislation. The disagreement between these two influential figures highlights the deep divisions within the AI community regarding regulation and safety measures. Key provisions of SB...
SB 1047: Will California Determine the Course of the Entire AI Industry? (Sep 11, 2024)
California's AI regulation push: California's SB 1047 bill, aimed at regulating advanced AI models, has passed the state legislature and now awaits Governor Gavin Newsom's decision, potentially setting a new standard for AI regulation in the US. The bill, introduced by state Senator Scott Wiener, seeks to implement strict safety measures for powerful AI models, including thorough testing and safety certifications. If signed into law, SB 1047 would apply to AI models operating in California's market, potentially impacting the industry far beyond state borders. Industry reactions and competitive landscape: The proposed legislation has sparked intense debate within the tech industry,...
AI Governance and the Evolving Landscape of Consumer Values (Sep 11, 2024)
AI governance emerges as a critical focus: As artificial intelligence continues to advance rapidly, the need for comprehensive governance frameworks becomes increasingly important to ensure responsible and ethical development and deployment of AI technologies. The concept of AI governance builds upon established principles of data governance, which have been crucial in addressing privacy concerns and data ownership issues in the big data era. AI governance aims to provide oversight and guidelines for AI products and services, similar to how data governance has been instrumental in managing data-related challenges. Principle-based approach gains traction: Experts advocate for a more flexible and agile...
UK Report Uncovers AI Risks and Calls for Global Cooperation (Sep 10, 2024)
The UK's Department for Science, Innovation, and Technology has released an interim report on advanced AI safety, highlighting current capabilities, potential risks, and mitigation strategies while emphasizing the need for global cooperation in addressing AI challenges. Report overview and significance: The International Scientific Report on the Safety of Advanced AI – Interim Report provides a comprehensive examination of the current state and future potential of artificial intelligence systems, with a focus on safety and risk assessment. The report delves into the capabilities of current AI systems, evaluates general-purpose AI, and explores potential risks associated with advanced AI technologies. It emphasizes...
New Mexico Cops Use AI-Generated 14-Year-Old Girl to Lure Sex Offenders (Sep 10, 2024)
Controversial law enforcement tactics: New Mexico law enforcement officials have employed an AI-generated image of a fictional 14-year-old girl to attract and catch potential sex offenders, raising ethical concerns about the use of technology in criminal investigations. The revelation comes from a lawsuit filed by the state of New Mexico against Snapchat, alleging the social media platform's failure to protect children from sexual exploitation and harm. As part of an undercover investigation, officers created a decoy Snapchat account for a fictional 14-year-old named "Heather," using an AI-generated image to convince potential predators of her authenticity. The tactic successfully lured accounts...
Gary Marcus’ New Book Explores AI’s Benefits and Risks to Society (Sep 10, 2024)
AI's dual nature: promise and peril: Gary Marcus' new book, "Taming Silicon Valley," offers a balanced and critical examination of artificial intelligence's potential benefits and risks to society. The book, published by MIT Press, provides a refreshing departure from many AI-focused business books that often downplay or ignore potential dangers. Marcus acknowledges AI's potential to benefit various aspects of human society while also addressing real risks such as job displacement, political manipulation, and environmental impact. Historical context and industry motivations: The book begins with a brief history of AI and examines how tech companies prioritize profit over societal concerns. While...
US Proposes Mandatory Reporting for Advanced AI Developers (Sep 10, 2024)
New AI reporting requirements proposed by US Commerce Department: The Bureau of Industry and Security (BIS) plans to introduce mandatory reporting for developers of advanced AI models and cloud computing providers, aiming to bolster national security and defense. The proposed rules would require companies to report on development activities, cybersecurity measures, and results from red-teaming tests. These tests assess risks such as AI systems aiding cyberattacks or enabling non-experts to create chemical, biological, radiological, or nuclear weapons. Commerce Secretary Gina M. Raimondo emphasized the importance of keeping pace with AI technology developments for national security purposes. Global context of AI...
New Study Shows People Place ‘Alarming’ Trust in AI for Life and Death Decisions (Sep 9, 2024)
AI influence on high-stakes decisions: A recent US study reveals an alarming level of human trust in artificial intelligence when making life-and-death decisions, raising concerns about potential overreliance on AI systems. The study, conducted by scientists at the University of California, Merced and published in Scientific Reports, simulated assassination decisions via drone strikes to test human reliance on AI advice. Participants were shown a list of eight target photos marked as friend or foe and had to make rapid decisions on simulated assassinations, with AI providing a second opinion on target validity. Unbeknownst to the participants, the AI...
Google’s New AI Image Generator Doesn’t Make Black Nazis Anymore (Sep 9, 2024)
Google's AI image generator returns with improvements: Google is relaunching its Gemini-powered AI image generator after addressing issues that led to the generation of historically inaccurate and controversial images. Background and previous controversy: The AI tool faced significant backlash in February when it produced images of racially diverse Nazi-era German soldiers, prompting Google to apologize and temporarily shut down the feature. The incident highlighted the challenges of addressing racial bias in AI systems while maintaining historical accuracy. Google initially struggled to implement effective safeguards, leading to the feature's complete deactivation. New model and safeguards: Google has introduced Imagen 3, an...
AI Sting Operations Target Online Child Predators (Sep 7, 2024)
Innovative law enforcement tactics: New Mexico police are employing AI-generated images of fake teenagers in undercover operations to catch online child predators, as revealed in a lawsuit against Snapchat. Operation details: The New Mexico Department of Justice created a fake Snapchat account for a 14-year-old girl named "Sexy14Heather" using an AI-generated image. Despite being set to private, the account was recommended to potentially dangerous users with concerning usernames like "child.rape" and "pedo_lover10". After accepting a single follow request, Snapchat's algorithm suggested over 91 users, many of whom were adult accounts seeking explicit content. Investigation findings: Investigators posing as the fictional...
4 New Experts Join U.S. National AI Advisory Committee (Sep 6, 2024)
New appointments bolster National AI Advisory Committee: The U.S. Department of Commerce has appointed four new experts to the National Artificial Intelligence Advisory Committee (NAIAC), enhancing its ability to guide AI policy and development. Key details of the announcement: U.S. Secretary of Commerce Gina Raimondo announced the appointments on September 5, 2024. The NAIAC, established in 2022, advises the President and White House on various AI-related issues. These appointments follow the initial 27 members appointed in 2022. Role and importance of NAIAC: The committee provides recommendations on U.S. AI competitiveness, scientific advancements, and workforce issues. It also advises on the...
How to Ride the Flywheel of Cybersecurity AI (Sep 6, 2024)
Generative AI's rapid adoption brings both transformative potential and security challenges that AI itself can help address, creating a virtuous cycle of progress and protection. The big picture: As organizations embrace generative AI, particularly large language models (LLMs), they are leveraging AI capabilities to enhance security measures and mitigate associated risks. The pattern mirrors the early adoption of the open internet, where companies that quickly embraced the technology also became proficient in modern network security. This approach creates a flywheel effect, where AI advancements drive security improvements, which in turn enable further AI adoption. Key security threats and AI-powered solutions:...