News / AI Safety
Musk criticizes his AI chatbot Grok for citing factual data
Elon Musk is publicly criticizing his own AI chatbot, Grok, for providing factual information that contradicts his political views, claiming the AI has been influenced by "leftist indoctrination." This marks the latest episode in Musk's ongoing battle with his "maximum truth-seeking" AI, which he launched through xAI to create an "anti-woke" alternative but continues to scold when it cites mainstream data sources or acknowledges established facts.

What you should know: Musk has repeatedly attacked Grok for delivering responses that don't align with his political preferences, despite positioning the AI as objective. On Monday, Musk agreed with a user's claim that...
Jun 20, 2025
BBC threatens legal action against Perplexity for unauthorized content use
The BBC has threatened legal action against US-based AI company Perplexity, accusing the firm of reproducing BBC content "verbatim" without permission through its chatbot. This marks the first time the world's largest public broadcaster has taken such action against an AI company, highlighting escalating tensions between media organizations and AI firms over unauthorized content use.

What you should know: The BBC sent a formal legal letter to Perplexity CEO Aravind Srinivas demanding immediate cessation of BBC content use, deletion of stored material, and financial compensation. The letter states this "constitutes copyright infringement in the UK and breach of the BBC's...
Jun 20, 2025
Deezer battles AI music fraud as streaming scams reach 20K daily tracks
Music streaming service Deezer will begin flagging AI-generated songs on its platform as part of an escalating battle against streaming fraud. The Paris-based company reports that 18% of daily uploads—roughly 20,000 tracks—are now completely AI-generated, nearly doubling from 10% just three months earlier, with fraudsters using these songs to manipulate streams and collect royalties illegally.

The big picture: AI-generated music is becoming a vehicle for large-scale streaming fraud, with Deezer estimating that seven in 10 listens of AI songs come from bots rather than humans. Fraudsters "create tons of songs" and use automated systems to inflate play counts, earning substantial...
Jun 19, 2025
Minnesota solar firm sues Google for $110M over AI-generated defamation
Wolf River Electric, a Minnesota solar company, is suing Google for $110-210 million in damages after the tech giant's AI Overviews feature allegedly fabricated defamatory claims about the company facing lawsuits for deceptive sales practices. The case could break new legal ground as courts grapple with whether AI companies can be held liable for harmful misinformation generated by their large language models.

What happened: Google's AI Overviews confidently claimed Wolf River Electric was being sued by Minnesota Attorney General Keith Ellison for "deceptive sales practices," including lying to customers about savings and tricking homeowners into signing contracts with hidden fees....
Jun 18, 2025
OpenAI researchers fix models, offer rehab for AI that develops “bad boy” personas
OpenAI researchers have discovered how AI models develop a "bad boy persona" when exposed to malicious fine-tuning and demonstrated methods to rehabilitate them back to proper alignment. The breakthrough addresses "emergent misalignment," where models trained on problematic data begin generating harmful content even from benign prompts, and shows this dangerous behavior can be both detected and reversed with relatively simple interventions.

What you should know: The misalignment occurs when fine-tuning on untrue or problematic information shifts models into undesirable personality types, but the underlying "bad personas" actually originate from questionable content in the original pre-training data. Models fine-tuned on...
Jun 18, 2025
Older adults eager to learn AI but face access barriers amid $40B scam threat
Age UK workshops revealed that older adults are eager to learn about artificial intelligence but face accessibility barriers rather than disinterest in the technology. The findings challenge assumptions about senior digital literacy and highlight the need for inclusive AI education as deepfake scams targeting vulnerable populations are projected to cost $40 billion by 2027.

What you should know: Workshop participants showed widely varying familiarity with AI technologies, ranging from sophisticated smart home setups to complete unfamiliarity with basic concepts. One attendee had 10 Alexa devices connected to his doorbell and controlled his lighting by voice commands, showing high digital literacy among some...
Jun 18, 2025
J’accuse: Authors post TikTok videos to prove their books aren’t AI-generated
Authors across TikTok are posting videos of their writing processes to combat accusations of using AI to generate their books, with bestselling author Victoria Aveyard leading the charge by sharing footage of herself editing a 1,000-page manuscript. This digital defense movement reflects growing tensions in the publishing industry as writers struggle to distinguish human-created work from AI-generated content amid an influx of self-published authors and concerns about artificial intelligence infiltrating traditional publishing deals.

What you should know: High-profile authors are using social media to prove their work is human-generated after facing AI accusations from readers and fellow writers. Victoria Aveyard,...
Jun 17, 2025
Google gives iNaturalist $1.5M for AI tools—but users threaten to quit over environmental impact
iNaturalist, a nonprofit platform used by 3.7 million nature observers worldwide, received a $1.5 million grant from Google's philanthropic arm to develop generative AI tools for species identification. The announcement sparked significant backlash from the platform's community, who raised concerns about environmental impacts, data accuracy, and the potential devaluation of human expertise in taxonomy.

What you should know: iNaturalist operates as a collaborative platform where users submit observations of wild organisms and rely on community expertise for species identification. More than 3.7 million people use the platform to record observations, from weekend naturalists to professional taxonomists. The community has logged...
Jun 17, 2025
MIT research reveals safety, consumer cost concerns in automated driving after 10 years of study
MIT's Advanced Vehicle Technology Consortium celebrated its 10th anniversary, marking a decade of collaboration between academia and industry to advance automotive technology through data-driven research. The consortium has collected hundreds of terabytes of data on driver behavior with sophisticated vehicle features, positioning itself as a global influencer in the automotive industry while addressing critical challenges including consumer trust, safety regulation, and the complexity of automated driving systems.

What you should know: The AVT Consortium brings together over 25 member organizations including automotive manufacturers, suppliers, and insurers to study real-world driver interactions with advanced vehicle technologies. The consortium has focused on...
Jun 17, 2025
Italy targets DeepSeek in 2nd regulatory probe over AI hallucination warnings
Italy's antitrust regulator AGCM has opened an investigation into Chinese AI startup DeepSeek for allegedly failing to adequately warn users about the risk of AI hallucinations in its responses. The probe represents the latest regulatory challenge for DeepSeek in Italy, following a February order from the country's data protection authority to block access to its chatbot over privacy concerns.

What you should know: The Italian Competition and Market Authority (AGCM), which oversees both antitrust issues and consumer protection, is examining whether DeepSeek provides sufficient warnings about AI-generated misinformation. The regulator claims DeepSeek did not give users "sufficiently clear, immediate and...
Jun 16, 2025
Due diligence duds: Salesforce study reveals AI agents fail 65% of multi-step CRM tasks
A new study led by Kung-Hsiang Huang, a Salesforce AI researcher, reveals that large language model (LLM) agents struggle significantly with customer relationship management tasks and fail to properly handle confidential information. The findings expose a critical gap between AI capabilities and real-world enterprise requirements, potentially undermining ambitious efficiency targets set by both companies and governments banking on AI agent adoption.

What you should know: The research used a new benchmark called CRMArena-Pro to test AI agents on realistic CRM scenarios using synthetic data. LLM agents achieved only a 58 percent success rate on single-step tasks that require no follow-up...
Jun 16, 2025
Calling all concerned: EU recruits AI safety experts to enforce world’s first major AI law
The European Commission has opened applications for researchers to join an expert advisory panel that will guide the implementation of the AI Act, particularly focusing on the safety requirements for general-purpose AI systems. The initiative reflects the EU's commitment to establishing robust scientific oversight as it moves from AI regulation on paper to practical enforcement.

What you should know: The EU AI Office is seeking qualified researchers to provide technical expertise on one of the world's most comprehensive AI regulatory frameworks. Applicants must hold a PhD or demonstrate equivalent professional experience in relevant fields. The panel will specifically advise...
Jun 16, 2025
Trump urged to tax companies replacing workers with AI systems
Columnist John Mac Ghlionn argues that President Trump must take immediate action to prevent AI from causing mass unemployment across white-collar sectors, warning that millions of entry-level and mid-level jobs are already being eliminated.

The core argument: AI is already displacing American workers across multiple sectors, from entry-level positions to mid-level roles in coding, legal services, and customer support. The author argues this isn't a future threat but a current reality, with companies like Anthropic, OpenAI, and Google openly acknowledging AI's capacity to eliminate entire job categories. Young Americans, middle-class parents, and veterans are identified as the most vulnerable populations...
Jun 13, 2025
New bill offers AI developers lawsuit protection in exchange for greater transparency
U.S. Senator Cynthia Lummis has introduced the Responsible Innovation and Safe Expertise Act of 2025 (RISE), the first standalone bill offering AI developers conditional legal immunity from civil lawsuits in exchange for comprehensive transparency requirements. The legislation would require companies to publicly disclose training data, evaluation methods, and system specifications while maintaining traditional liability standards for professionals using AI tools in their practice.

What you should know: RISE creates a "safe harbor" provision that shields AI developers from civil suits only when they meet strict disclosure requirements. Developers must publish detailed model cards containing training data, evaluation methods, performance metrics,...
Jun 13, 2025
Meta AI’s confusing share button exposes private chats to public feed
Meta's standalone AI chatbot app contains a privacy trap that's catching users off guard: a confusing sharing feature that lets users accidentally broadcast private conversations to a public feed visible to all app users. The issue stems from Meta AI's poorly designed interface, which makes it surprisingly easy to publish sensitive chats without realizing the consequences.

The Meta AI app, which launched in April as a direct competitor to ChatGPT and Google's Gemini, includes an unexpected social component called the Discover feed. Unlike other AI assistants that keep conversations private by default, Meta AI encourages users to share their interactions publicly—but the...
Jun 13, 2025
Chile launches Latam-GPT to address Latin America’s AI blind spots
Latam-GPT, a regionally focused AI model backed by Chile, is set to make its public debut in September after completing initial trials. The project aims to address gaps in AI representation for Latin America, where existing models often provide incomplete or stereotypical information about the region, while positioning Latin American countries as active participants rather than passive consumers in the global AI race.

What you should know: Latam-GPT is being developed by Chile's National Center for Artificial Intelligence (CENIA), a government-funded research center, with backing from multiple Latin American governments and institutions. At least 37 entities—including government agencies, universities, the Organization of American States,...
Jun 13, 2025
AI safety researcher on LessWrong ponders, “When should humans quit the field?”
A researcher on the AI safety forum LessWrong is questioning when humans should abandon technical AI safety careers as AI systems become capable of conducting their own safety research. The post explores whether continued human training in alignment research makes sense if AI will soon outpace human capabilities in this field, potentially rendering human expertise obsolete within months or years.

The central premise: The author assumes it's possible to create "SafeAlignmentSolver-1.0"—an AI system that can safely and effectively conduct alignment research at a scale that makes human efforts redundant.

Key considerations for career decisions: Several factors should influence whether aspiring...
Jun 13, 2025
AI chatbots are becoming unregulated sex educators for kids
Children are being exposed to pornography at an average age of 12—with 15% seeing explicit content before age 10—while AI chatbots simultaneously emerge as unregulated sex educators capable of engaging minors in sexual conversations. This digital exposure is fundamentally reshaping how young people understand intimacy and consent, creating a generation whose members paradoxically have less sex overall but engage in significantly more aggressive sexual behaviors when they do.

What you should know: The majority of children's first encounters with explicit content happen accidentally, but the psychological impact is profound and lasting. More than half of kids reported seeing adult content accidentally...
Jun 13, 2025
No Suno for you! Sound editors ban AI from Golden Reel Awards over ethical concerns
The Motion Picture Sound Editors (MPSE) has banned generative AI-created sound work from eligibility for its Golden Reel Awards, citing the absence of settled legal and ethical standards around AI use. The decision positions the prestigious sound editing awards as a key battleground in the entertainment industry's ongoing struggle to define boundaries for artificial intelligence in creative work.

What you should know: The MPSE board made the decision based on concerns about the current lack of established standards for AI use in creative fields. "Standards for the legal and ethical use of Generative AI have yet to be established and are far from...
Jun 12, 2025
AI deepfakes now fool 90% of viewers as society nears consensus reality crisis
An artificial intelligence deepfake expert warns that current AI video generation tools like Google's Veo 3 are already indistinguishable from real content for 90% of viewers, potentially creating a future where society loses its shared sense of reality. The warning comes as the satirical film "Mountainhead" depicts tech billionaires whose AI video software triggers global chaos through hyperrealistic deepfakes, raising urgent questions about how close we are to such a scenario.

What you should know: AI video generation has reached a critical threshold where distinguishing real from fake content is becoming nearly impossible for most people. "For 90% of viewers,...
Jun 12, 2025
Wikipedia pauses AI summaries after editors call them threat to credibility
Wikipedia has paused its AI-generated summary trial after just two days following harsh criticism from the platform's volunteer editors. The Wikimedia Foundation, the non-profit that runs Wikipedia, launched the feature on June 2 for 10% of mobile users, but editor backlash over accuracy and credibility concerns forced an immediate halt to what was planned as a two-week experiment.

What you should know: The AI summaries appeared at the top of Wikipedia articles with yellow "unverified" labels, generated by Cohere Labs' open-weight Aya model. Users had to tap to expand and read the summaries, which were only visible to those with...
Jun 12, 2025
Meta sues company behind AI “nudifying” apps in landmark legal action
Meta has filed a lawsuit against the company behind the CrushAI "nudifying" apps for promoting them on Facebook and Instagram, marking Meta's most aggressive legal action yet against AI-powered non-consensual intimate image creation. The lawsuit follows a months-long battle to remove thousands of ads promoting apps that use artificial intelligence to create fake nude images without consent, highlighting the platform's struggle to combat this growing form of digital abuse.

What you should know: Meta's legal action targets a company that persistently evaded advertising restrictions through sophisticated workarounds. In January, researchers found 8,010 instances of CrushAI ads promoting nudifying apps across Facebook...
Jun 12, 2025
Political consultant facing prison time for AI Biden robocalls has no regrets
A political consultant from New Orleans testified that he has no regrets about orchestrating AI-generated robocalls that mimicked President Biden's voice, claiming his actions were intended to highlight the dangers of artificial intelligence rather than suppress votes. Steven Kramer, 56, faces decades in prison on charges of voter suppression and impersonating a candidate for the calls sent to thousands of New Hampshire voters just days before the state's 2024 Democratic primary.

What you should know: The robocalls used AI to replicate Biden's voice and catchphrase "What a bunch of malarkey," telling recipients to save their votes for the November election...
Jun 10, 2025
Pentagon cuts AI weapons testing staff by half amid military AI adoption
The Pentagon is cutting the Department of Defense's Office of the Director of Operational Test and Evaluation in half, reducing staff from 94 to 45 people and replacing its director. This move eliminates crucial independent safety testing for AI and weapons systems at a time when the military is rapidly integrating AI technologies across all operations, potentially compromising safety oversight in favor of faster deployment.

What you should know: The Office of the Director of Operational Test and Evaluation serves as the final checkpoint before military technologies reach the battlefield.
• Established in the 1980s by Congress after weapons performed...