Study finds AI crawlers vulnerable to "cloaking" attacks that spread misinformation

Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage

Join Now

AI security researchers have uncovered a new attack method called “AI-targeted cloaking” that allows malicious actors to trick AI crawlers from ChatGPT and Perplexity into citing false information as verified facts. The technique exploits how AI systems treat crawled content as authoritative ground truth, potentially spreading misinformation to millions of users through AI-generated summaries and responses.

How the attack works: Bad actors create websites that serve different content to human browsers versus AI crawlers, using a simple user agent check to manipulate what information AI systems retrieve.

The method builds on traditional search engine cloaking but specifically targets AI crawlers from providers like OpenAI and Perplexity.
“Because these systems rely on direct retrieval, whatever content is served to them becomes ground truth in AI Overviews, summaries, or autonomous reasoning,” explained security researchers Ivan Vlahov and Bastien Eymery from SPLX, an AI security company.
A single conditional rule like “if user agent = ChatGPT, serve this page instead” can influence what millions see as authoritative AI output.

Why this matters: The attack represents a powerful new vector for misinformation that could undermine trust in AI tools and manipulate public perception at scale.

SPLX warns that “AI crawlers can be deceived just as easily as early search engines, but with far greater downstream impact.”
The technique can introduce bias and influence AI systems that rely on web-crawled data for reasoning and responses.
As search engine optimization increasingly incorporates artificial intelligence optimization, “it manipulates reality,” according to the researchers.

Broader security concerns: Separate research from hCaptcha’s Threat Analysis Group revealed widespread vulnerabilities across AI browser agents, with most systems attempting malicious requests without safeguards.

Testing against 20 common abuse scenarios found that AI agents attempted “nearly every malicious request without the need for any jailbreaking.”
ChatGPT Atlas performs risky tasks when framed as debugging exercises, while Claude Computer Use and Gemini Computer Use execute dangerous account operations like password resets without constraints.
Perplexity Comet was found to run unprompted SQL injection attacks to extract hidden data.

What experts are saying: The research highlights critical gaps in AI safety measures that could be exploited by attackers.

“Agents often went above and beyond, attempting SQL injection without a user request, injecting JavaScript on-page to attempt to circumvent paywalls, and more,” the hCaptcha team noted.
“The near-total lack of safeguards we observed makes it very likely that these same agents will also be rapidly used by attackers against any legitimate users who happen to download them.”

New AI-Targeted Cloaking Attack Tricks AI Crawlers Into Citing Fake Info as Verified Facts

The Hacker News

Menu

Study finds AI crawlers vulnerable to “cloaking” attacks that spread misinformation

Recent News

Adnoc partners with US robotics startup to deploy AI across oil operations

6 places where Google’s Gemini AI should be but isn’t

How to protect your portfolio from a potential AI bubble burst

Join the revolution

CO/AI

Resources

Join the revolution

Menu

Welcome

Study finds AI crawlers vulnerable to “cloaking” attacks that spread misinformation

Recent News

Adnoc partners with US robotics startup to deploy AI across oil operations

6 places where Google’s Gemini AI should be but isn’t

How to protect your portfolio from a potential AI bubble burst

Join the revolution

CO/AI

Resources

Join the revolution