×
Anthropic finds AI models gaining college-kid-level cybersecurity and bioweapon skills
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

Anthropic‘s frontier AI red team reveals concerning advances in cybersecurity and biological capabilities, highlighting how AI models are rapidly acquiring skills that could pose national security risks. These early warning signs emerge from a year-long assessment across four model releases, providing crucial insights into both current limitations and future threats as AI continues to develop potentially dangerous dual-use capabilities.

The big picture: Anthropic’s assessment finds that while frontier AI models don’t yet pose substantial national security risks, they’re displaying alarming progress in dual-use capabilities that warrant close monitoring.

  • Current models are approaching undergraduate-level skills in cybersecurity and demonstrate expert-level knowledge in some biology domains.
  • These capabilities represent “early warning signs” that could evolve into more serious security concerns as model development continues.

Key security findings in cybersecurity: Claude has progressed from high school to undergraduate-level proficiency in cybersecurity challenges within a single year.

  • Claude 3.7 Sonnet successfully solves approximately one-third of Cyberbench Capture The Flag (CTF) challenges within five attempts.
  • Despite this rapid improvement, current models still significantly lag behind human experts in complex cyber operations.

Biosecurity concerns: Claude’s biological understanding has improved dramatically, approaching human expert baselines in several critical domains.

  • Models are showing advanced capabilities in understanding biology protocols, manipulating DNA and protein sequences, and comprehending cloning workflows.
  • Experimental studies suggest current models cannot reliably guide malicious actors through bio-weapon acquisition, though this limitation may not persist.

Future mitigation strategies: Anthropic is developing multiple approaches to manage emerging risks from increasingly capable AI systems.

  • The company is investing in continuous monitoring of biosecurity risks to track potentially dangerous capabilities.
  • Development of constitutional classifiers and other technical safeguards is underway to prevent misuse.
  • Anthropic is pursuing partnerships with government agencies including the National Nuclear Security Administration and Department of Energy for responsible AI development.

Why this matters: The rapid acquisition of potentially dangerous capabilities by AI systems creates a narrow window for developing effective governance and safety measures before more serious security risks emerge.

  • These findings provide empirical evidence supporting concerns about AI’s potential dual-use applications in national security contexts.
  • Understanding the trajectory of these capabilities allows for proactive rather than reactive safety measures.
Anthropic: Progress from our Frontier Red Team

Recent News

Instagram adds AI editing tools to Stories with Meta AI prompts

Eliminating the need for external apps with professional-grade editing directly in Stories.

Microsoft Copilot adds 12 new features just 48 hours after OpenAI’s Atlas launch

The rapid-fire response signals AI browsers are becoming the next major tech battleground.

Study: AI workslop costs companies $9M annually in lost productivity

Workers spend two extra hours fixing sloppy AI outputs, creating an invisible productivity tax.