Anthropic, a leading artificial intelligence safety and research company, has disclosed it was the target of a sophisticated and sustained cyber campaign aimed at stealing its proprietary AI models. The company, known for developing the Claude AI assistant, stated the attacks employed a combination of novel techniques, including what it terms "massive distillation attacks," alongside more traditional cyber threats.
According to a detailed security bulletin released by Anthropic's threat intelligence team, the campaign leveraged multiple attack vectors. A critical component was the exploitation of a zero-day flaw in a widely used cloud collaboration platform. This exploit provided the initial foothold within corporate networks, allowing the attackers to move laterally with considerable stealth.
The attackers' primary objective, however, was not a conventional data breach for financial information. Instead, the focus was on intellectual property theft through large-scale model distillation. This technique involves using the outputs of a complex, proprietary AI model to train a smaller, cheaper copy. The threat actors deployed automated systems that made millions of queries to Anthropic's API endpoints, systematically harvesting responses to "distill," or replicate, core functionalities of its models.
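The mechanics of distillation are straightforward in principle. A minimal illustrative sketch follows, in which a toy "teacher" function stands in for a proprietary model behind an API; the function names, decision rule, and query budget are all invented for illustration and bear no relation to any real system:

```python
# Toy sketch of query-based model distillation. The "teacher" stands in for
# a proprietary model reachable only through an API; the attacker sees its
# outputs but never its internals. All details here are illustrative.
import random

def teacher(x):
    # Hidden decision rule the attacker cannot inspect directly
    # (here, positive whenever 3x + 7 > 20, i.e. x > ~4.33).
    return 1 if 3 * x + 7 > 20 else 0

def distill(n_queries=1000, seed=0):
    rng = random.Random(seed)
    # Step 1: harvest (input, output) pairs by repeatedly querying the teacher.
    data = []
    for _ in range(n_queries):
        x = rng.uniform(0, 10)
        data.append((x, teacher(x)))
    # Step 2: fit the simplest possible "student" -- a threshold classifier --
    # to the harvested labels, placing the cut between the classes.
    positives = [x for x, y in data if y == 1]
    negatives = [x for x, y in data if y == 0]
    threshold = (min(positives) + max(negatives)) / 2
    return lambda x: 1 if x > threshold else 0

student = distill()
# The student now mimics the teacher without any access to its internals.
agreement = sum(student(x) == teacher(x) for x in range(11)) / 11
```

With enough queries the student's threshold converges on the teacher's hidden boundary, which is why the volume of API traffic, rather than any single query, is the telltale signature of this kind of attack.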
The campaign also featured more recognizable threats. Security analysts identified coordinated phishing attempts targeting senior researchers, lures carrying disguised malware, and evidence of ransomware deployed on non-critical infrastructure, likely as a distraction. While no customer data was compromised, the intellectual property assault represents a new frontier in corporate espionage, where the target is algorithmic intelligence itself.
In a notable twist, investigators found traces of cryptocurrency transactions used to fund parts of the operation. While the blockchain ledger provides a public record, the use of privacy-focused crypto mixers and shell companies has complicated efforts to definitively attribute the attacks to a specific nation-state or competitor. The sophistication points to a well-resourced actor.
Anthropic has since patched the identified zero-day and implemented new rate-limiting and query-monitoring systems to detect and prevent future distillation attempts. The company is collaborating with federal cybersecurity agencies and sharing technical indicators of compromise with the broader AI community. This incident underscores that for AI firms, cybersecurity now extends far beyond protecting databases to actively safeguarding the very models that constitute their foundational assets. The era of algorithmic theft has decisively arrived.
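One common building block for the kind of query monitoring described above is a sliding-window rate limiter that flags API keys exceeding a per-window query budget. The sketch below is a generic illustration of that pattern; the class name, thresholds, and flagging behavior are assumptions for the example, not a description of Anthropic's actual defenses:

```python
# Generic sketch of sliding-window query monitoring. Names and limits are
# illustrative only, not any vendor's real configuration.
from collections import defaultdict, deque

class QueryMonitor:
    def __init__(self, max_queries, window_seconds):
        self.max_queries = max_queries
        self.window = window_seconds
        self.history = defaultdict(deque)  # api_key -> recent query timestamps

    def allow(self, api_key, now):
        q = self.history[api_key]
        # Evict timestamps that have aged out of the sliding window.
        while q and now - q[0] >= self.window:
            q.popleft()
        if len(q) >= self.max_queries:
            # Over budget: throttle the request and flag the key for review.
            return False
        q.append(now)
        return True

monitor = QueryMonitor(max_queries=3, window_seconds=60)
results = [monitor.allow("key-1", t) for t in (0, 10, 20, 30, 100)]
# The fourth call (t=30) exceeds 3 queries per 60s; by t=100 the window
# has cleared, so the key is allowed again.
```

A production system would layer behavioral signals on top of raw counts, since distillation traffic is distinguished less by burst rate than by sustained, systematic coverage of the model's input space.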


