Following in the footsteps of OpenAI, AI powerhouse Anthropic has fired its own opening salvo. In a recently issued formal statement, the company accused three prominent Chinese AI startups—including DeepSeek—of orchestrating large-scale “distillation attacks” to illicitly extract conversational data from Claude to bolster their own competitive models. As the rivalry in the Large Language Model (LLM) sector reaches a fever pitch, high-fidelity training data has emerged as the industry’s most coveted asset. Reports indicate that Anthropic, the architect of the Claude chatbot, issued a strident appeal on its official website, naming DeepSeek, Moonshot AI, and MiniMax as the perpetrators behind an “industrial-scale campaign” designed to pilfer Claude’s capabilities. In the AI lexicon, “model distillation” is far from a novelty. It traditionally refers to the process by which smaller, more modest models refine their performance by learning from the outputs of formidable counterparts like GPT-4 or Claude. While distillation remains a legitimate optimization technique under specific licensing, Anthropic contends that these firms have transgressed ethical boundaries, transforming the practice into a malicious assault.
Anthropic asserts that these three Chinese enterprises deployed an estimated 24,000 fraudulent accounts to engage in over 16 million intensive dialogues with Claude. The company maintains that these competitors are utilizing Claude as a developmental “shortcut,” not only to expedite the creation of sophisticated models but potentially to circumvent the original safety guardrails. As for how Anthropic unmasked these “moles,” the company revealed it has reached a state of “high confidence” by meticulously tracking IP correlations, analyzing metadata request patterns and infrastructural signatures, and cross-referencing these findings with peers who observed analogous anomalies.
This incident is hardly a precedent; early last year, OpenAI leveled similar allegations, claiming rivals were employing distillation to replicate its model’s prowess, leading to a massive purge of suspicious accounts. In response, Anthropic has pledged to overhaul its defensive architecture, ensuring future distillation attacks are increasingly arduous to execute and more readily detectable.
However, a certain irony permeates this controversy: even as Anthropic decries the “theft” of its data, it remains embroiled in copyright litigation with major music publishers for allegedly utilizing protected song lyrics to train Claude. This “distillation war” pulls back the curtain on a sobering reality of modern AI development—the world is fast depleting its reservoir of high-quality training data. For Chinese AI firms like DeepSeek and Moonshot, which entered the fray later or find themselves hamstrung by U.S. high-end compute restrictions, the temporal and computational costs of scouring the web for pristine data are prohibitive. The path of least resistance? Directly querying the world’s preeminent AI systems and feeding these “golden responses”—structured and logically rigorous—into their own models. This is the essence of “distillation.” Anthropic’s indignation is palpable; after investing hundreds of millions in compute to “refine the elixir,” they find their labor effectively “freeloaded” for the mere cost of API calls from a multitude of disposable accounts.
Ultimately, this reflects the current Ouroboros-like ecosystem of the AI market: tech titans scrape the sum of human creativity without authorization to train foundation models, while startups, in turn, scrape the titans’ models to train their own. Until a global framework for AI data rights is firmly established, this cycle of intellectual mimicry and defensive escalation is only set to intensify.
Support Our Threat Intelligence
If you find our CVE report and cybersecurity news helpful, consider supporting our work.