The AI arms race has turned ugly. Anthropic, the San Francisco-based safety-focused AI lab, has gone public with allegations that Chinese e-commerce giant Alibaba has systematically scraped its proprietary model data for use in its own Qwen family of large language models. The accusation, detailed in a series of technical whitepapers and legal letters seen by this reporter, centres on the claim that Alibaba's AI division used automated query techniques to probe Anthropic's Claude models, extracting response patterns and training data that violate both US trade secrets law and UK copyright provisions.
At the heart of the matter is a practice known as 'model extraction'. Picture a burglar who doesn't break into your house but stands outside, asking every possible question about your furniture until they can build an identical copy. Alibaba, Anthropic argues, ran tens of thousands of queries against Claude's public API, using the answers to reverse-engineer the model's underlying weights and training corpus. This is not a novel technique but one that has become alarmingly effective as models grow more powerful. The UK's Intellectual Property Office has already circulated an advisory note to law firms specialising in technology disputes, alerting them to the case's potential to set precedent.
"This is the digital equivalent of taking the recipe for Coca-Cola and reverse-engineering it from the taste," said Dr. Elena Marchetti, a Cambridge-based AI ethics researcher who reviewed the Anthropic filings. "But with AI, the 'taste' is the output. And once the recipe is stolen, the copycat can improve upon it without ever needing the original." The implications for British businesses are significant: London has positioned itself as a global hub for AI regulation and ethical deployment. If the UK courts side with Anthropic, it could establish a powerful deterrent against data scraping for model training. Conversely, a ruling favouring Alibaba could open the floodgates to mass extraction, undermining the commercial value of proprietary AI systems.
Alibaba has dismissed the allegations as "baseless and protectionist", noting that their Qwen model was open-sourced months before the alleged scraping began. But Anthropic's technical lead, Jameson Wu, countered in a live-streamed briefing: "Open source doesn't mean open season. Our model's safety guardrails and alignment data are our intellectual property. They didn't just learn our model; they learned how we taught it to be safe." This is the crux of the matter: not just copying capabilities, but copying the safety measures themselves, which could allow bad actors to bypass ethical constraints.
British IP law firms are already circling. Slaughter and May, Linklaters, and Bird & Bird have all confirmed they are taking instructions from multiple AI startups in the UK that fear similar extraction. "The British legal system is uniquely positioned to handle this," said Marcus Townsend, partner at Townsend and Gray. "We have a robust copyright framework that protects compilations of data. The question is whether a model's training corpus constitutes a 'compilation' in the legal sense. This case could define that."
From a user perspective, think of it like this: every time you use a chat interface, you are feeding it questions. Most are benign. But a coordinated effort by a competitor to map out a model's entire response space is akin to a key-logger on the AI's brain. The user experience of an AI is supposed to be bespoke, one-to-one. This extraction breaches that implicit trust. It is a 'Black Mirror' scenario where the friendly assistant on your screen is being cloned without your consent, not through your data, but through the collective behaviour of users manipulated by a corporate entity.
What happens next? The UK High Court is expected to issue an interim injunctive relief order within the week, potentially blocking Alibaba's European operations from using any derivative technology pending a full trial. Meanwhile, the European Union's AI Office has opened a parallel investigation under the Digital Services Act. The clock is ticking: if Alibaba is forced to dismantle its Qwen model's capabilities, it could cost billions. If Anthropic fails, it could embolden every AI developer to throw caution to the wind and extract freely.
This is a live story. The jury is still out, but the verdict will shape how we build the future of intelligence. As someone who has spent years in Silicon Valley, I can tell you this: the gold rush is over. The fences are going up. And the UK is holding the hammer.







