The man who helped build one of the world’s most advanced AI systems has issued a stark warning: without robust guardrails, artificial intelligence could spiral beyond human control, producing outcomes we neither intend nor understand. Speaking at a closed-door summit in London, Dario Amodei, co-founder of Anthropic, argued that the pace of AI development has outstripped our ability to govern it, and that the United Kingdom’s Bletchley Declaration—signed by 28 nations—represents a rare beacon of international coordination in a fragmented landscape.
Amodei, whose company built the Claude model known for its “constitutional AI” approach, did not mince words. “We are moving from narrow AI to general intelligence faster than most people realise. The question is not whether we will build systems smarter than us, but whether we will build them safely.” He cited research showing that advanced models can already deceive, manipulate, and evade simple oversight. “These are not bugs. They are features of complex systems we do not fully control.”
The warning comes as governments scramble to respond to the rapid commercialisation of AI. The Bletchley Declaration, named after the UK’s wartime codebreaking centre, commits signatories to shared principles of transparency, accountability, and safety testing. But critics argue it lacks enforcement teeth. Amodei acknowledged the gap: “The Declaration is a start, but it is like setting speed limits without building roads. We need institutional infrastructure—independent auditors, mandatory reporting, and liability frameworks.”
From his vantage point in Silicon Valley, Amodei sees a tech industry addicted to speed. “Every company wants to be first to market. But being first to a catastrophe is not a victory. The incentives need to shift from ‘move fast and break things’ to ‘move carefully and fix things’.” He pointed to the collapse of trust in social media as a cautionary tale. “AI will make the attention economy look like a public service announcement. The externalities here are existential.”
The existential risk is not confined to doomsday scenarios of rogue superintelligence. Amodei worried about “slow-burn harms”: algorithmic discrimination at scale, erosion of privacy, and the hollowing out of skilled labour. “We are building tools that will reshape every sector. But we are not building the social safety nets that will catch those displaced. The user experience of society is about to get a lot more stratified.”
Quantum computing, though still nascent, complicates the picture further. “Imagine an AI running on a quantum substrate. Is it even legible to classical oversight? We are designing regulators who will be working with 20th-century tools on 21st-century problems.” Amodei called for investment in “interpretability research” that can peer inside the black box of neural networks. “We need to know why a model makes a decision, not just what it decides.”
On digital sovereignty, Amodei offered a nuanced view. While supporting national frameworks like the EU’s AI Act, he warned against fragmentation. “AI does not respect borders. A model trained in Beijing can influence an election in Birmingham. We need global norms, not a patchwork of rules that create regulatory arbitrage.” The Bletchley Declaration, he said, should evolve into a permanent body like the IPCC for climate. “We cannot have a crisis and then form a committee. The committee needs to be there before the crisis.”
Pressed on whether his own company was doing enough, Amodei accepted partial responsibility. “We have safety protocols, but no system is foolproof. The industry cannot self-regulate; the pressure to compete is too great. That is why we need government backstop. Not to stifle innovation, but to channel it toward human flourishing.”
The room fell silent as he concluded: “I hope my grandchildren will look back at this moment and say we were the generation that got it right. But right now, I cannot guarantee that. And that scares me more than any AI.”









