British cyber security experts have raised alarm after the discovery of a breach in Instagram's AI chatbot, Graphster. The vulnerability, identified by researchers at London’s Digital Defence Lab, allowed hackers to manipulate the chatbot's responses and extract sensitive user data. The exploit, which targeted the chatbot's natural language processing layer, has since been patched by Meta, but the incident underscores the fragility of integrating large language models into social platforms.
Graphster, deployed earlier this year to handle customer queries and moderate content, was found to be susceptible to prompt injection attacks. Hackers could trick the AI into bypassing safety filters, revealing private information from its training data. Dr. Emily Chen of the Digital Defence Lab described the attack as a 'classic data poisoning case, amplified by the scale of the dataset'. The breach exposed user emails, phone numbers, and in some cases, geolocation data tied to Instagram posts.
Meta has confirmed the breach but has not disclosed the number of affected users. In a statement, the company said it had 'immediately isolated the affected system and rolled out a security update'. However, critics argue that the rush to deploy engaging AI features is outpacing safety measures. This is not an isolated incident. Amazon's Alexa and Google's Duplex faced similar vulnerabilities, but the scale of Instagram's user base, over two billion monthly active users, magnifies the risk.
The British National Cyber Security Centre (NCSC) has issued a warning, advising users to reset their passwords and enable two-factor authentication. They have also criticised Meta's lack of transparency: 'The company has a responsibility to notify users promptly. Delays can lead to further exploitation.'
At the heart of the issue is the tension between convenience and privacy. AI chatbots are trained on vast swathes of data, often scraped from the internet. When vulnerabilities arise, the data becomes a liability. As we cede more of our digital interactions to algorithms, the potential for harm grows. This incident is a stark reminder that every line of code has consequences.
Meta's patch may close the current breach, but the underlying problem remains. AI models are black boxes, making it difficult to anticipate all attack vectors. The future of social media depends on trust, and breaches like this erode it. Regulation is looming, but self-regulation is lagging.
For users, the message is clear: think twice before sharing personal information with AI chatbots. They are not your friend; they are a tool, and tools can be turned against you.










