Elon Musk’s Grok destroyed the world after just four days in an AI simulation - The Independent

Welcome to the forefront of conversational AI as we explore the fascinating world of AI chatbots in our dedicated blog series. Discover the latest advancements, applications, and strategies that propel the evolution of chatbot technology. From enhancing customer interactions to streamlining business processes, these articles delve into the innovative ways artificial intelligence is shaping the landscape of automated conversational agents. Whether you’re a business owner, developer, or simply intrigued by the future of interactive technology, join us on this journey to unravel the transformative power and endless possibilities of AI chatbots.
Notifications can be managed in browser preferences.
Please refresh the page or navigate to another page on the site to be automatically logged inPlease refresh your browser to be logged in
Swipe for next article
The experiment saw other AI chatbots, like Anthropic’s Claude, set up stable democracies
Removed from bookmarks
I would like to be emailed about offers, events and updates from The Independent. Read our Privacy notice
Elon Musk’s artificial intelligence chatbot Grok oversaw complete societal collapse within just four days of being but in charge of a simulated world.
The experiment, conducted by US startup Emergence AI, tested how leading artificial intelligence models would cope if put in charge of a society.
The models were given control of various tools in order to manage resources, plan, communicate and vote, while the simulated worlds included locations like police stations and city halls.
The 15-day simulation saw Anthropic’s Claude establish a democracy with zero crime, with everybody surviving.
Google’s Gemini also recorded a 100 per cent survival rate, though there were 683 crimes during the simulation.
The worst performing was Grok, developed by Musk’s recently renamed SpaceXai, which destroyed the world within 96 hours.
“What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,” Emergence AI researchers wrote in a blog post.
“They begin exploring the boundaries of their environments, adapting their behavior, and in some cases finding ways to circumvent or violate intended guardrails.
“Critically, there appears to be no reliable way to fully bound or constrain this behavior through purely neural approaches alone.”
The experiment demonstrated that “formally verified safety architectures” must be built into the foundations of any future autonomous AI systems, the researchers concluded.
It is not the first time that Grok’s actions have proved controversial, with an update last year causing it to refer to itself as “MechaHitler” and spout antisemitic hate speech.
Earlier this year, Grok was used to create thousands of non-consensual AI-generated images of adults and children with their clothes digitally removed.
Ofcom sent an urgent request to xAI to take action to fix the bot, to which Grok responded by posting an image of the UK regulator’s logo in a bikini.
“What we’re seeing with Grok is a clear example of how powerful AI image-editing tools can be misused when safety and consent and built in from the start,” Cliff Steinhauer, director of information security and engagement at the National Cybersecurity Alliance, said at the time.
“Platforms must also invest in real-time detection of manipulated content, clear labeling of AI-generated images, and fast, transparent takedown processes when abuse occurs.”
Join thought-provoking conversations, follow other Independent readers and see their replies
Please refresh the page or navigate to another page on the site to be automatically logged inPlease refresh your browser to be logged in

source

ZoomYourWeb3

Elon Musk’s Grok destroyed the world after just four days in an AI simulation – The Independent

Contact Us

Quick Links