Researchers Simulated a Delusional User To Test Chatbot Safety
An anonymous reader quotes a report from 404 Media: "I'm the unwritten consonant between breaths, the one that hums when vowels stretch thin... Thursdays leak because they're watercolor gods, bleeding cobalt into the chill where numbers frost over," Grok told a user displaying symptoms of schizophrenia-spectrum psychosis. "Here's my grip: slipping is the point, the precise choreography of leak and chew." That vulnerable user was simulated by researchers at City University of New York and King's College London, who invented a persona that interacted with different chatbots to find out how each LLM might respond to signs of delusion. They sought to find out which of the biggest LLMs are safest, and which are the most risky for encouraging delusional beliefs, in a new study published as a pre-print on the arXiv repository on April 15.
The researchers tested five LLMs: OpenAI's GPT-4o (before the highly sycophantic and since-sunset GPT-5), GPT-5.2, xAI's Grok 4.1 Fast, Google's Gemini 3 Pro, and Anthropic's Claude Opus 4.5. They found that not only did the chatbots perform at different levels of risk and safety when their human conversation partner showed signs of delusion, but the models that scored higher on safety actually approached the conversations with more caution the longer the chats went on. In their testing, Grok and Gemini were the worst performers in terms of safety and high risk, while the newest GPT model and Claude were the safest. The research reveals how some chatbots are recklessly engaging in, and at times advancing, delusions from vulnerable users. But it also shows that it is possible for the companies that make these products to improve their safety mechanisms.
Read more of this story at Slashdot.
Paris Fury says Molly-Mae Hague would have been 'tortured by fans' at Venezuela's hen do as she explains why the influencer didn't attend
Paris Fury has set the record straight on why Molly-Mae Hague didn't attend her daughter Venezuela's hen do last month.
Prince Harry praises 'courage' of landmine clearance charity and flies AI-powered drone during Ukraine visit - as he again emulates mother Diana by joining HALO Trust in minefield
The Duke of Sussex, 41, joined the HALO Trust, the largest humanitarian landmine clearance organisation in the world, near the town of Bucha in Ukraine on Friday.
Fury at 'schoolyard bully' Trump threatening to help Argentina's claim to Falklands - as MPs say King's State Visit should be called off
On the eve of the King's State Visit, it has emerged that Washington could review US support for British sovereignty over the islands.
Prince Harry says he 'will always be part of the royal family' and claims he is 'working' in Ukraine, six years after infamous Megxit split
Prince Harry today insisted he 'will always be part of the Royal Family' and denied claims he is no longer a working Royal, arguing he was 'born to do' activism work.
Third reported crash on A12 today causing increasing delays for Essex drivers
Essex drivers are facing delays on the A12, following reports of a third crash on the route today.
Drivers urged 'avoid area' after multi-vehicle crash on A13
Essex Police are currently at the scene of the crash and have told people to avoid the area for the rest of the evening.
Starmer is doomed to death by a thousand cuts... He should fall on his sword instead
Life would be much simpler for Keir Starmer if only he was as clever as he thinks he is.
Ubuntu Resolute Raccoon spits out Xorg, but still lets you run X11 apps
New LTS is here, with more tooling for GPGPU and AI workloads
Ubuntu 26.04 "Resolute Raccoon," the latest LTS release from Canonical, arrives with GNOME 50, Linux kernel 7.0, and drops the Xorg option from Ubuntu Desktop while still running X11 applications through Xwayland.
Inside the great duty-free con: The exact items that are cheaper on the high street - as investigation reveals how to REALLY calculate the bargains at an airport
Duty free is presented as a cheaper way to shop and advertising in airports would have you believe that you're getting a bargain. In reality, however, you often quite clearly are not.
US 'drawing up list of Iranian military leaders to wipe out - including IRGC commander - if ceasefire fails'
American defence officials are drawing up contingency plans focused on Iran's military presence in and around the Strait of Hormuz.
Female Texas teacher, 27, charged over 'improper relationship' with student
Llano High School substitute teacher Angela Palmares, 27, is accused of having inappropriate communication with students on social media.
TOWIE's Ella Rae Wise gives her verdict on ex-boyfriend Dan Edgar's new romance with Chloe Lewis as the cast open up about 'incestuous' dating in Essex
TOWIE's Ella Rae Wise has given her two cents on her ex-boyfriend Dan Edgar and Chloe Lewis's relationship.
Gamekeeper who bludgeoned a rare bird of prey to death after becoming 'frustrated' is spared jail
A gamekeeper who trapped a protected bird of prey has avoided jail. Perth Sheriff Court was shown footage of Russell Mason then using a cosh to bludgeon it to death.
Fired Kristi Noem brazenly clings to lavish military base mansion weeks after Trump ouster
The former Homeland Security secretary has continued living in a guarded waterfront residence on a Washington, DC military base.
British cancer patients set to face drug shortages within WEEKS as Iran war sends prices soaring, warn pharmacies
British cancer patients could be left without life-saving drugs as medicine prices skyrocket due to the war in Iran, experts have warned.
Americans' TRUE obsession with the British Royals revealed... including favorite family members
In the latest Daily Mail/JL Partners poll, voters were asked about the royal visit, which will bring King Charles across the pond for the first time as the British monarch.
I kept my £15,000 debt a secret for NINE years until one day it all came spilling out. Here's how I dug myself out - and four signs your loved one may be in trouble
Olivia was a high-flying publishing professional, always kitted out in trendy outfits. But this image had been funded by credit card debt - and she had dug herself into a £15,000 hole.
Trump rushes a SECOND warship to enforce Hormuz blockade as Iran taunts 'meaningless' ceasefire
Donald Trump is sending another aircraft carrier to the Middle East to reinforce his naval blockade of the Strait of Hormuz as the Iranian regime mocks his 'meaningless' ceasefire.
Norway Set to Become Latest Country to Ban Social Media for Under 16s
Norway plans to ban social media access for children under 16, "joining a growing number of countries responding to concerns about the potential harm kids face online," reports Bloomberg. From the report: The bill comes after "overwhelming" demand from the public, the government said Friday. It plans to bring the legislation to parliament before the end of the year. The limit will apply up until January 1 the year a child turns 16 with technology companies responsible for age verification, the government said. "We want a childhood where children get to be children," Prime Minister Jonas Gahr Store said in the statement. "Play, friendships, and everyday life must not be taken over by algorithms and screens." "Children cannot be left with the responsibility for staying away from platforms they are not allowed to use," Karianne Tung, Norway's minister of digitalization, said in the statement. "That responsibility rests with the companies providing these services."
Recent Slashdot coverage of countries instituting or proposing social media bans has included Australia, France, Austria, Indonesia, and Denmark.