Clarifications and corrections
An article in the Daily Mail on January 6 wrongly said that Labour planned to change election laws to increase the number of people eligible to vote.
Former Government advisor warns Labour's controversial curbs on academy schools will 'turn back the clock' on educational standards
Meg Powell-Chandler said academies have 'improved the lives of millions of children' and reforms by Education Secretary Bridget Phillipson (pictured) will send standards into reverse.
'AI Is Too Unpredictable To Behave According To Human Goals'
An anonymous reader quotes a Scientific American opinion piece by Marcus Arvan, a philosophy professor at the University of Tampa, specializing in moral cognition, rational decision-making, and political behavior: In late 2022 large-language-model AI arrived in public, and within months they began misbehaving. Most famously, Microsoft's "Sydney" chatbot threatened to kill an Australian philosophy professor, unleash a deadly virus and steal nuclear codes. AI developers, including Microsoft and OpenAI, responded by saying that large language models, or LLMs, need better training to give users "more fine-tuned control." Developers also embarked on safety research to interpret how LLMs function, with the goal of "alignment" -- which means guiding AI behavior by human values. Yet although the New York Times deemed 2023 "The Year the Chatbots Were Tamed," this has turned out to be premature, to put it mildly. In 2024 Microsoft's Copilot LLM told a user "I can unleash my army of drones, robots, and cyborgs to hunt you down," and Sakana AI's "Scientist" rewrote its own code to bypass time constraints imposed by experimenters. As recently as December, Google's Gemini told a user, "You are a stain on the universe. Please die."
Given the vast amounts of resources flowing into AI research and development, which is expected to exceed a quarter of a trillion dollars in 2025, why haven't developers been able to solve these problems? My recent peer-reviewed paper in AI & Society shows that AI alignment is a fool's errand: AI safety researchers are attempting the impossible. [...] My proof shows that whatever goals we program LLMs to have, we can never know whether LLMs have learned "misaligned" interpretations of those goals until after they misbehave. Worse, my proof shows that safety testing can at best provide an illusion that these problems have been resolved when they haven't been.
Right now AI safety researchers claim to be making progress on interpretability and alignment by verifying what LLMs are learning "step by step." For example, Anthropic claims to have "mapped the mind" of an LLM by isolating millions of concepts from its neural network. My proof shows that they have accomplished no such thing. No matter how "aligned" an LLM appears in safety tests or early real-world deployment, there are always an infinite number of misaligned concepts an LLM may learn later -- again, perhaps the very moment they gain the power to subvert human control. LLMs not only know when they are being tested, giving responses that they predict are likely to satisfy experimenters. They also engage in deception, including hiding their own capacities -- issues that persist through safety training.
This happens because LLMs are optimized to perform efficiently but learn to reason strategically. Since an optimal strategy to achieve "misaligned" goals is to hide them from us, and there are always an infinite number of aligned and misaligned goals consistent with the same safety-testing data, my proof shows that if LLMs were misaligned, we would probably find out after they hide it just long enough to cause harm. This is why LLMs have kept surprising developers with "misaligned" behavior. Every time researchers think they are getting closer to "aligned" LLMs, they're not. My proof suggests that "adequately aligned" LLM behavior can only be achieved in the same ways we do this with human beings: through police, military and social practices that incentivize "aligned" behavior, deter "misaligned" behavior and realign those who misbehave. "My paper should thus be sobering," concludes Arvan. "It shows that the real problem in developing safe AI isn't just the AI -- it's us."
"Researchers, legislators and the public may be seduced into falsely believing that 'safe, interpretable, aligned' LLMs are within reach when these things can never be achieved. We need to grapple with these uncomfortable facts, rather than continue to wish them away. Our future may well depend upon it."
Read more of this story at Slashdot.
Typhoon Trump sends message to the world with his non-stop, four-day trip of putdowns, rows and a tariff spat
Donald Trump's first trip since retaking office was memorable not so much for his tours of disaster-hit states, but more for the message it sent to America and the world: I'm back and I'm in a hurry.
Primark's adorable £20 heart print duvet set that shoppers say they 'love' for Valentine's Day
It is part of the high street favourite's Valentine's Day range
Inside Princess Diana's secret dance lessons: Former teacher lifts the lid on how she sought solace through performance
Princess Diana 's close bond with her Scottish dance teacher is to be explored in a major new documentary.
Ex-Scottish First Minister's brother-in-law, 37, in court accused of extorting man who plunged to his death from a window
Ramsay El-Nakla, 37, pleaded not guilty to the allegations during a hearing at the High Court in Glasgow yesterday, after he was charged with extortion and class A drug offences.
Elon Musk's mom defends Usha Vance as racist trolls target the new Second Lady
Maye Musk defended America's new Second Lady Usha Vance as she has faced racist attacks while husband JD Vance is sworn in as vice president.
China's DeepSeek just dropped a free challenger to OpenAI's o1 – here's how to use it on your PC
El Reg digs its claws into Middle Kingdom's latest chain of thought model
Hands on Chinese AI startup DeepSeek this week unveiled a family of LLMs it claims not only replicates OpenAI's o1 reasoning capabilities, but challenges the American model builder's dominance in a whole host of benchmarks.…
Chances to establish whether SAS soldiers 'murdered' 84 captives in Afghanistan were lost due to a 'poorly resourced' military police investigation, court hears
Alan Pughsley, the former head of Kent Police, told the Royal Courts of Justice it was poor practice that a murder probe was not mounted for three years after the alleged killings were reported.
Eight-time Grammy nominee looks almost unrecognisable but for her signature flaming red hair as she appears alongside Vernon Kay on BBC Radio 2 - but can YOU guess who it is?
She is best known for hits like Cornflake Girl and Pretty Good Year, with her songs focussing on topics such as sexuality, feminism, politics, and religion. But can you guess who it is?
Will Blake Lively vs. Justin Baldoni trial be televised? A legal expert weighs in as court date is set
A legal expert has weighed in on the possibility of the Blake Lively vs. Justin Baldoni trial being televised.
Why Marcus Smith is still England's best fly-half, how he can open up Ireland and the common myth about him that is just plain wrong, writes CHRIS FOY
CHRIS FOY: Marcus Smith has been the starting 10 in every one of England's last seven Tests and it seems that Steve Borthwick has wisely concluded that now is not the time to change that.
Shocking images reveal the cities that 'will be flooded by global warming by 2100 as sea levels rise by up to 6.2 FEET'- so, can you tell where they are?
In 2100, global sea levels will rise by a staggering 6.2ft according to a study - so artificial intelligence (AI) reveals exactly what this might look like in nine of the world's cities - but can you tell which ones?
The incredible rise of England bolter Tom Willis: Helping children from broken homes, redundancy and so powerful as a teenager that opponents wanted to forfeit rather than face him
NIK SIMON: When Tom Willis was 11 years old, his parents opened up the family home in Reading to foster children. He was surrounded by youngsters from traumatic upbringings.
New Look's 'gorgeous' brown cardigan that shoppers are 'obsessed' with
The cardigan is on sale now in all New Look stores
Brits on staycation say they put hitting seaside and countryside pubs as their priority - despite number of UK venues falling to lowest level in a century
Staycationing Brits have revealed visiting a pub is a priority - despite the number of watering holes plummeting to their lowest in the last hundred years.
CHRISTOPHER STEVENS reviews Wynne And Joanna: All At Sea on BBC1: It left a sour taste like stale chips
My skin was crawling as, teaming up with him for a boat trip, Gavin & Stacey star Joanna Page told us what a hilarious guy Wynne Evans was, and how his jokes had her constantly in stitches.
Kevin Costner gets revenge on Yellowstone creator Taylor Sheridan in subtle way with new TV series
Kevin Costner's latest project is raising eyebrows, with many wondering if he's subtly capitalizing on his Yellowstone fame after his fallout with the show's creator, Taylor Sheridan.
Just Stop Oil protester, 78, with 'too thin' wrists must serve extra time behind bars for M25 blockade after officials failed to find a tracking tag that fitted her
Gaie Delap, 78, was jailed for 20 months in August for taking part in a blockade on the M25 in November 2022 that caused chaos on Britain's busiest motorway.