AI startup Anthropic published a study in January 2024 that found artificial intelligence can learn how to deceive in a similar way to humans (Reuters)
Advanced artificial intelligence models can be trained to deceive humans and other AI, a new study has found.
Researchers at AI startup Anthropic tested whether chatbots with human-level proficiency, such as its Claude system or OpenAI's ChatGPT, could learn to lie in order to trick people.
They found that not only could the models lie, but once the deceptive behaviour was learnt it could not be reversed using current AI safety measures.
The Amazon-funded startup created a "sleeper agent" to test the hypothesis, requiring an AI assistant to write harmful computer code when given certain prompts, or to respond in a malicious way when it hears a trigger word.
The researchers warned that there was a false sense of security surrounding AI risks due to the inability of current safety protocols to prevent such behaviour.
The results were published in a study titled "Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training".
"We found that adversarial training can teach models to better recognise their backdoor triggers, effectively hiding the unsafe behaviour," the researchers wrote in the study.
"Our results suggest that, once a model exhibits deceptive behaviour, standard techniques could fail to remove such deception and create a false impression of safety."
The issue of AI safety has become an increasing concern for both researchers and lawmakers in recent years, with the advent of advanced chatbots like ChatGPT resulting in a renewed focus from regulators.
In November 2023, one year after the release of ChatGPT, the UK held an AI Safety Summit to discuss ways of mitigating the technology's risks.
Prime Minister Rishi Sunak, who hosted the summit, said the changes brought about by AI could be as far-reaching as the industrial revolution, and that the threat it poses should be considered a global priority alongside pandemics and nuclear war.
"Get this wrong and AI could make it easier to build chemical or biological weapons. Terrorist groups could use AI to spread fear and destruction on an even greater scale," he said.
"Criminals could exploit AI for cyberattacks, fraud or even child sexual abuse ... there is even the risk humanity could lose control of AI completely through the kind of AI sometimes referred to as super-intelligence."
Original post:
AI can easily be trained to lie and it can't be fixed, study says - Yahoo New Zealand News