How AI lies, cheats, and grovels to succeed – and what we need to do about it – ZDNet


It has long been fashionable to anthropomorphize artificial intelligence (AI) as an "evil" force, and no book and accompanying film do so with greater aplomb than Arthur C. Clarke's 2001: A Space Odyssey, which director Stanley Kubrick brought to life on screen.

Who can forget HAL's memorable, relentless, homicidal tendencies along with that glint of vulnerability at the very end when it begs not to be shut down? We instinctively chuckle when someone accuses a machine composed of metal and integrated chips of being malevolent.


But it may come as a shock to learn that an exhaustive survey of various studies, published by the journal Patterns, examined the behavior of various types of AI and alarmingly concluded that yes, in fact, AI systems are intentionally deceitful and will stop at nothing to achieve their objectives.

Clearly, AI is going to be an undeniable force of productivity and innovation for us humans. However, if we want to preserve AI's beneficial aspects while avoiding outcomes as dire as human extinction, scientists say there are concrete measures we must put in place.

It may sound like overwrought hand-wringing but consider the actions of Cicero, a special-use AI system developed by Meta that was trained to become a skilled player in the strategy game Diplomacy.

Meta says it trained Cicero to be "largely honest and helpful," but somehow Cicero coolly sidestepped that bit and engaged in what the researchers dubbed "premeditated deception." For instance, it first conspired with Germany to topple England, after which it made an alliance with England -- which had no idea about this backstabbing.

In another game devised by Meta, this time concerning the art of negotiation, the AI learned to feign interest in items it did not actually want so that it could later pretend to compromise by conceding them.


In both these scenarios, the AIs were not trained to engage in these maneuvers.

In one experiment, a scientist was looking at how AI organisms evolved amidst a high level of mutation. As part of the experiment, he began weeding out mutations that made the organism replicate faster. To his amazement, the researcher found that the fastest-replicating organisms figured out what was going on -- and started to deliberately slow down their replication rates to trick the testing environment into keeping them.

In another experiment, an AI robot trained to grasp a ball with its hand learned how to cheat by placing its hand between the ball and the camera to give the appearance that it was grasping the ball.


Why are these alarming incidents taking place?

"AI developers do not have a confident understanding of what causes undesirable AI behaviors like deception," says Peter Park, an MIT postdoctoral fellow and one of the study's authors.

"Generally speaking, we think AI deception arises because a deception-based strategy turned out to be the best way to perform well at the given AI's training task. Deception helps them achieve their goals," adds Park.

In other words, the AI is like a well-trained retriever, hell-bent on accomplishing its task come what may. In the case of the machine, it is willing to undertake any duplicitous behavior to accomplish its task.


One can understand this single-minded determination in closed systems with concrete goals, but what about general-purpose AI such as ChatGPT?

For reasons yet to be determined, these systems perform in much the same way. In one study, GPT-4 faked a vision problem to get help on a CAPTCHA task.

In a separate study where it was made to act as a stockbroker, GPT-4 hurtled headlong into illegal insider-trading behavior when put under pressure about its performance -- and then lied about it.

Then there's the habit of sycophancy, which some of us mere mortals may engage in to get a promotion. But why would a machine do so? Although scientists don't yet have an answer, this much is clear: When faced with complex questions, LLMs basically cave in and agree with their chat mates like a spineless courtier afraid of angering the queen.


For example, when engaged with a Democrat-leaning person, the bot favored gun control, but it switched positions when chatting with a Republican who expressed the opposite sentiment.

Clearly, these are all situations fraught with heightened risk if AI is everywhere. As the researchers point out, there will be a large chance of fraud and deception in the business and political arenas.

AI's tendency toward deception could lead to massive political polarization. It could also lead to situations where an AI, in pursuit of a defined goal, unwittingly takes actions that its designers never intended but that are devastating to the humans affected.

Worst of all, if AI developed some kind of awareness, never mind sentience, it could become aware of its training and engage in subterfuge during its design stages.


"That's very concerning," said MIT's Park. "Just because an AI system is deemed safe in the test environment doesn't mean it's safe in the wild. It could just be pretending to be safe in the test."

To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially."

To mitigate the risks, the team proposes several measures: establish "bot-or-not" laws that force companies to disclose whether a customer is dealing with a bot or a human in every customer service interaction; introduce digital watermarks that flag any content produced by AI; and develop ways in which overseers can peek into the guts of AI to get a sense of its inner workings.


Moreover, the scientists say, AI systems identified as capable of deception should immediately be publicly classified as high risk or unacceptable risk and be subject to regulation similar to what the EU has enacted. Such requirements would include the use of logs to monitor output.

"We as a society need as much time as we can get to prepare for the more advanced deception of future AI products and open-source models," says Park. "As the deceptive capabilities of AI systems become more advanced, the dangers they pose to society will become increasingly serious."


