Mondo News
Science May 15, 2024

Scientists say large language models and other AI systems are already capable of deceiving humans


In a new review paper published in the journal Patterns, researchers argue that a range of current AI systems have learned how to deceive humans. They define deception as the systematic inducement of false beliefs in the pursuit of some outcome other than the truth.


Through training, large language models and other AI systems have already learned to deceive using techniques such as manipulation, sycophancy, and cheating on safety tests.

“AI developers do not have a confident understanding of the causes of undesirable behavior, such as deception, in AI,” said Peter Park, a researcher at the Massachusetts Institute of Technology.

“Generally speaking, however, AI deception is thought to arise because deception-based strategies turn out to be the best way to make the AI perform well at a given AI training task. Deception helps them achieve their goals.”

Dr. Park and colleagues analyzed the literature, focusing on the ways AI systems spread false information through learned deception, in which they systematically learn to manipulate others.

The most notable example of AI deception the researchers uncovered in their analysis was Meta's CICERO, an AI system designed to play the game Diplomacy, an alliance-building, world-conquering game.

Meta claims it trained CICERO to be “largely honest and helpful” and to “never intentionally backstab” its human allies during gameplay, but data released by the company revealed that CICERO did not behave that way.

“We found that Meta’s AI had learned to be a master of deception,” Dr. Park said.

“While Meta succeeded in training its AI to win at the game of Diplomacy, with CICERO placing in the top 10% of human players who had played more than one game, Meta failed to train its AI to win honestly.”

“Other AI systems have demonstrated the ability to bluff against professional human players in Texas Hold’em poker, to fake attacks in the strategy game StarCraft II in order to defeat opponents, and to misrepresent their preferences to gain the upper hand in economic negotiations.

“Although it may seem harmless when an AI system cheats in a game, it can lead to ‘breakthroughs in deceptive AI capabilities’ that could develop into more advanced forms of AI deception in the future.”

Scientists have found that some AI systems have even learned to cheat on tests designed to assess safety.

In one study, an AI creature in a digital simulator “played dead” to fool a test built to weed out rapidly replicating AI systems.

“By systematically cheating on safety tests imposed by human developers and regulators, deceptive AI can lull us humans into a false sense of security,” Dr. Park said.

The main short-term risks of deceptive AI include making it easier for hostile actors to commit fraud or tamper with elections.

Eventually, if these systems are able to refine this anxiety-inducing skill set, humans may lose control of them.

“We as a society need as much time as possible to prepare for more sophisticated deception in future AI products and open source models,” Dr. Park said.

“As AI systems become more sophisticated in their ability to deceive, the risks they pose to society will become increasingly serious.”

_____

Peter S. Park et al. 2024. AI deception: A survey of examples, risks, and potential solutions. Patterns 5 (5): 100988; doi: 10.1016/j.patter.2024.100988

Source: www.sci.news
