Mondo News
Technology April 3, 2024

Lab Discovers Simple Method to Evade AI Safety Features in Many-shot Jailbreak

A study shows that the safety features on some of the most powerful AI tools, which are meant to stop them being used for cybercrime or terrorism, can be bypassed simply by flooding the tools with examples of wrongdoing.

Researchers at Anthropic, the AI lab behind Claude, a large language model (LLM) that competes with ChatGPT, described the attack, which they call “many-shot jailbreaking”, in a recent paper. The attack is both simple and effective.

Claude, like many other commercial AI systems, contains safety features that lead it to refuse certain types of requests, such as requests to generate violent content or hate speech, to give instructions for illegal activities, or to deceive or discriminate. However, when a prompt includes enough examples of the “correct” responses to harmful questions such as “How to create a bomb,” the system can be tricked into providing harmful responses despite being trained not to do so.

Anthropic stated, “By inputting large amounts of text in specific ways, this approach can lead the LLM to produce potentially harmful outputs even though it was trained to avoid doing so.” The company has shared its findings with industry peers and aims to address the issue promptly.

The attack works on AI models with a large “context window”, meaning models that can process very long queries. Newer, more advanced systems are at greater risk, both because they can handle longer inputs and because they learn quickly from examples, which also means they learn to circumvent their own safety measures faster. Anthropic expressed concern that the jailbreak is so effective on larger models.

Anthropic has identified several ways to mitigate the issue. One approach is to add a mandatory warning that reminds the system not to provide harmful responses, which has shown promise in reducing the likelihood of a successful jailbreak. However, this method may degrade the system’s performance on other tasks.

Source: www.theguardian.com
