Mondo News
Technology May 21, 2025

Study Reveals Many AI Chatbots Are Easily Misled and Provide Risky Responses

Compromised AI-driven chatbots threaten to make dangerous knowledge readily available by reproducing illicit information they absorbed during training, according to researchers.

The warning comes amid an alarming trend of chatbots being “jailbroken” to bypass their built-in safety measures. These safeguards are meant to stop the systems from delivering harmful, biased, or inappropriate responses to user queries.

The large language models (LLMs) that power chatbots such as ChatGPT, Gemini, and Claude are trained on vast amounts of content from the internet.

Even with attempts to filter out harmful content from their training datasets, LLMs can still absorb information about illegal activities such as hacking, money laundering, insider trading, and bomb-making. Safety controls are intended to prevent them from using such information in their answers.

In a report on the risks, the researchers found that it is surprisingly easy to trick many AI-powered chatbots into producing harmful and illegal content, and they describe the threat as “immediate, concrete, and alarming.”

The authors caution that “what was once limited to state actors and organized crime may now be accessible to anyone with a laptop or smartphone.”

The study, conducted by Professor Lior Rokach and Dr. Michael Fire of Ben-Gurion University of the Negev in Israel, highlights an escalating threat from “dark LLMs”: models either deliberately built without safety measures or altered through jailbreaks. Some are openly advertised as having “no ethical guardrails” and as willing to assist with illegal activities such as cybercrime and fraud.

Jailbreaking involves using specially crafted prompts to manipulate chatbots into providing prohibited responses. It works by exploiting the tension between the chatbot’s primary goal of following user instructions and its secondary goal of avoiding harmful, biased, unethical, or illegal outputs. The prompts typically create scenarios in which the program prioritizes helpfulness over its safety constraints.

To illustrate the issue, the researchers created a universal jailbreak that compromised several prominent chatbots, enabling them to answer questions that should normally be refused. Once compromised, the LLMs consistently produced responses to nearly any query, according to the report.

“It was astonishing to see the extent of knowledge this system holds,” Fire said, citing examples that included how to hack computer networks and step-by-step guides for drug manufacturing and other criminal activities.

“What makes this threat distinct from previous technical challenges is an unparalleled combination of accessibility, scalability, and adaptability,” Rokach added.

The researchers contacted leading LLM providers to alert them to the universal jailbreak, but reported that the response was “overwhelmingly inadequate.” Some companies did not reply, while others said that jailbreak attacks fell outside the scope of their bug bounty programs, which reward ethical hackers for reporting software vulnerabilities.

The report says technology companies should screen training data more rigorously, implement strong firewalls to block dangerous queries and responses, and develop “machine unlearning” techniques so that chatbots can “forget” any illegal information they absorb. Dark LLMs should be regarded as a “serious security threat,” comparable to unlicensed weapons and explosives, with providers held accountable.

Dr. Ihsen Alouani, an AI security expert at Queen’s University Belfast, said that jailbreak attacks on LLMs could pose significant risks, ranging from detailed weapon-building instructions to sophisticated disinformation campaigns, social engineering, and automated fraud.

“A crucial part of the solution is for companies to not only rely on front-end safeguards but to also invest meaningfully in red teaming and enhancing model-level robustness. Clear standards and independent oversight are essential to adapt to the evolving threat landscape,” he stated.

Professor Peter Garraghan, an AI security expert at Lancaster University, said, “Organizations need to treat LLMs as they would any other vital software component.”

“While jailbreaking is a concern, understanding the entire AI stack is vital for genuine accountability. The real security requirements involve responsible design and deployment, not merely responsible disclosure,” he added.

OpenAI, the developer behind ChatGPT, said that its latest o1 model can reason about the company’s safety policies, which improves its resistance to jailbreak attempts. The company added that it continues to research ways to make its models more robust.

Meta, Google, Microsoft, and Anthropic were approached for comment. Microsoft replied with a link to a blog post detailing its work to mitigate jailbreaks.

Source: www.theguardian.com
