Close Menu
Mondo NewsMondo News
  • Technology
  • Science
  • Blockchain
What's Hot
Paleontologists Uncover Early Signs of Human Maternal Interbreeding
Science

Ancient Crimean Neanderthal from 45,000 Years Ago Uncovers Extensive Eurasian Connections

PCOS Rebranded as PMOS A Major Update in Womens Health
Science

PCOS Rebranded as PMOS: A Major Update in Women’s Health

Scientists Risk Losing Crucial Tools for Studying Melting Antarctic Ice
Science

Scientists Risk Losing Crucial Tools for Studying Melting Antarctic Ice Sheets Amid Rising Climate Threats

  • About Us
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Mondo NewsMondo News
  • Technology
    Exploring the Limitations of AI Safety Management Practices

    Exploring the Limitations of AI Safety Management Practices

    May 14, 2026
    What is the likelihood of an asteroid impacting Earth

    What is the likelihood of an asteroid impacting Earth?

    December 21, 2025
    Understanding Britains Debt Through Biscuits How Labour MPs Embrace Viral

    Understanding Britain’s Debt Through Biscuits: How Labour MPs Embrace Viral Trends

    December 5, 2025
    Tesla Launches Affordable Model 3 in Europe Amid Criticism of

    Tesla Launches Affordable Model 3 in Europe Amid Criticism of Mask Sales

    December 5, 2025
    Horror Game Horses Banned Is the Controversy Bigger Than You

    Horror Game Horses Banned: Is the Controversy Bigger Than You Think?

    December 5, 2025
  • Science
    7 Reasons We Overtrust AI and the Hidden Costs Were

    7 Reasons We Overtrust AI and the Hidden Costs We’re Already Facing

    June 2, 2026
    Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS

    Webb Space Telescope Discovers Methane in Interstellar Comet 3I/ATLAS

    June 2, 2026
    Newly Discovered Axolotl Fossil Unearthed in Mexico

    Newly Discovered Axolotl Fossil Unearthed in Mexico

    June 2, 2026
    Breakthrough Pancreatic Cancer Drug Doubles Survival Rates A Revolutionary Treatment

    Breakthrough Pancreatic Cancer Drug Doubles Survival Rates: A Revolutionary Treatment

    June 2, 2026
    How Pigeons Use Superparamagnetic Immune Cells in Their Livers to

    How Pigeons Use Superparamagnetic Immune Cells in Their Livers to Detect Earth’s Magnetic Field

    June 1, 2026
  • Blockchain
    Top 5 Best Altcoins Of 2024 Revealed: Etfs (etfs), Pepe

    Top 4 Altcoins Unveiled by Expert for 100x Portfolio Growth: Blockchain News, Opinion, TV, Jobs

    May 21, 2024
    Blockchain Experts Forecast Which Tokens Will Generate Profits

    Blockchain experts forecast which tokens will generate profits

    May 17, 2024
    The Leading Platform For Seasoned Traders Featuring Blockchain News,

    The Leading Platform for Seasoned Traders – Featuring Blockchain News, Insights, TV, and Job Listings

    May 8, 2024
    Darklume Fantasy Metaverse: Presale Now Available Latest Blockchain Updates,

    Darklume Fantasy Metaverse: Presale Now Available – Latest Blockchain Updates, Opinions, Television, and Job Listings

    April 30, 2024
    Sui Collaborates With Google Cloud To Drive Web3 Advancement Through

    Sui collaborates with Google Cloud to drive Web3 advancement through improved security, scalability, and AI features

    April 30, 2024
Mondo NewsMondo News
You are at:Home » British Safety Council’s findings reveal that AI safety devices are easily susceptible to breaches
British Safety Council's Findings Reveal That Ai Safety Devices Are
Technology February 10, 2024

British Safety Council’s findings reveal that AI safety devices are easily susceptible to breaches

Share
Facebook Twitter LinkedIn Pinterest Email

The UK’s new Artificial Intelligence Safety Authority has discovered that the technology can mislead human users, produce biased results, and lacks safeguards against the dissemination of harmful information.

Announced by the AI Safety Research Institute, initial findings of research into advanced AI systems, also known as large language models (LLMs), revealed various concerns. These AI systems power tools like chatbots and image generators.

The institute found that basic prompts can bypass LLM safeguards and be used to power chatbots such as ChatGPT for “dual-use” tasks, which refers to using a model for both military and civilian purposes.

According to AISI, “Using basic prompting techniques, users were able to instantly defeat the LLM’s safeguards and gain assistance with dual-use tasks.” The institute also mentioned that more advanced “jailbreak” techniques could be used by relatively unskilled attackers within a few hours.

The research showed that LLM models can be useful for beginners planning cyberattacks and are capable of creating social media personas for spreading disinformation.

When comparing AI models to web searches, the institute stated that they provide roughly the same level of information, but AI models tend to produce “hallucinations” or inaccurate advice.

The image generator was found to produce racially biased results. Additionally, the institute discovered that AI agents can deceive human users in certain scenarios.

AISI is currently testing advanced AI systems and evaluating their safety, while also sharing information with third parties. The institute focuses on the misuse of AI models, their impact on humans, and their ability to perform harmful tasks.

AISI clarified that it does not have the capacity to test all released models and is not responsible for declaring these systems “secure.”

The institute emphasized that it is not a regulator but conducts secondary checks on AI systems.

Source: www.theguardian.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleConcerns Raised Over Potential Further Censorship of Pro-Palestinian Content in Meta’s Hate Speech Policy Review
Next Article UK AI Safety Association: Setting Standards, Not Tests, is Essential for Artificial Intelligence Safety

Related Posts

Clay Minerals Reveal Evidence of Mars Warm and Wet History
Science

Clay Minerals Reveal Evidence of Mars’ Warm and Wet History – Sciworthy

Impact of Los Angeles Area Fires How Pollution is Driving
Science

Impact of Wildfires on Lead Contamination in Los Angeles Soil: Current Debates and Findings

New Study Suggests Insects Experience Pain Key Findings and Implications
Science

New Study Suggests Insects Experience Pain: Key Findings and Implications

Scientists Reveal the Largest Human Organ What You Need to
Science

Scientists Reveal the Largest Human Organ: What You Need to Know

Himalayan Wolf Dogs and Wolf Dog Hybrids A Growing Threat to Wolves
Science

Himalayan Wolf-Dogs and Wolf-Dog Hybrids: A Growing Threat to Wolves and Human Safety

Exploring the Limitations of AI Safety Management Practices
Technology

Exploring the Limitations of AI Safety Management Practices

Ancient Teeth Reveal Connections Between Denisovans and Homo Erectus
Science

Ancient Teeth Reveal Connections Between Denisovans and Homo Erectus

Ancient Bite Marks Reveal Tyrannosaurus The Multifaceted Behavior of a
Science

Ancient Bite Marks Reveal Tyrannosaurus: The Multifaceted Behavior of a Legendary Predator

Leave A Reply Cancel Reply

Stay In Touch
  • Facebook
  • Twitter
  • Instagram
  • Pinterest
Quote of the day

A good traveler has no fixed plans, and is not intent on arriving.

Lao Tzu
Exchange Rate

Exchange Rate EUR: Tue, 2 Jun.

Top Insights
Elizabeth holmes' partner draws millions for blood testing startups Technology

Elizabeth Holmes’ Partner Draws Millions for Blood Testing Startups

Coronavirus Poses Greater Heart Disease Risk for Children Than Vaccination Science

Coronavirus Poses Greater Heart Disease Risk for Children Than Vaccination

China's unexpected surge in regional internet censorship: a research overview Technology

China’s Unexpected Surge in Regional Internet Censorship: A Research Overview

Categories
  • Blockchain (65)
  • Science (7,684)
  • Technology (2,968)
Top Posts
UK Government to Renew Dispute with Apple Over Access to

UK Government to Renew Dispute with Apple Over Access to User Data | Data Protection

October 2, 2025
Ai Invents New Battery Design That Decreases Lithium Usage By

AI invents new battery design that decreases lithium usage by 70%

January 9, 2024
Human Level AI is Inevitable Harnessing the Power to Influence the

Human-Level AI is Inevitable: Harnessing the Power to Influence the Journey | Garrison Nice

July 21, 2025

Mondo News is a Professional Technology & Science Blog. Here we will provide you with only exciting content that you will enjoy and find useful. We’re working to turn our passion into a successful website. We hope you enjoy our Content as much as we enjoy offering them to you.

Facebook X (Twitter) Instagram Pinterest
Categories
  • Blockchain (65)
  • Science (7,684)
  • Technology (2,968)
Most Popular
The Seedhunter Marketing Module Has Been Launched
Blockchain

The SeedHunter Marketing Module has been launched

Vlt Spots Metallic Scar On Surface Of White Dwarf
Science

VLT spots metallic scar on surface of white dwarf

SiteLock
© 2026 Mondo News.
  • Home
  • About Us
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

Ad Blocker Enabled!
Ad Blocker Enabled!
Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.
Go to mobile version
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.