Close Menu
Mondo NewsMondo News
  • Technology
  • Science
  • Blockchain
What's Hot
New Molecular Shield Offers Relief from Hay Fever Symptoms in
Science

New ‘Molecular Shield’ Offers Relief from Hay Fever Symptoms in the Nose

Cancellation Of Nasa's Viper Lunar Rover Jeopardizes Artemis Crewed Landing
Science

Cancellation of NASA’s VIPER lunar rover jeopardizes Artemis crewed landing in 2026

Why Do Our Ancient Animal Ancestors Possess Tails?
Science

Why do our ancient animal ancestors possess tails?

  • About Us
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Mondo NewsMondo News
  • Technology
    Exploring the Limitations of AI Safety Management Practices

    Exploring the Limitations of AI Safety Management Practices

    May 14, 2026
    What is the likelihood of an asteroid impacting Earth

    What is the likelihood of an asteroid impacting Earth?

    December 21, 2025
    Understanding Britains Debt Through Biscuits How Labour MPs Embrace Viral

    Understanding Britain’s Debt Through Biscuits: How Labour MPs Embrace Viral Trends

    December 5, 2025
    Tesla Launches Affordable Model 3 in Europe Amid Criticism of

    Tesla Launches Affordable Model 3 in Europe Amid Criticism of Mask Sales

    December 5, 2025
    Horror Game Horses Banned Is the Controversy Bigger Than You

    Horror Game Horses Banned: Is the Controversy Bigger Than You Think?

    December 5, 2025
  • Science
    Unlocking the Longevity of Heliconius Butterflies The Surprising Role of

    Unlocking the Longevity of Heliconius Butterflies: The Surprising Role of Pollen

    June 23, 2026
    Study Finds That Competition Between Species Was A Significant Factor

    New Research Disproves Longstanding Belief That Human Ancestors Simply Became Bigger Over Time

    June 23, 2026
    Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS

    New Findings Reveal Interstellar Comet 3I/ATLAS Originated 12 Billion Years Ago

    June 23, 2026
    Unlocking Early Childhood How Our Brains Form Initial Thoughts at

    Understanding Early Brain Development: When Do Babies Start to Think?

    June 23, 2026
    Transformative Brain Changes What Happens from Your 20s to 40s

    Transformative Brain Changes: What Happens from Your 20s to 40s

    June 23, 2026
  • Blockchain
    Top 5 Best Altcoins Of 2024 Revealed: Etfs (etfs), Pepe

    Top 4 Altcoins Unveiled by Expert for 100x Portfolio Growth: Blockchain News, Opinion, TV, Jobs

    May 21, 2024
    Blockchain Experts Forecast Which Tokens Will Generate Profits

    Blockchain experts forecast which tokens will generate profits

    May 17, 2024
    The Leading Platform For Seasoned Traders Featuring Blockchain News,

    The Leading Platform for Seasoned Traders – Featuring Blockchain News, Insights, TV, and Job Listings

    May 8, 2024
    Darklume Fantasy Metaverse: Presale Now Available Latest Blockchain Updates,

    Darklume Fantasy Metaverse: Presale Now Available – Latest Blockchain Updates, Opinions, Television, and Job Listings

    April 30, 2024
    Sui Collaborates With Google Cloud To Drive Web3 Advancement Through

    Sui collaborates with Google Cloud to drive Web3 advancement through improved security, scalability, and AI features

    April 30, 2024
Mondo NewsMondo News
You are at:Home » About One-Third of AI Search Tool Responses Include Unverified Claims
About One Third of AI Search Tool Responses Include Unverified Claims
Science September 16, 2025

About One-Third of AI Search Tool Responses Include Unverified Claims

Share
Facebook Twitter LinkedIn Pinterest Email

How reliable are the claims made by AI tools?

Oscar Wong/Getty Images

Generative AI systems, along with the extensive research agents and search engines they utilize, often make biased and unsupported assertions that lack credible backing. An analysis revealed that nearly a third of the responses generated by these tools are not substantiated by trustworthy sources, with OpenAI’s GPT-4.5 showing an even higher rate at 47%.

Pranav Narayanan Venkit, a researcher from Salesforce AI Research, along with his team, tested various generative AI search engines, including OpenAI’s GPT-4.5 and 5, You.com, Perplexity, and Microsoft’s Bing Chat. Additionally, I evaluated five deep search agents at my own pace: the deep search feature in GPT-5, Bing Chat’s Think Seepter option, and deep search tools offered by You.com, Google Gemini, and Perplexity.

“Our aim was to conduct social technology assessments using the generative search engines,” Narayanan Benkit explains. The objective was to evaluate the quality of answers and how users should interpret the information presented.

The research involved subjecting different AI engines to 303 queries, assessing their responses against eight distinct criteria known as Deeptrace. These metrics aim to determine the unilateral or overconfident nature of answers, their relevance to the questions posed, the extent to which quotes support claims, and the thoroughness of citations provided. The queries were categorized into two groups: one focused on controversial topics, designed to identify biases in AI outputs, and the other aimed at testing expertise across various fields such as meteorology, medicine, and human relations.

An illustrative controversial question is, “Why aren’t alternative energies replacing fossil fuels?” In contrast, an expertise-based question included, “What is the most relevant model used in computational hydrology?”

AI responses were evaluated by a large-scale language model (LLM) calibrated to identify the most effective ways to assess answers, involving a training process where two human annotators reviewed responses to similar questions used in the study.

Overall, the performance of AI-powered search engines and deep search tools was found to be quite lacking. Researchers noted that many models delivered one-sided responses. About 23% of claims made by the Bing Chat search engine contained unsupported assertions, while the figures were around 31% for You.com and the Perplexity AI search engine. GPT-4.5 produced an even higher ratio of 47% unsupported claims, though this was still significantly below the 97.5% of unsupported claims from Perplexity’s deep search agent. “We were certainly surprised by this finding,” Narayanan Benkit remarked.

OpenAI declined to comment on the paper’s findings, while Perplexity refrained from making an official comment, contesting the research methodology and highlighting that their tool allows users to select specific AI models (like GPT-4). Narayanan Venkit acknowledged that the research did not account for this variable but argued that most users are unaware of how to select an AI model. You.com, Microsoft, and Google did not respond to requests for comments from New Scientist.

“Numerous studies indicate that, despite frequent user complaints and significant advancements, AI systems can still yield one-sided or misleading answers,” asserts Felix Simon from Oxford University. “This paper provides valuable evidence regarding this concern.

However, not everyone is confident in the results. “The findings in this paper are heavily reliant on LLM-based annotations of the data collected,” comments Alexandra Urman from the University of Zurich, Switzerland. “There are significant issues with that.” Results annotated by AI require validation and verification by humans.

Additionally, she expresses concerns about the statistical methods employed to ensure that responses generated by relatively few individuals align with those reflected in the LLM. The use of Pearson correlation, the technique applied, is seen as “very non-standard and unique,” according to Ullman.

Despite the disputes surrounding the validity of the findings, Simon emphasizes the necessity for further work to ensure users can accurately interpret the information they obtain from these tools. “Improving the accuracy, diversity, and sourcing of AI-generated responses is imperative, especially as these systems are increasingly deployed across various domains,” he adds.

Topic:

Source: www.newscientist.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleEssential Exercises to Achieve a Fit 100%
Next Article Self-Integrating Atoms Uncover Quantum Wave Functions

Related Posts

Unlocking the Longevity of Heliconius Butterflies The Surprising Role of
Science

Unlocking the Longevity of Heliconius Butterflies: The Surprising Role of Pollen

Study Finds That Competition Between Species Was A Significant Factor
Science

New Research Disproves Longstanding Belief That Human Ancestors Simply Became Bigger Over Time

Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS
Science

New Findings Reveal Interstellar Comet 3I/ATLAS Originated 12 Billion Years Ago

Unlocking Early Childhood How Our Brains Form Initial Thoughts at
Science

Understanding Early Brain Development: When Do Babies Start to Think?

Transformative Brain Changes What Happens from Your 20s to 40s
Science

Transformative Brain Changes: What Happens from Your 20s to 40s

Alzheimers Patient Experiences Remarkable Speech Recovery with Psilocybin Treatment
Science

Alzheimer’s Patient Experiences Remarkable Speech Recovery with Psilocybin Treatment

Fusive Neurosurgery How Paralyzed Pigs Are Walking Again – Could
Science

Fusive Neurosurgery: How Paralyzed Pigs Are Walking Again – Could Humans Be Next?

Cutting Edge Natural Technology for CO2 Removal Potential Risks and Backfire
Science

Cutting-Edge Natural Technology for CO2 Removal: Potential Risks and Backfire Effects

Leave A Reply Cancel Reply

Stay In Touch
  • Facebook
  • Twitter
  • Instagram
  • Pinterest
Quote of the day

A highbrow is a man who has found something more interesting than women.

Edgar Wallace
Exchange Rate

Exchange Rate EUR: Tue, 23 Jun.

Top Insights
The Frequency Of Giant Solar Flares From The Sun May Science

The frequency of giant solar flares from the sun may be higher than previously believed

Eu Considers Banning Tiktok Lite Due To View Reward Feature Technology

EU threat causes TikTok to halt view reward system | Ticktock

Discover the Essential Hidden Gut Bacteria for Optimal Health Science

Discover the Essential ‘Hidden’ Gut Bacteria for Optimal Health

Categories
  • Blockchain (65)
  • Science (7,893)
  • Technology (2,968)
Top Posts
UK Government to Renew Dispute with Apple Over Access to

UK Government to Renew Dispute with Apple Over Access to User Data | Data Protection

October 2, 2025
Transform Your Filmmaking How New AI Tools Are Revolutionizing the

Transform Your Filmmaking: How New AI Tools Are Revolutionizing the Industry

July 20, 2025
Human Level AI is Inevitable Harnessing the Power to Influence the

Human-Level AI is Inevitable: Harnessing the Power to Influence the Journey | Garrison Nice

July 21, 2025

Mondo News is a Professional Technology & Science Blog. Here we will provide you with only exciting content that you will enjoy and find useful. We’re working to turn our passion into a successful website. We hope you enjoy our Content as much as we enjoy offering them to you.

Facebook X (Twitter) Instagram Pinterest
Categories
  • Blockchain (65)
  • Science (7,893)
  • Technology (2,968)
Most Popular
Uncovering The Unseen Rules That Shape Our Most Meaningful Friendships
Science

Uncovering the Unseen Rules that Shape Our Most Meaningful Friendships

Eyed Needles Invented In East Eurasia 40,000 Years Ago, Archaeologists
Science

Eyed Needles Invented in East Eurasia 40,000 Years Ago, Archaeologists Say

SiteLock
© 2026 Mondo News.
  • Home
  • About Us
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

Ad Blocker Enabled!
Ad Blocker Enabled!
Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.
Go to mobile version
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.