Close Menu
Mondo NewsMondo News
  • Technology
  • Science
  • Blockchain
What's Hot
5000 Year Old Cave Ice Reveals Multidrug Resistant Bacterial Strain A Groundbreaking Discovery
Science

5,000-Year-Old Cave Ice Reveals Multidrug-Resistant Bacterial Strain: A Groundbreaking Discovery

Deepfakes are harder to spot: now they even have a
Science

Deepfakes Are Harder to Spot: Now They Even Have a Heartbeat

Activists Advocate For Public Transparency Of Ride Hailing App Data To
Technology

Activists advocate for public transparency of ride-hailing app data to tackle exploitation and reduce emissions | Gig Economy

  • About Us
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Mondo NewsMondo News
  • Technology
    Exploring the Limitations of AI Safety Management Practices

    Exploring the Limitations of AI Safety Management Practices

    May 14, 2026
    What is the likelihood of an asteroid impacting Earth

    What is the likelihood of an asteroid impacting Earth?

    December 21, 2025
    Understanding Britains Debt Through Biscuits How Labour MPs Embrace Viral

    Understanding Britain’s Debt Through Biscuits: How Labour MPs Embrace Viral Trends

    December 5, 2025
    Tesla Launches Affordable Model 3 in Europe Amid Criticism of

    Tesla Launches Affordable Model 3 in Europe Amid Criticism of Mask Sales

    December 5, 2025
    Horror Game Horses Banned Is the Controversy Bigger Than You

    Horror Game Horses Banned: Is the Controversy Bigger Than You Think?

    December 5, 2025
  • Science
    Ancient Human Habitation Uncovered at 2000 Meters Experts Stunned by

    Ancient Human Habitation Uncovered at 2,000 Meters: Experts Stunned by Mountain Discovery

    June 2, 2026
    7 Reasons We Overtrust AI and the Hidden Costs Were

    7 Reasons We Overtrust AI and the Hidden Costs We’re Already Facing

    June 2, 2026
    Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS

    Webb Space Telescope Discovers Methane in Interstellar Comet 3I/ATLAS

    June 2, 2026
    Newly Discovered Axolotl Fossil Unearthed in Mexico

    Newly Discovered Axolotl Fossil Unearthed in Mexico

    June 2, 2026
    Breakthrough Pancreatic Cancer Drug Doubles Survival Rates A Revolutionary Treatment

    Breakthrough Pancreatic Cancer Drug Doubles Survival Rates: A Revolutionary Treatment

    June 2, 2026
  • Blockchain
    Top 5 Best Altcoins Of 2024 Revealed: Etfs (etfs), Pepe

    Top 4 Altcoins Unveiled by Expert for 100x Portfolio Growth: Blockchain News, Opinion, TV, Jobs

    May 21, 2024
    Blockchain Experts Forecast Which Tokens Will Generate Profits

    Blockchain experts forecast which tokens will generate profits

    May 17, 2024
    The Leading Platform For Seasoned Traders Featuring Blockchain News,

    The Leading Platform for Seasoned Traders – Featuring Blockchain News, Insights, TV, and Job Listings

    May 8, 2024
    Darklume Fantasy Metaverse: Presale Now Available Latest Blockchain Updates,

    Darklume Fantasy Metaverse: Presale Now Available – Latest Blockchain Updates, Opinions, Television, and Job Listings

    April 30, 2024
    Sui Collaborates With Google Cloud To Drive Web3 Advancement Through

    Sui collaborates with Google Cloud to drive Web3 advancement through improved security, scalability, and AI features

    April 30, 2024
Mondo NewsMondo News
You are at:Home » Despite Advances in Technology, AI Hallucinations Are Intensifying
Despite advances in technology, ai hallucinations are intensifying
Technology May 5, 2025

Despite Advances in Technology, AI Hallucinations Are Intensifying

Share
Facebook Twitter LinkedIn Pinterest Email

Last month, AI bots managing technical support for cursors, emerging tools for computer programmers, informed numerous customers about alterations to the company’s policy. They stated that using cursors on a different computer was no longer permitted.

In a frustrated post on the Internet Message Board, a customer expressed their discontent. Some users even canceled their cursor accounts, and others were irate upon discovering the misunderstanding. AIBOT had mentioned a non-existent policy change.

“Such a policy does not exist. Users can indeed utilize their cursor across multiple devices.” I posted on Reddit. “Regrettably, this is an inaccurate response from the AI support bot.”

Two years post the launch of CHATGPT, tech companies, office workers, and everyday users have increasingly turned to AI bots for a diverse array of tasks. Yet, there remains no reliable mechanism to guarantee the accuracy of the information these systems provide.

The latest advanced technologies—so-called inference systems from firms like OpenAI, Google, and the Chinese startup Deepseek—are producing fewer errors. The connection to factuality has sharpened as the mathematical capabilities have enhanced. The exact reason for this improvement remains somewhat unclear.

Contemporary AI bots are built upon intricate mathematical structures that learn by analyzing vast amounts of digital data. They lack the ability to discern truth from falsehood. Sometimes, they fabricate information, leading some AI researchers to describe it as ‘hallucination.’ In one assessment, the hallucination rate for the new AI system reached 79%.

These models utilize mathematical probabilities to deduce the most appropriate response instead of adhering strictly to guidelines established by human engineers. Thus, errors are inevitable. “Despite our efforts, hallucination will always persist,” said Amr Awadallah, CEO of Vectara, a startup developing AI tools for enterprises and a former Google executive. “It’s unavoidable.”

For years, this issue has raised doubts concerning the reliability of these systems. While they can be beneficial in specific contexts, such as drafting term papers, summarizing office documents, or coding, their inaccuracies pose significant challenges.

AI bots integrated with search engines like Google or Bing can generate laughable and erroneous search results. If you inquire about a popular marathon on the West Coast, they might point you to a race in Philadelphia. When asked for household statistics in Illinois, they could cite a source that doesn’t contain that information.

While these hallucinations may not significantly affect many users, they present serious concerns for those relying on technology for legal documents, medical data, or sensitive business information.

“We invest substantial time discerning which responses are factual and which are not,” remarked Pratik Verma, co-founder and CEO of Okaff, a firm assisting businesses in navigating hallucination issues. “If these inaccuracies are not adequately addressed, the value of an AI system diminishes. The goal is to automate tasks.”

Cursor and Truell did not respond to requests for comments.

Over the past two years, firms such as OpenAI and Google have consistently enhanced their AI systems and decreased the frequency of these errors. However, the latest inference systems are showing an uptick in mistakes. According to internal evaluations, OpenAI’s newest systems hallucinate more often than their predecessors.

The company determined that O3 (its most advanced system) exhibited a 33% hallucination rate during the PersonQA benchmark tests, which involve answering questions about public figures—over twice the hallucination rate of their previous inference system named O1. The newly released O4-MINI showed an even steeper hallucination rate of 48%.

Another evaluation, SimpleQA, which poses more generalized questions, revealed hallucination rates of 51% and 79% for O3 and O4-MINI, respectively, while the earlier system, O1, came in at 44%.

In a paper outlining the tests, OpenAI noted that further research is required to understand these results. Given that AI systems learn from more data than a human can process, it is challenging for technicians to discern their behavior.

“Hallucination is not inherently common in reasoning models, but we are actively striving to decrease the percentage of hallucinations observed in O3 and O4-MINI,” Gaby Raila commented. “We will continue our exploration of hallucinations across all models to enhance accuracy and reliability.”

Hannane Hajisiltzi, a professor at the University of Washington and a researcher at the Allen Institute of Artificial Intelligence, is part of a team that recently developed methods to monitor the behavior of these systems. Trained individual data allows for some tracking. Nevertheless, this tool cannot clarify everything because the systems learn from a vast dataset capable of generating almost any output. “We still do not fully understand how these models operate,” she remarked.

Tests by independent organizations and researchers reveal that inference models from companies including Google and Deepseek are also showing rising hallucination rates.

Since late 2023, Vectara, Awadallah’s company, has been monitoring how frequently chatbots deviate from the truth. They assign these systems simple, verifiable tasks, such as summarizing particular news articles, yet chatbots continually fabricate information.

Initial surveys by Vectara estimated that, in this context, chatbots presented incorrect information at least 3% of the time and sometimes as high as 27%.

Over the next eighteen months, companies like OpenAI and Google reduced these figures to a range of 1% to 2%. Startups in San Francisco, such as Humanity, floated around 4%. Nevertheless, hallucination rates for this assessment have been rising alongside the advancement of inference systems. Deepseek’s reasoning model, R1, hallucinated 14.3% of the time, while OpenAI’s O3 reached 6.8%.

(The New York Times has filed a lawsuit against OpenAI and its partner Microsoft, claiming copyright infringement over news content related to AI systems. Both OpenAI and Microsoft have denied these allegations.)

For years, companies like OpenAI operated under the simplistic assumption that feeding more internet data into AI systems would enhance performance. However, they eventually exhausted nearly all online English text and required alternative methods to improve their chatbots.

Consequently, these companies are increasingly adopting what scientists refer to as reinforcement learning. In this approach, the system learns through trial and error, proving effective in specific domains like mathematics and computer programming, but lacking in others.

“The training approach for these systems tends to focus on one task while neglecting others,” commented Laura Perez-Bertracini, a researcher at the University of Edinburgh, who is part of a team investigating hallucination issues in depth.

Another drawback is that inference models are crafted to spend time “thinking” through complex problems before reaching answers. Consequently, as they solve problems step by step, they risk hallucination at each stage. Errors can compound as they linger over them.

The latest bots transparently reveal each step to users, meaning users can witness each mistake made. Researchers often assert that the steps indicated by bots are unrelated to the final answer.

“The system’s perception of ‘thinking’ does not necessarily equate to actual cognitive processing,” remarked Aryo Pradipta Gema, an AI researcher and fellow at the University of Edinburgh.

Source: www.nytimes.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleIndia is Paving the Way for Solar Panel Production for Itself and the World.
Next Article Elon Musk, His 16-Foot Barrier, and the Ongoing Dispute with His Texas Neighbor

Related Posts

Is the Arctic Ocean Mitigating or Intensifying Global Warming –
Science

Is the Arctic Ocean Mitigating or Intensifying Global Warming? – Cyworthy

Exploring the Limitations of AI Safety Management Practices
Technology

Exploring the Limitations of AI Safety Management Practices

Unexpected Ways Hi Fi Technology is Built for the Future
Science

Unexpected Ways Hi-Fi Technology is Built for the Future

Monkeys Explore a Virtual World Solely Through Thought Control A
Science

Monkeys Explore a Virtual World Solely Through Thought Control: A Breakthrough in Mind-Driven Technology

Unlocking the ABC Conjecture A Pioneering Project to Solve Controversial
Science

Unlocking the ABC Conjecture: A Pioneering Project to Solve Controversial Mathematical Proofs with Computer Technology

Revolutionary Experiment Uncovers Major Unexpected Issues in Cloning Technology
Science

Revolutionary Experiment Uncovers Major Unexpected Issues in Cloning Technology

Unlocking Quantum Computing How an 1980s Niche Technology Could Revolutionize
Science

Unlocking Quantum Computing: How an 1980s Niche Technology Could Revolutionize the Future

Breakthrough Discovery Loophole Enables Quantum Cloning Technology
Science

Breakthrough Discovery: Loophole Enables Quantum Cloning Technology

Leave A Reply Cancel Reply

Stay In Touch
  • Facebook
  • Twitter
  • Instagram
  • Pinterest
Quote of the day

A good traveler has no fixed plans, and is not intent on arriving.

Lao Tzu
Exchange Rate

Exchange Rate EUR: Tue, 2 Jun.

Top Insights
Meta Removes Limitations On President Trump's Access To Facebook And Technology

Meta removes limitations on President Trump’s access to Facebook and Instagram accounts

The Top 10 Deadliest Spiders On Earth Science

The Top 10 Deadliest Spiders on Earth

Elon Musk Confirms Tesla Shareholders To Vote On $56 Billion Technology

Elon Musk Confirms Tesla Shareholders to Vote on $56 Billion Compensation Package

Categories
  • Blockchain (65)
  • Science (7,685)
  • Technology (2,968)
Top Posts
UK Government to Renew Dispute with Apple Over Access to

UK Government to Renew Dispute with Apple Over Access to User Data | Data Protection

October 2, 2025
Ai Invents New Battery Design That Decreases Lithium Usage By

AI invents new battery design that decreases lithium usage by 70%

January 9, 2024
Human Level AI is Inevitable Harnessing the Power to Influence the

Human-Level AI is Inevitable: Harnessing the Power to Influence the Journey | Garrison Nice

July 21, 2025

Mondo News is a Professional Technology & Science Blog. Here we will provide you with only exciting content that you will enjoy and find useful. We’re working to turn our passion into a successful website. We hope you enjoy our Content as much as we enjoy offering them to you.

Facebook X (Twitter) Instagram Pinterest
Categories
  • Blockchain (65)
  • Science (7,685)
  • Technology (2,968)
Most Popular
Mehta's Reversal Of The Decision To Remove Two Videos About
Technology

Mehta’s reversal of the decision to remove two videos about the Israel-Hamas war

The Us's Top 10 Most Dangerous Cities
Science

The US’s Top 10 Most Dangerous Cities

SiteLock
© 2026 Mondo News.
  • Home
  • About Us
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

Ad Blocker Enabled!
Ad Blocker Enabled!
Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.
Go to mobile version
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.