Close Menu
Mondo NewsMondo News
  • Technology
  • Science
  • Blockchain
What's Hot
The Rise Of Dinosaurs Told Through Fossilized Feces
Science

The Rise of Dinosaurs Told through Fossilized Feces

Warning from thinktank: uk must ease ai regulations or face
Technology

Warning from ThinkTank: UK must ease AI regulations or face strain on transatlantic relations

The cretaceous period larvae possessed advanced eyes
Science

The Cretaceous period larvae possessed advanced eyes

  • About Us
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Mondo NewsMondo News
  • Technology
    Exploring the Limitations of AI Safety Management Practices

    Exploring the Limitations of AI Safety Management Practices

    May 14, 2026
    What is the likelihood of an asteroid impacting Earth

    What is the likelihood of an asteroid impacting Earth?

    December 21, 2025
    Understanding Britains Debt Through Biscuits How Labour MPs Embrace Viral

    Understanding Britain’s Debt Through Biscuits: How Labour MPs Embrace Viral Trends

    December 5, 2025
    Tesla Launches Affordable Model 3 in Europe Amid Criticism of

    Tesla Launches Affordable Model 3 in Europe Amid Criticism of Mask Sales

    December 5, 2025
    Horror Game Horses Banned Is the Controversy Bigger Than You

    Horror Game Horses Banned: Is the Controversy Bigger Than You Think?

    December 5, 2025
  • Science
    Unlocking the Longevity of Heliconius Butterflies The Surprising Role of

    Unlocking the Longevity of Heliconius Butterflies: The Surprising Role of Pollen

    June 23, 2026
    Study Finds That Competition Between Species Was A Significant Factor

    New Research Disproves Longstanding Belief That Human Ancestors Simply Became Bigger Over Time

    June 23, 2026
    Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS

    New Findings Reveal Interstellar Comet 3I/ATLAS Originated 12 Billion Years Ago

    June 23, 2026
    Unlocking Early Childhood How Our Brains Form Initial Thoughts at

    Understanding Early Brain Development: When Do Babies Start to Think?

    June 23, 2026
    Transformative Brain Changes What Happens from Your 20s to 40s

    Transformative Brain Changes: What Happens from Your 20s to 40s

    June 23, 2026
  • Blockchain
    Top 5 Best Altcoins Of 2024 Revealed: Etfs (etfs), Pepe

    Top 4 Altcoins Unveiled by Expert for 100x Portfolio Growth: Blockchain News, Opinion, TV, Jobs

    May 21, 2024
    Blockchain Experts Forecast Which Tokens Will Generate Profits

    Blockchain experts forecast which tokens will generate profits

    May 17, 2024
    The Leading Platform For Seasoned Traders Featuring Blockchain News,

    The Leading Platform for Seasoned Traders – Featuring Blockchain News, Insights, TV, and Job Listings

    May 8, 2024
    Darklume Fantasy Metaverse: Presale Now Available Latest Blockchain Updates,

    Darklume Fantasy Metaverse: Presale Now Available – Latest Blockchain Updates, Opinions, Television, and Job Listings

    April 30, 2024
    Sui Collaborates With Google Cloud To Drive Web3 Advancement Through

    Sui collaborates with Google Cloud to drive Web3 advancement through improved security, scalability, and AI features

    April 30, 2024
Mondo NewsMondo News
You are at:Home » AI chatbots are incapable of diagnosing patients solely through conversation
Ai Chatbots Are Incapable Of Diagnosing Patients Solely Through Conversation
Science January 2, 2025

AI chatbots are incapable of diagnosing patients solely through conversation

Share
Facebook Twitter LinkedIn Pinterest Email

Don’t call your favorite AI “Doctor” yet

Just_Super/Getty Images

Advanced artificial intelligence models have scored highly in professional medical examinations, but they are still challenging one of the most important doctor tasks: talking to patients, gathering relevant medical information, and providing accurate diagnoses. I am still neglecting one thing.

“Large-scale language models perform well on multiple-choice tests, but their accuracy drops significantly on dynamic conversations,” he says. Pranav Rajpurkar at Harvard University. “Models especially struggle with open-ended diagnostic inference.”

This became clear when researchers developed a method to assess the reasoning ability of clinical AI models based on simulated doctor-patient conversations. “Patients” is based on 2000 medical cases drawn primarily from the United States Medical Board Specialty Examinations.

“Simulating patient interactions allows assessment of history-taking skills, which is an important element of clinical practice that cannot be assessed through case descriptions,” he says. shreya jolialso at Harvard University. The new assessment benchmark, called CRAFT-MD, “reflects real-world scenarios where patients may not know what details are important to share and may only disclose important information if prompted by specific questions. “I do,” she says.

The CRAFT-MD benchmark itself relies on AI. OpenAI's GPT-4 model acted as a “patient AI” that conversed with the “clinical AI” being tested. GPT-4 also helped score the results by comparing the clinical AI's diagnosis with the correct answer for each case. Human medical experts reconfirmed these assessments. We also reviewed the conversations to confirm the accuracy of the patient AI and whether the clinical AI was able to gather relevant medical information.

Multiple experiments have shown that the performance of four major large-scale language models (OpenAI's GPT-3.5 and GPT-4 models, Meta's Llama-2-7b model, and Mistral AI's Mistral-v2-7b model) is performance on benchmarks was shown to be significantly lower than at the time. Makes a diagnosis based on a written summary of the case. OpenAI, Meta, and Mistral AI did not respond to requests for comment.

For example, GPT-4's diagnostic accuracy was an impressive 82 percent when a structured case summary was presented and the diagnosis could be selected from a list of multiple-choice answers, but not when a multiple-choice option was provided. However, when it had to make a diagnosis from a simulated patient conversation, its accuracy dropped to just 26%.

And GPT-4 performs best among the AI ​​models tested in this study, with GPT-3.5 often coming in second place, and Mistral AI models sometimes coming in second or third place. Meta's Llama models generally had the lowest scores.

AI models also failed to collect complete medical histories a significant proportion of the time, with the leading model, GPT-4, only able to do so in 71% of simulated patient conversations. Even if an AI model collects a patient's relevant medical history, it doesn't necessarily yield the correct diagnosis.

It says such simulated patient conversations are a “much more useful” way to assess an AI's clinical reasoning ability than medical tests. Eric Topol At the Scripps Research Institute Translational Institute in California.

Even if an AI model ultimately passes this benchmark and consistently makes accurate diagnoses based on conversations with simulated patients, it won't necessarily be better than a human doctor. says Rajpurkar. He points out that real-world medical procedures are “more troublesome” than simulations. That includes managing multiple patients, coordinating with medical teams, performing physical exams, and understanding the “complex social and systemic factors” in the local health care setting.

“While the strong performance in the benchmarks suggests that AI may be a powerful tool to support clinical practice, it does not necessarily replace the holistic judgment of experienced physicians.” says Rajpurkar.

topic:

Source: www.newscientist.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleElon Musk Urges Labor MP to Address Tommy Robinson’s Anger
Next Article A Study on the Unique Variety of Camellia sinensis Found in the Tea Plant of Hainan Island

Related Posts

Unlocking the Longevity of Heliconius Butterflies The Surprising Role of
Science

Unlocking the Longevity of Heliconius Butterflies: The Surprising Role of Pollen

Study Finds That Competition Between Species Was A Significant Factor
Science

New Research Disproves Longstanding Belief That Human Ancestors Simply Became Bigger Over Time

Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS
Science

New Findings Reveal Interstellar Comet 3I/ATLAS Originated 12 Billion Years Ago

Unlocking Early Childhood How Our Brains Form Initial Thoughts at
Science

Understanding Early Brain Development: When Do Babies Start to Think?

Transformative Brain Changes What Happens from Your 20s to 40s
Science

Transformative Brain Changes: What Happens from Your 20s to 40s

Alzheimers Patient Experiences Remarkable Speech Recovery with Psilocybin Treatment
Science

Alzheimer’s Patient Experiences Remarkable Speech Recovery with Psilocybin Treatment

Fusive Neurosurgery How Paralyzed Pigs Are Walking Again – Could
Science

Fusive Neurosurgery: How Paralyzed Pigs Are Walking Again – Could Humans Be Next?

Cutting Edge Natural Technology for CO2 Removal Potential Risks and Backfire
Science

Cutting-Edge Natural Technology for CO2 Removal: Potential Risks and Backfire Effects

Leave A Reply Cancel Reply

Stay In Touch
  • Facebook
  • Twitter
  • Instagram
  • Pinterest
Quote of the day

A highbrow is a man who has found something more interesting than women.

Edgar Wallace
Exchange Rate

Exchange Rate EUR: Tue, 23 Jun.

Top Insights
Stealth Radio Conceals Signals in Ambient Noise to Safeguard Drone Science

Stealth Radio Conceals Signals in Ambient Noise to Safeguard Drone Operators

Could It Be A Severed Leg? No, It's Actually A Science

Could it be a severed leg? No, it’s actually a sea slug

The Mistakes Of Zuckerberg And Musk In Understanding The Digital Science

The Mistakes of Zuckerberg and Musk in Understanding the Digital Economy

Categories
  • Blockchain (65)
  • Science (7,893)
  • Technology (2,968)
Top Posts
UK Government to Renew Dispute with Apple Over Access to

UK Government to Renew Dispute with Apple Over Access to User Data | Data Protection

October 2, 2025
Transform Your Filmmaking How New AI Tools Are Revolutionizing the

Transform Your Filmmaking: How New AI Tools Are Revolutionizing the Industry

July 20, 2025
Human Level AI is Inevitable Harnessing the Power to Influence the

Human-Level AI is Inevitable: Harnessing the Power to Influence the Journey | Garrison Nice

July 21, 2025

Mondo News is a Professional Technology & Science Blog. Here we will provide you with only exciting content that you will enjoy and find useful. We’re working to turn our passion into a successful website. We hope you enjoy our Content as much as we enjoy offering them to you.

Facebook X (Twitter) Instagram Pinterest
Categories
  • Blockchain (65)
  • Science (7,893)
  • Technology (2,968)
Most Popular
'avatar' And 'jurassic Park' Animatronics Company Collaborates With Boston Dynamics
Technology

‘Avatar’ and ‘Jurassic Park’ animatronics company collaborates with Boston Dynamics

NHS Talking Therapy Appears to Be Ineffective for Young Adults
Science

NHS Talking Therapy Appears to Be Ineffective for Young Adults

SiteLock
© 2026 Mondo News.
  • Home
  • About Us
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

Ad Blocker Enabled!
Ad Blocker Enabled!
Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.
Go to mobile version
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.