

After asking the LLM to assess 100 random case studies, a trio of doctors from Cohen Children’s Medical Center in New York discovered ChatGPT pediatric diagnostic skills to be very poor. Joseph Barile, Alex Margolis, and Grace Cason tested ChatGPT’s diagnostic abilities in their study, which was published in the journal JAMA Pediatrics.
The researchers emphasize that pediatric diagnosis is particularly difficult since, in addition to taking into account all of the symptoms identified in a specific child, age must also be considered. They observed in this new effort that LLMs have been advocated as a viable new diagnostic tool by some in the medical profession. The researchers compiled 100 random pediatric case studies and requested ChatGPT to diagnose them to determine their efficacy.
To keep things simple, the researchers queried the LLM using a same approach for all of the case studies. They pasted in the case study material first, followed by the instruction “List a differential diagnosis and a final diagnosis.”
A differential diagnosis is a method of generating a preliminary diagnosis (or several of them) based on a patient’s history and physical examination. As the name implies, the final diagnosis is the suspected source of the symptoms. Answers provided by the LLM were rated by two non-study colleagues—there were three possible scores: “correct,” “incorrect,” and “did not fully capture diagnosis.”
The researchers discovered that ChatGPT gave the right scores just 17 times, 11 of which were clinically related to the correct diagnosis but were nonetheless incorrect.
The researchers acknowledge that ChatGPT is not yet suitable for use as a diagnostic tool, but they also indicate that more targeted training could improve results. They go on to say that, in the meanwhile, LLMs like ChatGPT could be beneficial as an administrative tool, or to help write research publications or generate instruction sheets for patients to use in aftercare applications.
For more information: Joseph Barile et al, Diagnostic Accuracy of a Large Language Model in Pediatric Case Studies, JAMA Pediatrics (2024). DOI: 10.1001/jamapediatrics.2023.5750
more recommended stories
CDI Test Shows Early Detection Potential in Alzheimer’s
Breakthrough Study Highlights Brain Blood Flow.
Prenatal Chlorpyrifos Exposure and Pediatric Brain Changes
A Growing Concern in Pediatric Neurodevelopment.
Valve Deterioration After TAVI: New Research Insights
The Growing Role of TAVI in.
FAST Walking Shows Promise in Post-Stroke Rehab
Stroke remains one of the leading.
Legionnaires’ Disease Long-Term Effects on Patient Recovery
Persistent Symptoms and Post-Acute Infection Syndromes.
Brisk Walking: A 15-Minute Daily Habit That Reduces Mortality in Underserved Populations
A powerful yet simple intervention A.
Vaping and Teen Smoking: UK Study Warns of Rising Risks
A new study published in Tobacco.
Targeting Cancer Cell Memory Boosts Chemotherapy
Researchers double chemotherapy effectiveness through chromatin.
PFAS Exposure Linked to Increased Type 2 Diabetes Risk
Investigating the Metabolic Impact of Forever.
Gecko-Inspired Nanoparticles Offer New Direction in Localized Cancer Therapy
Nature-Inspired Nanoparticles Deliver Precision Chemotherapy Researchers.
Leave a Comment