
Loading, please wait...

Loading, please wait...
"Wherever the art of Medicine is loved, there is also a love of Humanity."
— Hippocrates

A recent study published in the American Journal of Roentgenology highlights the potential of LLM-augmented diagnostic reasoning in thoracic imaging. Researchers evaluated how human-in-the-loop workflows, which utilize reader-generated text descriptions rather than direct image analysis, influence clinical decision-making. The results suggest that while radiologic expertise remains crucial, AI significantly supports less experienced clinicians in achieving higher diagnostic accuracy. Consequently, this reader-mediated approach may offer a more practical integration path for large language models in current clinical settings.
The study analyzed 93 complex thoracic cases involving CT, MRI, and PET/CT images. Ten readers, including five thoracic radiologists and five residents, participated in two distinct interpretation sessions. In the first session, readers selected diagnoses based solely on their expertise. However, in the second session, they utilized output from Gemini 3.0 Pro, which processed their free-text descriptions of the findings. Notably, the LLM achieved a diagnostic accuracy of 63.9% when using human descriptions, compared to only 52.7% when processing images directly. This indicates that human-provided context significantly enhances the AI's reasoning capabilities.
Furthermore, the expertise of the person providing the text description played a pivotal role. The LLM performed better when processing descriptions generated by thoracic radiologists (67.3%) than those from residents (60.4%). This disparity underscores the continuing importance of specialized radiologic skills in guiding AI systems. More importantly, the use of LLM-augmented diagnostic reasoning led to a significant improvement in the diagnostic performance of radiology residents, whose accuracy rose from 56.8% to 65.2%. In contrast, the accuracy of board-certified radiologists did not see a statistically significant change, suggesting that AI serves as a powerful "safety net" or educational tool for trainees.
Integrating AI into radiology workflows often faces technical and regulatory hurdles. However, this reader-mediated workflow provides a feasible alternative by keeping the radiologist at the center of the process. Since the LLM interprets the text rather than the pixel data, it bypasses many of the complexities associated with direct image-processing software. Moreover, the study demonstrates that even without seeing the images, the LLM can provide valuable explanatory rationales that assist residents in refining their final diagnoses. This suggests that AI could eventually serve as a real-time consultation partner during the reporting process.
This workflow significantly boosts the diagnostic accuracy of residents by providing a ranking of differential diagnoses and explanatory rationales based on their own observations. It helps bridge the gap between trainee experience and specialist-level interpretation.
Large language models currently excel more at text-based reasoning than direct image interpretation. By having a human describe the findings first, the AI receives structured, high-quality clinical data, which allows it to generate more accurate diagnostic rankings.
According to the study, the accuracy of experienced thoracic radiologists did not show a statistically significant increase with AI assistance. This implies that while AI is a valuable support tool for trainees, its impact on seasoned specialists may be more limited in standard diagnostic tasks.
Disclaimer: This content is for informational and educational purposes only and does not constitute medical advice or a professional recommendation. Healthcare providers should rely on their clinical judgment and refer to the latest local and national guidelines for clinical practice.
References
Song J et al. Human-in-the-Loop Large Language Model-Augmented Diagnostic Reasoning in Thoracic Imaging: Impact of Radiologic Expertise. AJR Am J Roentgenol. 2026 May 20. doi: 10.2214/AJR.26.34999. PMID: 42160120.
"
A study shows that LLM-augmented diagnostic reasoning using reader-generated text significantly boosts diagnostic accuracy, particularly for radiology resid...
2 weeks back

An overview of how protein tyrosine kinases (PTKs) regulate dendritic cells to bridge innate and adaptive immunity for effective anti-cancer surveillance....
Today

The 2026 ERAS guidelines for gynecologic oncology offer updated, evidence-based perioperative care protocols to optimize patient recovery and surgical outco...
Today

An umbrella review confirms that ultra-processed food consumption significantly raises risks for obesity, T2DM, CVD, and mental health disorders globally....
Today

A study identifies inadequate antibiotic prophylaxis and prolonged operative time as key risk factors for poor wound healing in pediatric bone cyst surgery....
Today

A clinical trial demonstrates that combining microwave ablation with lauromacrogol injection significantly improves volume reduction in thyroid cystic-solid...
Today