Crash Lab

Research Project

Can AI read a chest X-ray as well as a radiologist?

PublishedRSNA 2025 — Cutting Edge Oral Presentation29 Sept 2025
radiologybenchmarkmultimodal-AIchest-xray

83%

Expert Radiologists

57%

Gemini 3.0 Pro

45%

Radiology Trainees

30%

GPT-5 Thinking

26pt

Human–AI Gap

Project Team

DS

Dr. Suvrankar Datta

2025-Present · JIPMER, AIIMS Delhi, Ashoka University

DH

Dr. Hakikat Bir Singh Bhatti

2025-Present · CRASH Lab, MBBS, IIT-Delhi

RadLE was built to answer a blunt question that most healthcare AI vendors avoid: how far are current frontier models from expert clinical performance when the cases are messy, ambiguous, and real?

The benchmark compares expert radiologists, trainees, and multimodal AI models on chest radiology tasks that require clinical reasoning rather than pattern matching alone.

Its headline result matters beyond rankings: the best AI crossed trainee-level performance, but still remained far behind experts, defining a clearer threshold for responsible deployment.

Related projects