Multidimensional evaluation framework for autonomous optimization pipelines in personalized head CT and chest radiograph report generation.
RSNA 2025
View research →Our lab has secured $20,000 from Google and $15,000 in support from OpenAI to accelerate our work
Read AnnouncementResearch Project
83%
Expert Radiologists
57%
Gemini 3.0 Pro
45%
Radiology Trainees
30%
GPT-5 Thinking
26pt
Human–AI Gap
Project Team
Dr. Suvrankar Datta
2025-Present · JIPMER, AIIMS Delhi, Ashoka University
Dr. Hakikat Bir Singh Bhatti
2025-Present · CRASH Lab, MBBS, IIT-Delhi
RadLE was built to answer a blunt question that most healthcare AI vendors avoid: how far are current frontier models from expert clinical performance when the cases are messy, ambiguous, and real?
The benchmark compares expert radiologists, trainees, and multimodal AI models on chest radiology tasks that require clinical reasoning rather than pattern matching alone.
Its headline result matters beyond rankings: the best AI crossed trainee-level performance, but still remained far behind experts, defining a clearer threshold for responsible deployment.
Multidimensional evaluation framework for autonomous optimization pipelines in personalized head CT and chest radiograph report generation.
RSNA 2025
View research →Systematic evaluation of how frontier AI models perform on clinical data from Indian healthcare settings versus the Western datasets they were trained on.
Co-creating intuitive AI tools with frontline clinicians. Hyper-personalized documentation systems and trusted AI-driven assistants.