CRASH Lab · Ashoka University

Research & Publications

Active research projects, benchmarks, and accepted papers — with evaluation grounded in clinical reality, not presentation-friendly metrics alone.

Radiology's Last Exam (RadLE): Benchmarking Frontier Multimodal AI Against Human Experts and a Taxonomy of Visual Reasoning Errors in Radiology

The flagship RadLE benchmark comparing frontier multimodal AI systems with human radiology expertise and defining a taxonomy of visual reasoning errors in radiology.

Authors: Suvrankar Datta, Divya Buchireddygari, Lakshmi Vennela Chowdary Kaza, Mrudula Bhalke, Kautik Singh, Ayush Pandey, Sonit Sai Vasipalli, Upasana Karnwal, +19 more

Open paperPreprint2025Benchmark

CheXthought: A Global Multimodal Dataset of Clinical Chain-of-Thought Reasoning and Visual Attention for Chest X-ray Interpretation

A global multimodal dataset of clinical reasoning traces and visual attention annotations for chest X-ray interpretation, developed by the Global Radiology Consortium with CRASH Lab contributing to the Indian consortium.

Authors: Sonali Sharma, Jin Long, George Shih, Sarah Eid, Christian Bluethgen, Francine L. Jacobson, Emily B. Tsai, Global Radiology Consortium, +2 more

Open paperarXiv2026Paper

Learning to Write Like a Radiologist: Multidimensional Evaluation and Benchmarking of Autonomous Optimization Pipelines for Hyper-Personalized Head CT Report Generation

Multidimensional evaluation framework for autonomous optimization pipelines in personalized head CT report generation.

RSNA 20252025Abstract

Stress-Test and Radiologist Blinded Validation of Multimodal Foundation Models on an Unseen Chest Radiograph Dataset Using a Novel Multi-Metric Evaluation Framework

Comprehensive evaluation framework for multimodal foundation models in chest radiograph analysis with radiologist-blinded validation.

RSNA 20252025Abstract

Style-Aware Radiology Reporting: A Scalable Autonomous Optimisation Pipeline for Improving Head CT Report Generation Quality

Scalable autonomous optimization pipeline focused on style-aware improvements in head CT report generation.

RSNA 20252025Abstract

Towards Hyper-Personalised Radiology Reporting: A Scalable Autonomous Optimisation Pipeline for Improving Chest X-Ray Report Generation Quality

Autonomous optimization pipeline for hyper-personalized chest X-ray report generation with quality improvements.

RSNA 20252025Abstract

TRUST: A Novel Five-Point Scale for Assessment of Reliability and Referencing Integrity in AI Agent Generated Radiology Reports

Novel assessment scale for evaluating reliability and referencing integrity in AI-generated radiology reports.

RSNA 20252025Abstract

Validation of RADAR and TRUST Metrics: Analyzing Inter-Reader Agreement and Draft Variability in Agentic Radiology Reporting

Analysis of inter-reader agreement and draft variability using RADAR and TRUST metrics in agentic radiology reporting.

RSNA 20252025Abstract

2026 And Beyond
National Research Collaborations

We began securing support from leading national bodies, including ANRF, DST, and the IndiaAI Mission, to advance responsible healthcare AI research.

Supported by Koita Foundation

Funding for the lab is provided by the Koita Foundation and Google, alongside contributions from institutional partners and private philanthropy

Click here to learn about CRASH Lab's Journey

Lead PI

Dr. Suvrankar Datta

JIPMERAIIMS DelhiAshoka University

Meet the team behind CRASH Lab.

Research & Publications

Radiology's Last Exam (RadLE): Benchmarking Frontier Multimodal AI Against Human Experts and a Taxonomy of Visual Reasoning Errors in Radiology

CheXthought: A Global Multimodal Dataset of Clinical Chain-of-Thought Reasoning and Visual Attention for Chest X-ray Interpretation

Learning to Write Like a Radiologist: Multidimensional Evaluation and Benchmarking of Autonomous Optimization Pipelines for Hyper-Personalized Head CT Report Generation

Stress-Test and Radiologist Blinded Validation of Multimodal Foundation Models on an Unseen Chest Radiograph Dataset Using a Novel Multi-Metric Evaluation Framework

Style-Aware Radiology Reporting: A Scalable Autonomous Optimisation Pipeline for Improving Head CT Report Generation Quality

Towards Hyper-Personalised Radiology Reporting: A Scalable Autonomous Optimisation Pipeline for Improving Chest X-Ray Report Generation Quality

TRUST: A Novel Five-Point Scale for Assessment of Reliability and Referencing Integrity in AI Agent Generated Radiology Reports

Validation of RADAR and TRUST Metrics: Analyzing Inter-Reader Agreement and Draft Variability in Agentic Radiology Reporting

2026 And BeyondNational Research Collaborations

Dr. Suvrankar Datta

2026 And Beyond
National Research Collaborations