QuantRad: Advancing Quantitative Reliability in Radiology Report Generation with Cascaded Decoders

Ying Jin; Noel C Codella; Yanbo Xu; Haoquan Fang; Yu Gu; Mu Wei; Jenq-Neng Hwang

27 Sept 2024 (modified: 31 Oct 2024), ICLR 2025 Conference Withdrawn Submission, CC BY 4.0

Keywords: Radiology Report Generation, Image Captioning, Medical Imaging

Abstract: Radiology report generation using artificial intelligence has shown promise in enhancing clinical workflows. However, due to the limitations of the language modeling loss, existing approaches struggle with quantitative accuracy (e.g., measuring the size of nodules) and cannot produce confidence scores for medical findings, which are needed to compute the quantitative metrics required for regulatory approval. This paper introduces QuantRad, a novel approach that uses cascaded decoders to address these challenges in radiology report generation. QuantRad pairs a vision encoder with three decoders that operate sequentially: the first conducts sentence-level topic planning by generating a series of questions, the second recognizes abnormal targets and their quantitative and categorical attributes, and the third generates the final report by answering each question based on the recognized targets. With the dedicated target recognition step, our method integrates the quantitative strength of a perception model into text generation. Specifically, QuantRad recognizes abnormal targets without being biased by language priors and produces a probability score with each finding, which allows sensitivity to be adjusted for clinical adoption and ROC curves to be produced for regulatory compliance. In addition, the disentangled topic planning step captures the uncertainty in which medical findings are omitted and in what order they are presented, so the report generation decoder can be trained with less ambiguity. Our method advances the accuracy and reliability of radiology report generation, offering a promising path for clinical applications and regulatory validation.
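The three-stage data flow described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the three decoders are replaced by hard-coded stubs, and every name here (`Question`, `Finding`, `plan_topics`, `recognize_targets`, `answer_questions`, `generate_report`, the `threshold` parameter) and every value is a hypothetical stand-in chosen for illustration. It only shows how per-finding probability scores from a separate recognition stage could let a caller adjust sensitivity before the report text is written.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Question:
    topic: str  # hypothetical: the finding this question asks about
    text: str   # hypothetical: the question wording


@dataclass
class Finding:
    label: str          # e.g. "nodule"
    size_mm: float      # quantitative attribute (illustrative)
    probability: float  # confidence score from the recognition stage


def plan_topics(image) -> List[Question]:
    # Stage 1 (stub): sentence-level topic planning as a series of questions.
    return [
        Question("nodule", "Is there a nodule?"),
        Question("mass", "Is there a mass?"),
    ]


def recognize_targets(image) -> List[Finding]:
    # Stage 2 (stub): abnormal targets with attributes and probabilities.
    # The values below are made up for illustration only.
    return [
        Finding("nodule", 6.0, 0.91),
        Finding("mass", 12.0, 0.22),
    ]


def answer_questions(questions: List[Question],
                     findings: List[Finding],
                     threshold: float) -> str:
    # Stage 3 (stub): answer each planned question from the recognized
    # targets, keeping only findings whose probability meets the threshold.
    kept = {f.label: f for f in findings if f.probability >= threshold}
    sentences = []
    for q in questions:
        f = kept.get(q.topic)
        if f is None:
            sentences.append(f"No {q.topic} is seen.")
        else:
            sentences.append(
                f"A {f.label} measuring {f.size_mm:.1f} mm is seen "
                f"(p={f.probability:.2f})."
            )
    return " ".join(sentences)


def generate_report(image, threshold: float = 0.5) -> str:
    # Run the three stages in sequence; `image` is unused by the stubs.
    questions = plan_topics(image)
    findings = recognize_targets(image)
    return answer_questions(questions, findings, threshold)


if __name__ == "__main__":
    print(generate_report(None, threshold=0.5))
    print(generate_report(None, threshold=0.2))
```

Lowering `threshold` makes the sketch report more findings (higher sensitivity), and sweeping it over the stored probabilities is the kind of operation from which an ROC curve could be computed, given ground-truth labels.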

Primary Area: generative models

Submission Number: 11978
