On the limits of cross-domain generalization in automated X-ray prediction

Joseph Paul Cohen; Mohammad Hashir; Rupert Brooks; Hadrien Bertrand

On the limits of cross-domain generalization in automated X-ray prediction

Joseph Paul Cohen, Mohammad Hashir, Rupert Brooks, Hadrien Bertrand

Published: 18 Apr 2020, Last Modified: 14 Jul 2025MIDL 2020Readers: Everyone

Keywords: Chest X-ray, Radiology, Deep Learning, Generalization

TL;DR: This large scale study focuses on quantifying what X-rays diagnostic prediction tasks generalize well and which do not.

Track: full conference paper

Abstract: This large scale study focuses on quantifying what X-rays diagnostic prediction tasks generalize well across multiple different datasets. We present evidence that the issue of generalization is not due to a shift in the images but instead a shift in the labels. We study the cross-domain performance, agreement between models, and model representations. We find interesting discrepancies between performance and agreement where models which both achieve good performance disagree in their predictions as well as models which agree yet achieve poor performance. We also test for concept similarity by regularizing a network to group tasks across multiple datasets together and observe variation across the tasks. All code is made available online and data is publicly available: https://github.com/mlmed/torchxrayvision

Paper Type: methodological development

Source Latex: zip

Presentation Upload: zip

Presentation Upload Agreement: I agree that my presentation material (videos and slides) will be made public.

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 9 code implementations](https://www.catalyzex.com/paper/on-the-limits-of-cross-domain-generalization/code)

13 Replies

Loading