BEAMRAD: A tool for Creating and Assessing Medical Dataset Documentation

10 Apr 2025 (modified: 12 Apr 2025), MIDL 2025 Short Papers Submission, CC BY 4.0
Keywords: Medical datasets, Dataset documentation, Bias
TL;DR: To address bias stemming from dataset documentation practices, we introduce BEAMRAD, a tool to create and assess medical dataset documentation.
Abstract: Medical datasets drive deep learning in medical imaging but may introduce biases that impact model performance and clinical applicability. To address these biases, we introduce BEAMRAD, a dynamic tool to create and assess medical dataset documentation. BEAMRAD systematically evaluates documentation and links insufficient reporting to potential biases. Through an exemplary assessment of publicly available medical datasets, we highlight gaps in dataset documentation, including inconsistencies in the reporting of data annotation, error quantification, and dataset limitations. We propose three key improvements to address these gaps: stricter repository oversight, reflective documentation practices, and adaptable documentation.
Submission Number: 29