DMLR: Data-centric Machine Learning Research - Past, Present and Future

Published: 01 Jan 2023, Last Modified: 20 May 2025, Venue: CoRR 2023, License: CC BY-SA 4.0
Abstract: Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods towards positive scientific, societal and business impact.