Data and its (dis)contents: A survey of dataset development and use in machine learning research

2021 (modified: 11 Jan 2022), Patterns 2021, Readers: Everyone
Abstract: Datasets have become a critical component in the advancement of machine learning research. The ways in which such datasets are collected, constructed, and shared play a significant role in shaping the quality and impact of this research. We conduct a survey of the literature on concerns relating to the design, collection, maintenance, and distribution of machine learning datasets, as well as broader disciplinary norms and cultures that pervade the field.