DescribeML: A dataset description tool for machine learning

Published: 01 Jan 2024, Last Modified: 18 May 2025Sci. Comput. Program. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A tool for describing dataset for machine learning.•Describe the composition, provenance, and social concerns of the data used to train ML models.•Provide a set of language features and IDE extension to facilitate the dataset description process.•Developed as an VSCode extension and published in the VSCode Market.
Loading