Multi-class imbalanced big data classification on Spark

Published: 2021, Last Modified: 08 Mar 2025Knowl. Based Syst. 2021EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•First complete framework for learning from multi-class imbalanced big data.•Informative multi-class sampling methods that use instance-level characteristics.•Novel oversampling modification dedicated to MapReduce environments.•Code and data repository for reproducibility and applications of proposed methods.
Loading