Abstract: Highlights•First complete framework for learning from multi-class imbalanced big data.•Informative multi-class sampling methods that use instance-level characteristics.•Novel oversampling modification dedicated to MapReduce environments.•Code and data repository for reproducibility and applications of proposed methods.
Loading