Keywords: Functional ANOVA, Model interpretability, Purification Algorithm, Tree Ensemble
TL;DR: This paper develops an equivalent functional ANOVA representation for tree ensemble learning algorithms, which opens the black box and yields inherently interpretable models.
Abstract: Tree ensembles such as random forests and gradient boosting machines are among the most effective methods for tabular prediction, but their strong performance often comes at the cost of interpretability. We show that ensembles of shallow decision trees admit an equivalent functional ANOVA representation, making them inherently interpretable while retaining competitive accuracy. Building on this insight, we develop an exact algorithm that decomposes tree ensembles into main effects and interactions, yielding faithful explanations without approximation. We further introduce two strategies to enhance interpretability: (i) imposing constraints on depth, monotonicity, and interactions, and (ii) post-hoc pruning of trivial effects via sparse modeling and effect selection. Across synthetic and real-world datasets, our approach achieves a superior trade-off between interpretability and predictive power compared to established interpretable models such as Explainable Boosting Machines and GAMI-Net. These results position shallow tree ensembles as a practical and theoretically grounded alternative for transparent high-performance modeling of tabular data.
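The simplest instance of the decomposition described above can be sketched in a few lines. This is a minimal illustration under assumptions, not the paper's exact algorithm: scikit-learn's `GradientBoostingRegressor` with `max_depth=1` stands in for a generic shallow tree ensemble. Because every depth-1 tree (stump) splits on a single feature, the ensemble is exactly additive, and accumulating each tree's output under its root split feature recovers the main effects of a functional ANOVA representation with no approximation.

```python
# Hedged sketch: depth-1 gradient boosting is an additive model, so its
# prediction decomposes exactly into per-feature main effects plus a
# constant -- the stump-only special case of the functional ANOVA
# representation for shallow tree ensembles. Not the paper's code.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(500, 3))
y = np.sin(2 * X[:, 0]) + X[:, 1] ** 2 + 0.1 * rng.normal(size=500)

model = GradientBoostingRegressor(
    max_depth=1, n_estimators=200, learning_rate=0.1, random_state=0
)
model.fit(X, y)

def main_effects(model, X):
    """Attribute each stump's output to its root split feature.

    Returns (effects, bias): effects[:, j] is the exact main effect
    f_j(x_j); bias collects the initial prediction and any degenerate
    split-free trees, so effects.sum(axis=1) + bias == model.predict(X).
    """
    n, d = X.shape
    effects = np.zeros((n, d))
    bias = model.init_.predict(X).ravel().copy()  # initial constant fit
    for est in model.estimators_[:, 0]:           # one stump per stage
        j = est.tree_.feature[0]                  # root split feature
        contrib = model.learning_rate * est.predict(X)
        if j < 0:                                 # leaf-only tree: constant
            bias += contrib
        else:
            effects[:, j] += contrib
    return effects, bias

effects, bias = main_effects(model, X)
# Exactness check: the decomposition reproduces the ensemble prediction.
assert np.allclose(model.predict(X), effects.sum(axis=1) + bias)
```

Centering each column of `effects` to mean zero and moving the removed mass into the constant gives the usual ANOVA normalization; deeper trees additionally contribute interaction terms, which is where a purification step is needed.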
Primary Area: unsupervised, self-supervised, semi-supervised, and supervised representation learning
Submission Number: 20828