Permutation importance: a corrected feature importance measure

André Altmann; Laura Tolosi; Oliver Sander; Thomas Lengauer

Permutation importance: a corrected feature importance measure

André Altmann, Laura Tolosi, Oliver Sander, Thomas Lengauer

Published: 01 Jan 2010, Last Modified: 22 May 2024Bioinform. 2010EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: : In life sciences, interpretability of machine learning models is as important as their prediction accuracy. Linear models are probably the most frequently used methods for assessing feature relevance, despite their relative inflexibility. However, in the past years effective estimators of feature relevance have been derived for highly complex or non-parametric models such as support vector machines and RandomForest (RF) models. Recently, it has been observed that RF models are biased in such a way that categorical variables with a large number of categories are preferred.

Loading