Comparative study of classifier performance using automatic feature construction by M3GP

João E. Batista; Sara Silva

Comparative study of classifier performance using automatic feature construction by M3GP

João E. Batista, Sara Silva

Published: 01 Jan 2022, Last Modified: 15 May 2025CEC 2022EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: The M3GP algorithm, originally designed to per-form multiclass classification with genetic programming, is also a powerful feature construction method. Here we explore its ability to evolve hyper-features that are tailored not only to the problem to be solved, but also to the learning algorithm that is used to solve it. We pair M3GP with six different machine learning algorithms and study its performance in eight classification problems from different scientific domains, with substantial variety in the number of classes, features and samples. The results show that automatic feature construction with M3GP, when compared to using the standalone classifiers without feature construction, achieves statistically significant improvements in the majority of the test cases, sometimes by a very large margin, while degrading the weighted f-measure in only one out of 48 cases. We observe the differences in the number and size of the hyper-features evolved for each case, hypothesising that the simpler the classifier, the larger the amount of problem complexity is being captured in the hyper-features. Our results also reveal that the M3GP algorithm can be improved, both in execution time and in model quality, by replacing its default classifier with support vector machines or random forest classifiers.

Loading