An Extended Evaluation of Single-Label Multi-modal Field of Research Classification Using a Taxonomy-Based Metric

Published: 01 Jan 2024, Last Modified: 13 Jun 2025ESWC Satellite Events (2) 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: This paper presents a multi-modal approach to the single-label field of research classification shared task. Our method, SLAMFORC, incorporates metadata, full text, and image data from scholarly articles to generate comprehensive document embeddings. We built a voting ensemble of pre-trained BERT models (SciBERT and SciNCL) and traditional classifiers and achieved competitive performance in the Field of Research Classification of Scholarly Publications shared task. SLAMFORC scored highest in F1 score and precision and second best in recall and accuracy. We extend our original analysis by examining misclassified samples to improve future iterations. Additionally, we apply a taxonomy-based evaluation metric to better assess our results.
Loading