Evaluating text classification: A benchmark study

Published: 01 Jan 2024, Last Modified: 20 May 2025, Expert Syst. Appl. 2024, CC BY-SA 4.0
Abstract: Highlights
• Overall, Bidirectional LSTMs (BiLSTMs) are the best-performing method.
• LR with TF-IDF shows statistically similar results to BiLSTM and RoBERTa.
• For fake news and topic detection, simple techniques are preferred.
• The sentiment analysis tasks prefer more complex methods.
• Smallest datasets prefer less complex techniques.