Evaluating text classification: A benchmark study

Published: 01 Jan 2024, Last Modified: 15 Oct 2024, Venue: Expert Syst. Appl. 2024, License: CC BY-SA 4.0
Abstract: Highlights
• Overall, Bidirectional LSTMs (BiLSTMs) are the best-performing method.
• LR with TF-IDF shows statistically similar results to BiLSTM and RoBERTa.
• For fake news and topic detection, simple techniques are preferred.
• The sentiment analysis tasks prefer more complex methods.
• Smallest datasets prefer less complex techniques.