Evaluation of clustering and topic modeling methods over health-related tweets and emails

Published: 01 Jan 2021, Last Modified: 05 Feb 2025, Artif. Intell. Medicine 2021, CC BY-SA 4.0
Abstract: Highlights
• Evaluation of topic modeling and clustering on health-related tweets and emails.
• Topic modeling: LSI, LDA, BTM, GibbsLDA, Online LDA, Online Twitter LDA, and GSDMM.
• Clustering: k-means with two feature representations (TF-IDF and Doc2Vec).
• The evaluation is based on two internal and five external cluster validity indices.
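To make the notion of an external cluster validity index concrete, the following is a minimal sketch that scores a clustering against gold labels. The highlights do not name the seven indices used, so purity and the Rand index appear here only as common examples and are an assumption, not necessarily the paper's choice. The function names and the toy labels are hypothetical.

```python
from collections import Counter
from itertools import combinations


def purity(labels_true, labels_pred):
    """Fraction of documents that belong to the majority gold class of their cluster."""
    clusters = {}
    for gold_label, cluster_id in zip(labels_true, labels_pred):
        clusters.setdefault(cluster_id, []).append(gold_label)
    # For each cluster, count the members carrying its most frequent gold label.
    majority_total = sum(
        Counter(members).most_common(1)[0][1] for members in clusters.values()
    )
    return majority_total / len(labels_true)


def rand_index(labels_true, labels_pred):
    """Fraction of document pairs on which the gold labels and the clustering agree."""
    pairs = list(combinations(range(len(labels_true)), 2))
    agreements = sum(
        (labels_true[i] == labels_true[j]) == (labels_pred[i] == labels_pred[j])
        for i, j in pairs
    )
    return agreements / len(pairs)


# Toy data (hypothetical): gold topics and cluster assignments for six documents.
gold = ["flu", "flu", "flu", "diet", "diet", "diet"]
pred = [0, 0, 1, 1, 1, 1]

print("purity:", purity(gold, pred))
print("rand index:", rand_index(gold, pred))
```

Both scores lie in [0, 1] and reach 1 when the clustering reproduces the gold partition. Internal indices, by contrast, need no gold labels and are computed from the document representations alone.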