Automatic Topic Label Generation using Conversational Models

Virginia Ramón-Ferrer, Carlos Badenes-Olmedo, Óscar Corcho

Published: 2023, Last Modified: 05 Jan 2026K-CAP 2023EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: In probabilistic topic models, a topic is characterised by a set of words, with a probability associated to each of them. Even though it is not necessary to understand the meaning of topics to perform common downstream tasks where topic models are used, such as topic inference or document similarity, there have been attempts to uncover the semantics of topics by providing labels to them, consisting in a couple of concepts. In this paper we propose a methodology, Conversational Probabilistic Topic Labelling (CPTL), to study whether conversational models can be used to generate labels that describe probabilistic topics given their most representative keywords. We evaluate and compare the performance of a selection of conversational models for the topic label generation task with the performance of a task-specific language model trained to generate topic labels.