Identifying interdisciplinary topics and their evolution based on BERTopic

Published: 01 Jan 2024, Last Modified: 19 Feb 2025Scientometrics 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Interdisciplinary topic reflects the knowledge exchange and integration between different disciplines. Analyzing its evolutionary path is beneficial for interdisciplinary research in identifying potential cooperative research direction and promoting the cross-integration of different disciplines. However, current studies on the evolution of interdisciplinary topics mainly focus on identifying interdisciplinary topics at the macro level. More analysis of the evolution process of interdisciplinary topics at the micro level is still needed. This paper proposes a framework for interdisciplinary topic identification and evolutionary analysis based on BERTopic to bridge the gap. The framework consists of four steps: (1) Extract the topics from the dataset using the BERTopic model. (2) Filter out the invalid global topics and stage topics based on lexical distribution and further filter out the invalid stage topics based on topic correlation. (3) Identify interdisciplinary topics based on disciplinary diversity and disciplinary cohesion. (4) Analyze the interdisciplinary topic evolution by inspecting the intensity and content in the evolution, and visualize the evolution using Sankey diagrams. Finally, We conduct an empirical study on a dataset collected from the Web of Science (WoS) in Library & Information Science (LIS) to evaluate the validity of the framework. From the dataset, we have identified two distinct types of interdisciplinary topics in LIS. Our findings suggest that the growth points of LIS mainly exist in the interdisciplinary research topics. Additionally, our analysis reveals that more and more interdisciplinary knowledge needs to be integrated to solve more complex problems. Mature interdisciplinary topics mainly formed from the internal core knowledge in LIS stimulated by external disciplinary knowledge, while promising interdisciplinary topics are still at the stage of internalizing and absorbing the knowledge of other disciplines. The dataset, the code for implementing the algorithms, and the complete experiment results will be released on GitHub at: https://github.com/haihua0913/IITE-BERT.
Loading