A Weighted Topic Model Learned From Local Semantic Space for Automatic Image Annotation

Published: 20 Apr 2020, Last Modified: 05 Mar 2025IEEE AccessEveryoneCC BY 4.0
Abstract: Automatic image annotation plays a signi cant role in image understanding, retrieval, classi cation, and indexing. Today, it is becoming increasingly important in order to annotate large-scale social media images from content-sharing websites and social networks. These social images are usually annotated by user-provided low-quality tags. The topic model is considered as a promising method to describe these weak-labeling images by learning latent representations of training samples. The recent annotation methods based on topic models have two shortcomings. First, they are dif cult to scale to a large-scale image dataset. Second, they can not be used to online image repository because of continuous addition of new images and new tags. In this paper, we propose a novel annotation method based on topic model, namely local learning-based probabilistic latent semantic analysis (LL-PLSA), to solve the above problems. The key idea is to train a weighted topic model for a given test image on its semantic neighborhood consisting of a xed number of semantically and visually similar images. This method can scale to a large-scale image database, as training samplesinvolvedinmodelingareafewnearestneighborsratherthantheentiredatabase. Moreover, this proposed topic model, online customized for the test image, naturally addresses the issue of continuous addition of new images and new tags in a database. Extensive experiments on three benchmark datasets demonstrate that the proposed method signi cantly outperforms the state-of-the-art especially in terms of overall metrics.
Loading