A Cross-media Model for Automatic Image Annotation

Lamberto Ballan, Tiberio Uricchio, Lorenzo Seidenari, Alberto Del Bimbo

2014 (modified: 26 Jan 2022), ICMR 2014

Abstract: Automatic image annotation is still an important open problem in multimedia and computer vision. The success of media sharing websites has led to the availability of large collections of images tagged with human-provided labels. Many approaches previously proposed in the literature do not accurately capture the intricate dependencies between image content and annotations. We propose a learning procedure based on Kernel Canonical Correlation Analysis which finds a mapping between visual and textual words by projecting them into a latent meaning space. The learned mapping is then used to annotate new images using advanced nearest-neighbor voting methods. We evaluate our approach on three popular datasets, and show clear improvements over several approaches relying on more standard representations.
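The pipeline the abstract describes (project visual and textual representations into a shared latent space with KCCA, then annotate a new image by nearest-neighbor voting) can be illustrated with a minimal sketch. This is not the authors' implementation. The RBF kernel on visual features, the linear kernel on tag indicator vectors, the regularization value, the distance-weighted vote, and every function name and the toy data below are assumptions chosen for illustration; the abstract does not specify which kernels or voting schemes the paper uses.

```python
import numpy as np


def rbf_kernel(A, B, gamma=0.5):
    """Gaussian kernel between rows of A and rows of B (assumed visual kernel)."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))


def center_kernel(K_new, K_train):
    """Center K_new (m x n) in feature space using training-kernel statistics."""
    return (K_new - K_train.mean(axis=0)[None, :]
            - K_new.mean(axis=1)[:, None] + K_train.mean())


def fit_kcca(Kx, Ky, reg=0.1, n_components=1):
    """One common regularized KCCA formulation: take the top eigenvectors of
    (Kx + reg*I)^-1 Ky (Ky + reg*I)^-1 Kx as dual weights for the visual view."""
    I = np.eye(Kx.shape[0])
    M = np.linalg.solve(Kx + reg * I, Ky) @ np.linalg.solve(Ky + reg * I, Kx)
    vals, vecs = np.linalg.eig(M)
    order = np.argsort(-vals.real)[:n_components]
    return vecs[:, order].real


def annotate(x_query, X_train, Y_train, alpha, Kx_raw, gamma=0.5, k=3, n_tags=2):
    """Project the query into the latent space, then let its k nearest
    training images vote for tags (plain distance-weighted voting, assumed)."""
    Z_train = center_kernel(Kx_raw, Kx_raw) @ alpha
    k_q = rbf_kernel(x_query[None, :], X_train, gamma)
    z_q = center_kernel(k_q, Kx_raw) @ alpha
    dist = np.linalg.norm(Z_train - z_q, axis=1)
    nn = np.argsort(dist)[:k]
    votes = (Y_train[nn] / (1.0 + dist[nn])[:, None]).sum(axis=0)
    return np.argsort(-votes)[:n_tags]


# Hypothetical toy data: two visual clusters, each with its own pair of tags.
base = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [0.1, 0.1]])
X = np.vstack([base, base + 5.0])                      # 8 "images", 2-D features
Y = np.array([[1, 1, 0, 0]] * 4 + [[0, 0, 1, 1]] * 4, dtype=float)

Kx_raw = rbf_kernel(X, X)
Kx = center_kernel(Kx_raw, Kx_raw)
Ky_raw = Y @ Y.T                                       # linear kernel on tags
Ky = center_kernel(Ky_raw, Ky_raw)

alpha = fit_kcca(Kx, Ky, reg=0.1, n_components=1)
predicted = annotate(np.array([0.05, 0.05]), X, Y, alpha, Kx_raw)
print(sorted(predicted.tolist()))
```

On this toy data the only informative latent direction is the one separating the two clusters, so a single component is used; a query near the first cluster is expected to inherit that cluster's tags. Real image collections would need more components, tuned kernels, and a stronger voting scheme than this simple weighted count.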
