Visually Grounded Interpretation of Noun-Noun Compounds in English

Anonymous

Published: 29 Mar 2022, Last Modified: 05 May 2023, CMCL 2022, Readers: Everyone

Keywords: visuo-linguistic representations, grounding, noun compound interpretation, semantics, vision and language interface

TL;DR: We combine visual and linguistic representations for noun-noun compound interpretation.

Abstract: Noun-noun compounds (NNCs) occur frequently in the English language. Accurate NNC interpretation, i.e., determining the implicit relationship between the constituents of an NNC, is crucial for the advancement of many natural language processing tasks. Until now, computational NNC interpretation has been limited to approaches involving linguistic representations only. However, much research suggests that grounding linguistic representations in vision or other modalities can increase performance on this and other tasks. Our work is a novel comparison of linguistic and visuo-linguistic representations for the task of NNC interpretation. We frame NNC interpretation as a relation classification task, evaluating on a large, relationally annotated NNC dataset. We combine distributional word vectors with image vectors to investigate how visual information can help improve NNC interpretation systems. We find that adding visual vectors increases classification performance on our dataset in many cases.
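
The abstract does not say how the word and image vectors are combined, so the following is only a minimal sketch of one plausible setup: each constituent's word vector is concatenated with its image vector, and the two constituents are concatenated into a single feature vector for a relation classifier. The function names, the L2 normalization, the concatenation strategy, and the toy random embeddings are all assumptions made for illustration, not the authors' method.

```python
import math
import random


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def compound_features(modifier, head, word_vecs, image_vecs=None):
    """Build one feature vector for a compound such as ("olive", "oil").

    Each constituent contributes its word vector and, if image_vecs is
    given, its image vector. Concatenation is a hypothetical fusion choice.
    """
    features = []
    for noun in (modifier, head):
        features.extend(l2_normalize(word_vecs[noun]))
        if image_vecs is not None:
            features.extend(l2_normalize(image_vecs[noun]))
    return features


# Toy random embeddings standing in for real word and image vectors.
rng = random.Random(0)
vocab = ["olive", "oil", "steel", "knife"]
word_vecs = {w: [rng.gauss(0, 1) for _ in range(4)] for w in vocab}
image_vecs = {w: [rng.gauss(0, 1) for _ in range(3)] for w in vocab}

text_only = compound_features("olive", "oil", word_vecs)
grounded = compound_features("olive", "oil", word_vecs, image_vecs)
print(len(text_only), len(grounded))  # prints "8 14"
```

Either feature vector could then be passed to any standard classifier over the relation labels, which would allow a like-for-like comparison of the linguistic and visuo-linguistic representations.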
