Integrating Symbolic Natural Language Understanding and Language Models for Word Sense Disambiguation

Published: 17 Sept 2025, Last Modified: 06 Nov 2025ACS 2025 OralEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Word sense disambiguation, natural language understanding, symbolic representation, language models
Abstract: Word sense disambiguation is a fundamental challenge in natural language understanding. Current methods are primarily aimed at coarse-grained representations (e.g. WordNet synsets or FrameNet frames) and require hand-annotated training data to construct. This makes it difficult to automatically disambiguate richer representations (e.g. built on OpenCyc) that are needed for sophisticated inference. We propose a method that uses statistical language models as oracles for disambiguation that does not require any hand-annotation of training data. Instead, the multiple candidate meanings generated by a symbolic NLU system are converted into distinguishable natural language alternatives, which are used to query an LLM to select appropriate interpretations given the linguistic context. The selected meanings are propagated back to the symbolic NLU system. We evaluate our method against human-annotated gold answers to demonstrate its effectiveness.
Paper Track: Technical paper
Submission Number: 24
Loading