Dimensions of Similarity: Towards Interpretable Dimension-Based Text Similarity

Published: 01 Jan 2023, Last Modified: 18 Oct 2024, Venue: ECAI 2023, License: CC BY-SA 4.0
Abstract: This paper paves the way for interpretable and configurable semantic similarity search, by training state-of-the-art models for identifying textual similarity guided by a set of aspects or dimensions. The similarity models are analyzed as to which interpretable dimensions of similarity they place the most emphasis on. We conceptually introduce configurable similarity search for finding documents similar in specific aspects but dissimilar in others. To evaluate the interpretability of these dimensions, we experiment with downstream retrieval tasks using weighted combinations of these dimensions. Configurable similarity search is an invaluable tool for exploring datasets and will certainly be helpful in many applied natural language processing research applications.
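The idea of scoring documents by a weighted combination of per-dimension similarities can be illustrated with a minimal sketch. This is not the authors' implementation: the dimension names ("topic", "style"), the toy embedding vectors, and the function names are all hypothetical, and cosine similarity is assumed as the per-dimension measure. A negative weight is used here to express "dissimilar in this aspect".

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def configurable_similarity(query, doc, weights):
    """Weighted sum of per-dimension cosine similarities.

    `query` and `doc` map a dimension name to that dimension's embedding;
    `weights` maps a dimension name to its (possibly negative) weight.
    """
    return sum(w * cosine(query[dim], doc[dim]) for dim, w in weights.items())


def search(query, corpus, weights):
    """Rank all documents in `corpus` by configurable similarity, best first."""
    scored = [
        (name, configurable_similarity(query, emb, weights))
        for name, emb in corpus.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical toy data: one small embedding per dimension and document.
query = {"topic": [1.0, 0.0], "style": [1.0, 0.0]}
corpus = {
    "doc_a": {"topic": [1.0, 0.0], "style": [0.0, 1.0]},  # same topic, different style
    "doc_b": {"topic": [0.0, 1.0], "style": [1.0, 0.0]},  # different topic, same style
}

# Ask for documents similar in topic but dissimilar in style.
weights = {"topic": 1.0, "style": -1.0}
ranking = search(query, corpus, weights)
print(ranking)
```

With these weights, the document that shares the query's topic but differs in style ranks first. Swapping the signs of the two weights reverses the ranking, which is the sense in which the search is configurable.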