Interpretable Text Embeddings and Text Similarity Explanation: A Survey

Juri Opitz, Lucas Möller, Andrianos Michail, Sebastian Padó, Simon Clematide

Published: 2025/2/20

Abstract

Text embeddings are a fundamental component in many NLP tasks, including classification, regression, clustering, and semantic search. However, despite their ubiquitous application, challenges persist in interpreting embeddings and explaining similarities between them. In this work, we provide a structured overview of methods specializing in inherently interpretable text embeddings and text similarity explanation, an underexplored research area. We characterize the main ideas, approaches, and trade-offs. We compare means of evaluation, discuss overarching lessons learned, and finally identify opportunities and open challenges for future research.
