Contextual Modeling for Document-level ASR Error Correction

Published: 2024, Last Modified: 10 Jan 2026LREC/COLING 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Contextual information, including the sentences in the same document and in other documents of the dataset, plays a crucial role in improving the accuracy of document-level ASR Error Correction (AEC), while most previous works ignore this. In this paper, we propose a context-aware method that utilizes a k-Nearest Neighbors (kNN) approach to enhance the AEC model by retrieving a datastore containing contextual information. We conduct experiments on two English and two Chinese datasets, and the results demonstrate that our proposed model can effectively utilize contextual information to improve document-level AEC. Furthermore, the context information from the whole dataset provides even better results.
Loading