The effect of OCR errors on stylistic text classification

2006 (modified: 12 Nov 2022), SIGIR 2006, Readers: Everyone
Abstract: Interest has recently grown in non-topical text classification tasks such as genre classification, sentiment analysis, and authorship profiling. We study the extent to which OCR errors affect stylistic text classification of scanned documents. We find that even a relatively high level of errors in the OCRed documents does not substantially affect stylistic classification accuracy.