GMW-Greek Misspelled Words

Published: 15 Apr 2021, Last Modified: 14 Jan 2026ZenodoEveryoneRevisionsCC BY-SA 4.0
Abstract: This dataset contains 574,883 distinct Greek words and for each one of them it contains various misspellings, in average 4.32 misspellings per word. In total, it contains more than 3 million forms of Greek words (3,063,143). The misspelled words have been produced algorithmically. The dataset can be used for evaluating methods for approximate matching, error correction, etc.
Loading