SlovakBERT: Slovak Masked Language Model

Published: 01 Jan 2022, Last Modified: 19 May 2025EMNLP (Findings) 2022EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: We introduce a new Slovak masked language model called SlovakBERT. This is to our best knowledge the first paper discussing Slovak transformers-based language models. We evaluate our model on several NLP tasks and achieve state-of-the-art results. This evaluation is likewise the first attempt to establish a benchmark for Slovak language models. We publish the masked language model, as well as the fine-tuned models for part-of-speech tagging, sentiment analysis and semantic textual similarity.
Loading