LVBERT: Transformer-Based Model for Latvian Language Understanding

Published: 2020, Last Modified: 09 Jan 2026, Venue: Baltic HLT 2020, License: CC BY-SA 4.0
Abstract: This paper presents LVBERT – the first publicly available monolingual language model pre-trained for Latvian. We show that LVBERT improves the state of the art on three Latvian NLP tasks: part-of-speech tagging, named entity recognition, and Universal Dependency parsing. We release LVBERT to facilitate future research and downstream applications for Latvian NLP.