SWEb: A Large Web Dataset for the Scandinavian Languages

Tobias Norlund, Tim Isbister, Amaru Cuba Gyllensten, Paul Gabriel dos Santos, Danila Petrelli, Ariel Ekgren, Magnus Sahlgren

Published: 2025, Last Modified: 28 May 2026. Venue: ICLR 2025. License: CC BY-SA 4.0