The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Sasko, Quentin Lhoest, Angelina McMillan-Major, Gérard Dupont, Stella Biderman, Anna Rogers, Loubna Ben Allal, Francesco De Toni, Giada Pistilli, et al. (34 additional authors not shown)
2023 (modified: 27 Mar 2023)
CoRR 2023