
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Sasko, Quentin Lhoest, Angelina McMillan-Major, Gérard Dupont, Stella Biderman, Anna Rogers, Loubna Ben Allal, Francesco De Toni, Giada Pistilli et al. (34 additional authors not shown)

2023 (modified: 27 Mar 2023), CoRR 2023. Readers: Everyone