Publications in collaboration with researchers from Massachusetts Institute of Technology (1)

2022

  1. The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

    Advances in Neural Information Processing Systems