HITZ Hizkuntza Teknologiako Zentroa - Basque Centre for Language Technology
Institute
University of Washington
Seattle, United States
Publications in collaboration with researchers from University of Washington (2)
2024
-
Data Contamination Report from the 2024 CONDA Shared Task
CONDA 2024 - 1st Data Contamination Workshop, Proceedings of the Workshop
2022
-
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Advances in Neural Information Processing Systems