Marta Villegas (@martavillegasm) 's Twitter Profile
Marta Villegas

@martavillegasm

NLP, language resources, @BSC_CNS

ID: 1001705731

linkhttps://es.linkedin.com/in/martavillegasmontserrat calendar_today10-12-2012 13:45:06

729 Tweet

662 Takipçi

369 Takip Edilen

Marta Villegas (@martavillegasm) 's Twitter Profile Photo

Community OSCAR, a collective effort to address the gap between English and non-English data availability. The dataset covers over 150 languages with 45 billion documents, totaling over 345 TiB of data. Proud to be part of it.  huggingface.co/datasets/oscar…