
Lucas Bandarkar
@lucasbandarkar
PhD student @uclaNLP — ML / #NLProc / multilingual
@AIatMeta
ID: 2220885312
http://lucasbandarkar.com 29-11-2013 05:33:37
65 Tweet
214 Followers
273 Following



This dataset subsamples MMLU to limit questions that are too Western-centric and they then tanslate to 42 languages. Wow Cohere For AI with two big multilingual benchmarks released this week. Great to know I will no longer have to rely on machine-translated MMLU

📣Happy to (pre-)release my Fleurs-SLU benchmark to evaluate massively multilingual spoken language understanding on SIB & Belebele. Work done at Mila - Institut québécois d'IA with David Ifeoluwa Adelani 🇳🇬 Goran Glavaš Ivan Vulić Datasets: huggingface.co/datasets/WueNL… huggingface.co/datasets/WueNL… Details to follow👇

Paper #3: Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models What can we do in model merging when we want to transfer task performance from one language to another? Lucas Bandarkar got y'all covered! Link: arxiv.org/abs/2410.01335











this paper reveals a whole number of heuristic-style errors in dense retrievers (i.e. for RAG) accepted at ACL, congrats Mohsen Fayyaz