Neel Bhandari
@neelbhandari9
Masters Student @LTIatCMU | ML Scientist @PayPal | Open Research @CohereForAI Community | Previously External Research Student @MITIBMLab. Views my own.
ID: 775256214384971776
https://neelbhandari6.github.io 12-09-2016 08:54:19
599 Tweet
286 Takipçi
816 Takip Edilen
🚨New preprint 🚨 I’m super excited to share our work: To Code, or Not To Code? Exploring the Impact of Code in Pre-training 📜: arxiv.org/abs/2408.10914 w/ Yixuan Su, raymond ma, Adrien Morisot, Ivan Zhang, Acyr Locatelli, Marzieh Fadaee, Ahmet Üstün, Sara Hooker [1/n]
These days RAG systems have gotten popular for boosting LLMs—but they're brittle💔. Minor shifts in phrasing (✍️ style, politeness, typos) can wreck the pipeline. Even advanced components don’t fix the issue. Check out this extensive eval by Neel Bhandari and Tianyu (Tiya) Cao!
Real user queries often look different from the clean, concise ones in academic benchmarks - ambiguity, full of typos, and much less readable. We show that even strong RAG systems quickly break under these conditions. Awesome project led by Neel Bhandari and Tianyu (Tiya) Cao!!
Very proud of this work which is being presented ICLR 2026 later today. While I will not be there — Catch up with Viraat Aryabumi and Ahmet Üstün who are both fantastic and can share more about our work at both Cohere Labs and cohere. 🔥✨