Neel Bhandari (@neelbhandari9) 's Twitter Profile
Neel Bhandari

@neelbhandari9

Masters Student @LTIatCMU | ML Scientist @PayPal | Open Research @CohereForAI Community | Previously External Research Student @MITIBMLab. Views my own.

ID: 775256214384971776

linkhttps://neelbhandari6.github.io calendar_today12-09-2016 08:54:19

599 Tweet

286 Followers

816 Following

Akhila Yerukola (@akhila_yerukola) 's Twitter Profile Photo

These days RAG systems have gotten popular for boosting LLMs—but they're brittle💔. Minor shifts in phrasing (✍️ style, politeness, typos) can wreck the pipeline. Even advanced components don’t fix the issue. Check out this extensive eval by Neel Bhandari and Tianyu (Tiya) Cao!

Akari Asai (@akariasai) 's Twitter Profile Photo

Real user queries often look different from the clean, concise ones in academic benchmarks - ambiguity, full of typos, and much less readable. We show that even strong RAG systems quickly break under these conditions. Awesome project led by Neel Bhandari and Tianyu (Tiya) Cao!!

Devansh Jain (@devanshrjain) 's Twitter Profile Photo

Excited to share PolyGuard 🛡️, our new state-of-the-art multilingual safety detector. PolyGuard supports 17 languages and outperforms all open-source and commercial moderation tools!

Sara Hooker (@sarahookr) 's Twitter Profile Photo

Very proud of this work which is being presented ICLR 2026 later today. While I will not be there — Catch up with Viraat Aryabumi and Ahmet Üstün who are both fantastic and can share more about our work at both Cohere Labs and cohere. 🔥✨

Very proud of this work which is being presented <a href="/iclr_conf/">ICLR 2026</a> later today. While I will not be there — Catch up with <a href="/viraataryabumi/">Viraat Aryabumi</a> and <a href="/ahmetustun89/">Ahmet Üstün</a> who are both fantastic and can share more about our work at both <a href="/Cohere_Labs/">Cohere Labs</a> and <a href="/cohere/">cohere</a>. 🔥✨
Kshitish Ghate (@ghatekshitish) 's Twitter Profile Photo

Excited to announce our #NAACL2025 Oral paper! 🎉✨ We carried out the largest systematic study so far to map the links between upstream choices, intrinsic bias, and downstream zero-shot performance across 131 CLIP Vision-language encoders, 26 datasets, and 55 architectures!

Excited to announce our #NAACL2025 Oral paper! 🎉✨   
We carried out the largest systematic study so far to map the links between upstream choices, intrinsic bias, and downstream zero-shot performance across 131 CLIP Vision-language encoders, 26 datasets, and 55 architectures!
Shivalika Singh (@singhshiviii) 's Twitter Profile Photo

LMArena is widely used for model evaluation, but is it measuring true progress? 🔮 In our work, "The Leaderboard Illusion", we reveal: 🔒 Private testing 📊 Data access asymmetries ⚠️ Overfitting risks 🚫 Silent deprecations Despite best intentions, arena policies favor a few!

LMArena is widely used for model evaluation, but is it measuring true progress? 🔮

In our work, "The Leaderboard Illusion", we reveal:
🔒 Private testing
📊 Data access asymmetries
⚠️ Overfitting risks
🚫 Silent deprecations

Despite best intentions, arena policies favor a few!
Sean Welleck (@wellecks) 's Twitter Profile Photo

Excited to teach Advanced NLP at CMU again this semester! Slides are on the course page as the course proceeds: cmu-l3.github.io/anlp-fall2025/ Lectures will be uploaded to Youtube: youtube.com/playlist?list=…

Excited to teach Advanced NLP at CMU again this semester!  

Slides are on the course page as the course proceeds: cmu-l3.github.io/anlp-fall2025/ 

Lectures will be uploaded to Youtube:
youtube.com/playlist?list=…