Devansh Jain
@devanshrjain
model routers @ ยฌโ | ai safety @LTIatCMU | ex cs @bitspilaniindia
ID: 3301689482
30-07-2015 16:08:00
89 Tweet
166 Followers
816 Following
๐งต Multilingual safety training/eval is now standard practice, but a critical question remains: Is multilingual safety actually solved? Our new survey with Cohere Labs answers this and dives deep into: - Language gap in safety research - Future priority areas Thread ๐
Thanks Language Technologies Institute | @CarnegieMellon and CMU School of Computer Science for featuring our work!!โจ๐ซ Our paper on culturally offensive nonverbal gestures is accepted to #ACL2025 main! Detailed thread๐งต: x.com/akhila_yerukolโฆ Preprint๐: arxiv.org/abs/2502.17710 Work done with Saadia Gabriel Violet Peng Maarten Sap (he/him)
Day 3 (Thu Oct 9), 11:00amโ1:00pm, Poster Session 5 Poster #13: PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages โ led by Priyanshu Kumar, Devansh Jain Poster #74: Fluid Language Model Benchmarking โ led by Valentin Hofmann
(Thu Oct 9, 11:00amโ1:00pm) Poster Session 5 ๐๐จ๐ฌ๐ญ๐๐ซ #๐๐: PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages; w/ amazing Priyanshu Kumar, Devansh Jain PolyGuard is among the SOTA multilingual safety moderation tool + we release comprehensive multilingual
While Sonnet-4.5 remains a popular choice among developers, our benchmarks show it underperforms GPT-5 on SRE-related tasks when both are run with default parameters. However, using the Not Diamond prompt adaptation platform, Sonnet-4.5 achieved up to a 2x performance