Anwoy Chatterjee (@anwoy_) 's Twitter Profile
Anwoy Chatterjee

@anwoy_

PhDing @iitdelhi @lcs2lab โ€ข Google PhD Fellow in #NLProc โ€ข Ex-Research Intern @adobe โ€ข Prev: BTech in CS @iitbhu_varanasi

Views are personal

ID: 1634557860934721536

linkhttps://c-anwoy.github.io/ calendar_today11-03-2023 14:12:31

77 Tweet

95 Followers

555 Following

Anwoy Chatterjee (@anwoy_) 's Twitter Profile Photo

Good to see discussions around prompt sensitivity & challenges in reporting optimized baselines! Our POSIX work feels even more relevant now. Reporting a POSIX score alongside performance numbers would bring much-needed perspective. CC: Omar Khattab Rulin Shao Kowndinya Renduchintala

LCS2 Lab (@lcs2lab) 's Twitter Profile Photo

๐Ÿšจ New #TACL Paper Alert ๐Ÿšจ We explore a crucial question in instruction tuning: should we weight prompt and response tokens differently in the loss function? Introducing Weighted Instruction Tuning - a simple idea that boosts generalization by up to +6.55% across 5 benchmarks!

๐Ÿšจ New #TACL Paper Alert ๐Ÿšจ
We explore a crucial question in instruction tuning: should we weight prompt and response tokens differently in the loss function?
Introducing Weighted Instruction Tuning - a simple idea that boosts generalization by up to +6.55% across 5 benchmarks!
LCS2 Lab (@lcs2lab) 's Twitter Profile Photo

๐Ÿ“ On the Effect of Instruction Tuning Loss on Generalization ๐Ÿ‘ฅ Anwoy Chatterjee Anwoy Chatterjee, Kowndinya Renduchintala Kowndinya Renduchintala, Sumit Bhatia, Tanmoy Chakraborty Tanmoy Chakraborty ๐Ÿ’พ Code: github.com/kowndinya-rendโ€ฆ arxiv.org/abs/2507.07817 #MachineLearning #NLP #LLMs #TACL

LCS2 Lab (@lcs2lab) 's Twitter Profile Photo

๐Ÿš€ LCS2 Sneak Peek Series for #EMNLP2025 ๐Ÿš€ ๐Ÿ“ Do You Know About My Nation? Investigating Multilingual Language Modelsโ€™ Cultural Literacy Through Factual Knowledge ๐Ÿ‘ฅ @Eshaant2000, Anwoy Chatterjee, Michael Saxon, Alon Albalak, William Yang Wang, Tanmoy Chakraborty ๐Ÿ“Œ Paper: aclanthology.org/2025.emnlp-maiโ€ฆ

๐Ÿš€ LCS2 Sneak Peek Series for #EMNLP2025 ๐Ÿš€
๐Ÿ“ Do You Know About My Nation? Investigating Multilingual Language Modelsโ€™ Cultural Literacy Through Factual Knowledge
๐Ÿ‘ฅ @Eshaant2000, <a href="/anwoy_/">Anwoy Chatterjee</a>, <a href="/m2saxon/">Michael Saxon</a>, <a href="/AlbalakAlon/">Alon Albalak</a>, William Yang Wang, <a href="/Tanmoy_Chak/">Tanmoy Chakraborty</a> 
๐Ÿ“Œ Paper: aclanthology.org/2025.emnlp-maiโ€ฆ
Michael Saxon (@m2saxon) 's Twitter Profile Photo

๐Ÿ†• at #EMNLP: Will a LM be better at answering questions about Germany in German than in French? Is national knowledge linguistically contingent? Interestingly, only for some multilingual models is this true. Aya knows China best in Chinese, but LLaMA's best in English always.

๐Ÿ†• at #EMNLP: Will a LM be better at answering questions about Germany in German than in French? Is national knowledge linguistically contingent?

Interestingly, only for some multilingual models is this true. Aya knows China best in Chinese, but LLaMA's best in English always.
Anwoy Chatterjee (@anwoy_) 's Twitter Profile Photo

I will be virtually presenting our #EMNLP2025 paper at 10:30 am tomorrow, during the Computational Social Science 2 Oral Session. ๐Ÿ—“๏ธ Nov 6 โฒ๏ธ 10:30 AM ๐Ÿ“Œ A109 TL;DR: We introduce the XNationQA - a parallel multilingual and multicultural benchmark to evaluate the cultural

Neel Somani (@neelsalami) 's Twitter Profile Photo

"Of course you think SAEs are feature detectors, you're a first year interpretability researcher, just posted to a safety community, LessWrong probably. You'll talk about circuits until you read Neel Nanda

"Of course you think SAEs are feature detectors, you're a first year interpretability researcher, just posted to a safety community, LessWrong probably. You'll talk about circuits until you read Neel Nanda
Devi Parikh (@deviparikh) 's Twitter Profile Photo

Just because an AI wrote it, doesnโ€™t mean I donโ€™t mean it. Just because I used AI, doesnโ€™t mean I donโ€™t care. Itโ€™s like saying, โ€œIf you really cared about this contract, youโ€™d do the budget math by hand.โ€ Me being efficient with my time doesnโ€™t mean I respect yours less. All

Gabriele Berton (@gabriberton) 's Twitter Profile Photo

Repeat after me: LLM detectors don't work. AI image detectors are bound to fail. Despite what some companies claim, AI and non-AI data will eventually be indistinguishable

Pasquale Minervini is hiring postdocs! ๐Ÿš€ (@pminervini) 's Twitter Profile Photo

Unpopular opinion -- I don't think we should accept papers in top AI conferences whose contribution is mainly experimental but don't provide the data and tools to reproduce the experiments. I have seen several papers with flawed results, but without code there's no way of telling

Peyman Milanfar (@docmilanfar) 's Twitter Profile Photo

stop acting like the em-dash is proof of AI. itโ€™s called grammar โ€” people have been using it to great effect for hundreds of years. read a book for once

Anwoy Chatterjee (@anwoy_) 's Twitter Profile Photo

It feels so disappointing when you meticulously read the author rebuttals for the papers youโ€™re reviewing, responding to them and engaging with the authors only to get ghosted by ALL the reviewers on your own paper as an author ACLRollingReview ACL 2026

Tanmoy Chakraborty (@tanmoy_chak) 's Twitter Profile Photo

Six papers from our lab have been accepted for publication in #ACL2026. The papers cover topics including Interpretability, empowering small VLMs with advanced tool calling, LLM personalisation, and different benchmarking. #nlproc ACL 2026

Six papers from our lab have been accepted for publication in #ACL2026. The papers cover topics including Interpretability, empowering small VLMs with advanced tool calling, LLM personalisation, and different benchmarking.

#nlproc <a href="/aclmeeting/">ACL 2026</a>