Anwoy Chatterjee (@anwoy_) Twitter Tweets • TwiCopy

Anwoy Chatterjee

@anwoy_

PhDing @iitdelhi @lcs2lab • Google PhD Fellow in #NLProc • Ex-Research Intern @adobe • Prev: BTech in CS @iitbhu_varanasi

Views are personal

ID: 1634557860934721536

https://c-anwoy.github.io/ · 11-03-2023 14:12:31

77 Tweets

95 Followers

555 Following

Anwoy Chatterjee

@anwoy_

a year ago

Good to see discussions around prompt sensitivity & challenges in reporting optimized baselines! Our POSIX work feels even more relevant now. Reporting a POSIX score alongside performance numbers would bring much-needed perspective. CC: Omar Khattab, Rulin Shao, Kowndinya Renduchintala

Likes: 3 · Replies: 1 · Retweets: 0

LCS2 Lab

@lcs2lab

10 months ago

🚨 New #TACL Paper Alert 🚨 We explore a crucial question in instruction tuning: should we weight prompt and response tokens differently in the loss function? Introducing Weighted Instruction Tuning - a simple idea that boosts generalization by up to +6.55% across 5 benchmarks!

Likes: 5 · Replies: 1 · Retweets: 2

LCS2 Lab

@lcs2lab

10 months ago

📝 On the Effect of Instruction Tuning Loss on Generalization 👥 Anwoy Chatterjee, Kowndinya Renduchintala, Sumit Bhatia, Tanmoy Chakraborty 💾 Code: github.com/kowndinya-rend… arxiv.org/abs/2507.07817 #MachineLearning #NLP #LLMs #TACL

Likes: 7 · Replies: 0 · Retweets: 2

LCS2 Lab

@lcs2lab

6 months ago

🚀 LCS2 Sneak Peek Series for #EMNLP2025 🚀 📝 Do You Know About My Nation? Investigating Multilingual Language Models’ Cultural Literacy Through Factual Knowledge 👥 @Eshaant2000, Anwoy Chatterjee, Michael Saxon, Alon Albalak, William Yang Wang, Tanmoy Chakraborty 📌 Paper: aclanthology.org/2025.emnlp-mai…

Likes: 15 · Replies: 1 · Retweets: 2

Michael Saxon

@m2saxon

6 months ago

🆕 at #EMNLP: Will a LM be better at answering questions about Germany in German than in French? Is national knowledge linguistically contingent? Interestingly, only for some multilingual models is this true. Aya knows China best in Chinese, but LLaMA's best in English always.

Likes: 29 · Replies: 0 · Retweets: 4

Anwoy Chatterjee

@anwoy_

6 months ago

I will be virtually presenting our #EMNLP2025 paper at 10:30 am tomorrow, during the Computational Social Science 2 Oral Session. 🗓️ Nov 6 ⏲️ 10:30 AM 📌 A109 TL;DR: We introduce the XNationQA - a parallel multilingual and multicultural benchmark to evaluate the cultural

Likes: 6 · Replies: 0 · Retweets: 1

Neel Somani

@neelsalami

4 months ago

"Of course you think SAEs are feature detectors, you're a first year interpretability researcher, just posted to a safety community, LessWrong probably. You'll talk about circuits until you read Neel Nanda

Likes: 1,1K · Replies: 26 · Retweets: 49

Devi Parikh

@deviparikh

4 months ago

Just because an AI wrote it, doesn’t mean I don’t mean it. Just because I used AI, doesn’t mean I don’t care. It’s like saying, “If you really cared about this contract, you’d do the budget math by hand.” Me being efficient with my time doesn’t mean I respect yours less. All

Likes: 177 · Replies: 20 · Retweets: 17

Gabriele Berton

@gabriberton

4 months ago

Repeat after me: LLM detectors don't work. AI image detectors are bound to fail. Despite what some companies claim, AI and non-AI data will eventually be indistinguishable

Likes: 424 · Replies: 69 · Retweets: 20

Pasquale Minervini is hiring postdocs! 🚀

@pminervini

4 months ago

Unpopular opinion -- I don't think we should accept papers in top AI conferences whose contribution is mainly experimental but don't provide the data and tools to reproduce the experiments. I have seen several papers with flawed results, but without code there's no way of telling

Likes: 71 · Replies: 8 · Retweets: 6

Peyman Milanfar

@docmilanfar

4 months ago

stop acting like the em-dash is proof of AI. it’s called grammar — people have been using it to great effect for hundreds of years. read a book for once

Likes: 4,4K · Replies: 503 · Retweets: 878

Prof Lennart Nacke, PhD

@acagamic

4 months ago

Most researchers collect data first, then wonder why nobody reads their papers. Start with the story. The data comes after.

Likes: 20,20K · Replies: 102 · Retweets: 3,3K

Anwoy Chatterjee

@anwoy_

3 months ago

It feels so disappointing when you meticulously read the author rebuttals for the papers you’re reviewing, responding to them and engaging with the authors only to get ghosted by ALL the reviewers on your own paper as an author ACLRollingReview ACL 2026

Likes: 16 · Replies: 1 · Retweets: 1

Anwoy Chatterjee

@anwoy_

a month ago

1/1 accepted at #ACL2026 Mains. See you in San Diego 🇺🇸 More details and preprint soon …

Likes: 29 · Replies: 0 · Retweets: 3

Tanmoy Chakraborty

@tanmoy_chak

a month ago

Six papers from our lab have been accepted for publication in #ACL2026. The papers cover topics including Interpretability, empowering small VLMs with advanced tool calling, LLM personalisation, and different benchmarking. #nlproc ACL 2026

Likes: 78 · Replies: 1 · Retweets: 5