Maharshi Gor (@maharshigor) 's Twitter Profile
Maharshi Gor

@maharshigor

Ph.D. student @umdcs @ClipUmd
NLP, Retrievers, Human-AI, Efficient Methods
Past: @ContextualAI @Cohere @GoogleAI @theteamatx 🚀
he/him 🏳️‍🌈

ID: 236863853

linkhttp://mgor.info calendar_today11-01-2011 15:00:21

293 Tweet

397 Followers

595 Following

Yapei Chang (@yapeichang) 's Twitter Profile Photo

🤔 Can simple string-matching metrics like BLEU rival reward models for LLM alignment? 🔍 We show that given access to a reference, BLEU can match reward models in human preference agreement, and even train LLMs competitively with them using GRPO. 🫐 Introducing BLEUBERI:

Dayeon (Zoey) Ki (@zoeykii) 's Twitter Profile Photo

1/ How can a monolingual English speaker 🇺🇸 decide if a French translation 🇫🇷 is good enough to be shared? Introducing ❓AskQE❓, an #LLM-based Question Generation + Answering framework that detects critical MT errors and provides actionable feedback 🗣️ #ACL2025

1/ How can a monolingual English speaker 🇺🇸 decide if a French translation 🇫🇷 is good enough to be shared? 

Introducing ❓AskQE❓, an #LLM-based Question Generation + Answering framework that detects critical MT errors and provides actionable feedback 🗣️ 

#ACL2025
Jordan Boyd-Graber (@boydgraber) 's Twitter Profile Photo

Do you like trivia? Can you spot when AI is feeding you BS? Or can you make AIs turn themselves inside out? Then on June 14 at College Park (or June 21 online), we have a competition for you.

Do you like trivia?  Can you spot when AI is feeding you BS?  Or can you make AIs turn themselves inside out?  Then on June 14 at College Park (or June 21 online), we have a competition for you.
Maharshi Gor (@maharshigor) 's Twitter Profile Photo

Very cool and much needed analysis. The counterintuitive scaling limit is very interesting, it's validating to my personal observations about usage of high order of think tokens for just a moderately complex task.

Jordan Boyd-Graber (@boydgraber) 's Twitter Profile Photo

Today is the (human) deadline to sign up for our Human-Computer trivia competition held on June 14, 2024 in College Park, MD. $150 prize for the team who can answer the most questions with the help of an AI.

Today is the (human) deadline to sign up for our Human-Computer trivia competition held on June 14, 2024 in College Park, MD. $150 prize for the team who can answer the most questions with the help of an AI.
Jordan Boyd-Graber (@boydgraber) 's Twitter Profile Photo

We had our first human–computer cooperative AI tournament at the UMD. Key takeaways: 1) computers are getting better at trivia 2) they still suck at calibration 3) our teaming mechanic kept the games competitive and mostly fun (at least that’s what the players said).

We had our first human–computer cooperative AI tournament at the UMD.  Key takeaways: 1) computers are getting better at trivia 2) they still suck at calibration 3) our teaming mechanic kept the games competitive and mostly fun (at least that’s what the players said).
Revanth Reddy (On the Job Market) (@gangi_official) 's Twitter Profile Photo

The models and code are now public! Models on HF: huggingface.co/collections/Sa… Code: github.com/SalesforceAIRe… Project Page: salesforceairesearch.github.io/SweRank/ If you are interesting in integrating the SweRank models as a plug-in within VS Code, please do reach out! We have more exciting

Dora Zhao (@dorazhao9) 's Twitter Profile Photo

While we’re building amazing new human-AI systems, how do we actually know if they work well for people? In our #ACL2025 Findings Paper, we introduce SPHERE, a framework for making evaluations of human-AI systems more transparent and replicable. ✨aclanthology.org/2025.findings-…

While we’re building amazing new human-AI systems, how do we actually know if they work well for people?

In our #ACL2025 Findings Paper, we introduce SPHERE, a framework for making evaluations of human-AI systems more transparent and replicable.
✨aclanthology.org/2025.findings-…
Julian Eisenschlos (@eisenjulian) 's Twitter Profile Photo

This week I'll be attending ACL in Vienna, presenting at poster sessions on Monday and Wednesday at 11am, and then giving a keynote talk at the Table Representation Learning workshop on Thursday. Come say hi!

Yoo Yeon Sung (@yooyeonsung1) 's Twitter Profile Photo

I’ll be presenting this work in Room 1.62 today! If you're curious about how calibration errors in LLMs can be measured through human calibration, come find me and Eve Fleisig @ ACL 2025! 📍Oral Session 3 - HC: Human-centered NLP 📅Monday, July 28@ 2PM

Rupak (@rupak_53) 's Twitter Profile Photo

Linguistic theory tells us that common ground is essential to conversational success. But to what extent is it essential? Can LLMs detect when humans lose common ground in conversation? Our ACL 2025 (Oral) paper explores these questions on real-world data. #ACL2025NLP #ACL2025

Linguistic theory tells us that common ground is essential to conversational success. But to what extent is it essential? Can LLMs detect when humans lose common ground in conversation?

Our ACL 2025 (Oral) paper explores these questions on real-world data.

#ACL2025NLP #ACL2025
Dayeon (Zoey) Ki (@zoeykii) 's Twitter Profile Photo

I'll also be presenting our paper on using question-answer pairs as a new signal for spotting translation errors 🕵️ Come to talk more about MT evaluation! 📍Poster session (Hall X4, X5) 📆Tuesday (7/29) 4-5:30pm 📝aclanthology.org/2025.findings-…

Neha Srikanth (@nehasrikanth) 's Twitter Profile Photo

When questions are poorly posed, how do humans vs. models handle them? Our #ACL2025 paper explores this + introduces a framework for detecting and analyzing poorly-posed information-seeking questions! Joint work with Jordan Boyd-Gräber befindet sich in Wien (ACL2025) & Rachel Rudinger! 🔗 aclanthology.org/2025.acl-long.…

When questions are poorly posed, how do humans vs. models handle them? Our #ACL2025 paper explores this + introduces a framework for detecting and analyzing poorly-posed information-seeking questions! 

Joint work with <a href="/boydgraber/">Jordan Boyd-Gräber befindet sich in Wien (ACL2025)</a> &amp; <a href="/rachelrudinger/">Rachel Rudinger</a>!

🔗 aclanthology.org/2025.acl-long.…