Maharshi Gor (@maharshigor) Twitter Tweets • TwiCopy

Maharshi Gor

@maharshigor

+ Follow

Ph.D. student @umdcs @ClipUmd
NLP, Retrievers, Human-AI, Efficient Methods
Past: @ContextualAI @Cohere @GoogleAI @theteamatx 🚀
he/him 🏳️‍🌈

ID: 236863853

linkhttp://mgor.info calendar_today11-01-2011 15:00:21

293 Tweet

397 Followers

595 Following

Yapei Chang

@yapeichang

7 months ago

🤔 Can simple string-matching metrics like BLEU rival reward models for LLM alignment? 🔍 We show that given access to a reference, BLEU can match reward models in human preference agreement, and even train LLMs competitively with them using GRPO. 🫐 Introducing BLEUBERI:

thumb_up_off_alt191

chat_bubble_outline6

repeat41

shareShare

Dayeon (Zoey) Ki

@zoeykii

7 months ago

1/ How can a monolingual English speaker 🇺🇸 decide if a French translation 🇫🇷 is good enough to be shared? Introducing ❓AskQE❓, an #LLM-based Question Generation + Answering framework that detects critical MT errors and provides actionable feedback 🗣️ #ACL2025

thumb_up_off_alt43

chat_bubble_outline1

repeat16

shareShare

Jordan Boyd-Graber

@boydgraber

6 months ago

Do you like trivia? Can you spot when AI is feeding you BS? Or can you make AIs turn themselves inside out? Then on June 14 at College Park (or June 21 online), we have a competition for you.

thumb_up_off_alt29

chat_bubble_outline1

repeat7

shareShare

Maharshi Gor

@maharshigor

6 months ago

Very cool and much needed analysis. The counterintuitive scaling limit is very interesting, it's validating to my personal observations about usage of high order of think tokens for just a moderately complex task.

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Jordan Boyd-Graber

@boydgraber

6 months ago

Today is the (human) deadline to sign up for our Human-Computer trivia competition held on June 14, 2024 in College Park, MD. $150 prize for the team who can answer the most questions with the help of an AI.

thumb_up_off_alt11

chat_bubble_outline1

repeat6

shareShare

Jordan Boyd-Graber

@boydgraber

6 months ago

We had our first human–computer cooperative AI tournament at the UMD. Key takeaways: 1) computers are getting better at trivia 2) they still suck at calibration 3) our teaming mechanic kept the games competitive and mostly fun (at least that’s what the players said).

thumb_up_off_alt24

chat_bubble_outline2

repeat7

shareShare

Revanth Reddy (On the Job Market)

@gangi_official

5 months ago

The models and code are now public! Models on HF: huggingface.co/collections/Sa… Code: github.com/SalesforceAIRe… Project Page: salesforceairesearch.github.io/SweRank/ If you are interesting in integrating the SweRank models as a plug-in within VS Code, please do reach out! We have more exciting

thumb_up_off_alt24

chat_bubble_outline0

repeat9

shareShare

Maharshi Gor

@maharshigor

5 months ago

Woohoo! A must-go talk and a well-deserved work by Nishant to be selected for oral.

thumb_up_off_alt3

chat_bubble_outline1

repeat0

shareShare

Dora Zhao

@dorazhao9

4 months ago

While we’re building amazing new human-AI systems, how do we actually know if they work well for people? In our #ACL2025 Findings Paper, we introduce SPHERE, a framework for making evaluations of human-AI systems more transparent and replicable. ✨aclanthology.org/2025.findings-…

thumb_up_off_alt89

chat_bubble_outline1

repeat26

shareShare

Julian Eisenschlos

@eisenjulian

4 months ago

This week I'll be attending ACL in Vienna, presenting at poster sessions on Monday and Wednesday at 11am, and then giving a keynote talk at the Table Representation Learning workshop on Thursday. Come say hi!

thumb_up_off_alt14

chat_bubble_outline1

repeat3

shareShare

Yoo Yeon Sung

@yooyeonsung1

4 months ago

I’ll be presenting this work in Room 1.62 today! If you're curious about how calibration errors in LLMs can be measured through human calibration, come find me and Eve Fleisig @ ACL 2025! 📍Oral Session 3 - HC: Human-centered NLP 📅Monday, July 28@ 2PM

thumb_up_off_alt14

chat_bubble_outline0

repeat4

shareShare

Rupak

@rupak_53

4 months ago

Linguistic theory tells us that common ground is essential to conversational success. But to what extent is it essential? Can LLMs detect when humans lose common ground in conversation? Our ACL 2025 (Oral) paper explores these questions on real-world data. #ACL2025NLP #ACL2025

thumb_up_off_alt27

chat_bubble_outline1

repeat7

shareShare

Dayeon (Zoey) Ki

@zoeykii

4 months ago

I'll also be presenting our paper on using question-answer pairs as a new signal for spotting translation errors 🕵️ Come to talk more about MT evaluation! 📍Poster session (Hall X4, X5) 📆Tuesday (7/29) 4-5:30pm 📝aclanthology.org/2025.findings-…

thumb_up_off_alt34

chat_bubble_outline1

repeat4

shareShare

Neha Srikanth

@nehasrikanth

4 months ago

When questions are poorly posed, how do humans vs. models handle them? Our #ACL2025 paper explores this + introduces a framework for detecting and analyzing poorly-posed information-seeking questions! Joint work with Jordan Boyd-Gräber befindet sich in Wien (ACL2025) & Rachel Rudinger! 🔗 aclanthology.org/2025.acl-long.…

thumb_up_off_alt57

chat_bubble_outline6

repeat12

shareShare