
Yoo Yeon Sung
@yooyeonsung1
PhD Student @umdclip @iSchoolUMD; Human-AI Alignment, human-interactable NLP, LLM evaluation, benchmark creation, misinformation
ID: 1372646134078971916
https://yysung.github.io/ 18-03-2021 20:28:42
56 Tweet
275 Followers
479 Following






Turns out, LLMs are pretty bad at balancing debatable queries in summarization. Ironically, when I ask Google if LLMs can do this, it says "Yes" without fully covering the other side. Excited to present my summer work with Adobe Research at #NAACL2025, where we fix this!


I'll be presenting this work with Rachel Rudinger at #NAACL2025 tomorrow (Wednesday 4/30) during Session C (Oral/Poster 2) at 2pm! Decomposing hypotheses in traditional NLI and defeasible NLI helps us measure various forms of consistency of LLMs. Come join us!


I'll be presenting two papers at NAACL HLT 2025! 1. Why LLMs can't write a question with the answer "468" 2. A multi-agent LLM that balances opinions on "is pineapple good on pizza?" Let's also chat about: helpfulness, why MCQA sucks, and generating cute paper titles.





It's that time again: QANTA 2025! This year's theme: Cooperation among humans + agents + QA models! We invite you to: submit models, write questions to stump them all, and play as human opponents. It's my 3rd year hosting QANTA, always surprises in human-AI eval. Stay tuned!

