Pouya Pezeshkpour (@ppezeshkpour)'s Twitter Profile
Pouya Pezeshkpour

@ppezeshkpour

Research Scientist at @MegagonLabs, working on NLP/ML || PhD from UCI, and former research intern at @MSFTResearch, @Apple, @allen_ai, and @FujitsuAmerica

ID: 1044705848990228480

https://pouyapez.github.io | 25-09-2018 21:50:59

144 Tweets

327 Followers

271 Following

Megagon Labs (@megagonlabs)

📊 Did you know? LLMs show up to a 75% performance variation based on option order in multiple-choice tasks. Pouya Pezeshkpour and estevamhruschka uncover the intricacies of model behavior in multiple-choice tasks. #AI #NLP #LLMs #NAACL2024 NAACL megagon.ai/order-matters-…

Tamanna Hossain-Kay (@thossainkay)

1/ 🌈Misgendering causes real harm & is often overlooked in NLP. Based on a community survey, Sunipa Dev, sameer & I introduce the 1st interventions for misgendering task and eval dataset, MisgenderMender, in our #NAACL2024 paper: shorturl.at/MdzHv 🗣️: 6/18 Don Diego 9am!

Pouya Pezeshkpour (@ppezeshkpour)

Are you interested in LLM hallucinations in multi-document summarization? We find that: LLMs hallucinate a lot, and often towards the end of summaries; Errors arise from ignored instructions or offering generic insights; post-hoc mitigation methods are not very effective. 👇

Megagon Labs (@megagonlabs)

Search engines, hiring platforms, and rec systems all rely on ranking. Traditional ranking methods struggle with multiple conflicting priorities, so we created Multi-Conditional Ranking, MCRank Bench, and EXSIR 🧵megagon.ai/rethinking-ran… Pouya Pezeshkpour estevamhruschka #NAACL2025