
Shahan
@shahanmemon
phd @UW // visiting @nyuabudhabi // ex- @CarnegieMellon // researching if AI can do science // tweets about genAI, agents, science of science, @SciencePlusAI
ID: 134687994
http://samemon.github.io 19-04-2010 04:55:05
2,2K Tweet
948 Followers
2,2K Following

We are launching HALoGEN💡, a way to systematically study *when* and *why* LLMs still hallucinate. New work w/ Shrusti Ghela* David Wadden Yejin Choi 💫 🧵 [1/n]
![Abhilasha Ravichander (@lasha_nlp) on Twitter photo We are launching HALoGEN💡, a way to systematically study *when* and *why* LLMs still hallucinate.
New work w/ <a href="/shrusti_ghela/">Shrusti Ghela</a>* <a href="/davidjwadden/">David Wadden</a> <a href="/YejinChoinka/">Yejin Choi</a> 💫
🧵 [1/n] We are launching HALoGEN💡, a way to systematically study *when* and *why* LLMs still hallucinate.
New work w/ <a href="/shrusti_ghela/">Shrusti Ghela</a>* <a href="/davidjwadden/">David Wadden</a> <a href="/YejinChoinka/">Yejin Choi</a> 💫
🧵 [1/n]](https://pbs.twimg.com/media/GimvuBZbYAYyXqP.jpg)








🚨 Our latest paper is out today in Science! We uncover stark and systematic partisan differences in the amount, content, and character of science used in policy, which mirror differences in political elites’ trust in science. Four years in the making. Led by Zander Furnas 1/n






📢 New paper with Iason Gabriel is out! 2025 is being called the year of AI agents, with overwhelming headlines about them every day. But we lack a shared vocabulary to distinguish their fundamental properties. Our paper aims to bridge this gap. A 🧵







Seth Lazar It is necessary to invest in alternatives right now. Without it, we might see the worst aspects of our current platform economy amplified. We don't have all the answers in the paper, but we have a blueprint for where to start. Paper: arxiv.org/pdf/2505.04345