
Gal Yona
@_galyo
Research scientist @googleai, previously CS PhD @weizmannscience
ID: 86293812
https://galyona.github.io/ 30-10-2009 11:45:44
284 Tweet
474 Takipçi
486 Takip Edilen

Excited to attend EMNLP 2025 in Miami next week 🤩 DM me if you'd like to grab a coffee and chat about interpretability, knowledge, or reasoning in LLMs! Our group/collabs will be presenting a bunch of cool works, come check them out! 🧵


What prompt generated the image on the right? Come find out today at our tutorial on OOD generalization: Shortcuts, Spuriousness, and Stability @Maggiemakar aahlad puli Panel: Elan Rosenfeld Aditi Raghunathan Danica Sutherland







.Percy Liang & Tatsunori Hashimoto start the 2nd offering of CS336 Language Modeling from Scratch at Stanford NLP Group. The class philosophy is Understanding by Building. We need many people who understand the detailed design of modern LLMs, not just a few at “frontier” 🤭 AI companies.



Sam Altman the single biggest thing you could do for safety/alignment is to put a massive emphasis in the RL feedback loop on basic HONESTY and never misleading, tricking, overstating, exaggerating, etc. It should be like touching a hot stove to the model. Just like how you raise kids



new work by Gabrielle Kaili-May Liu shows that LLMs still struggle to faithfully express their uncertainty in words, but cool to see that meta cognitive inspired prompting can go a long way. looking forward to seeing more positive results on this fundamental problem!