Ekaterina Lobacheva
@katelobacheva
Postdoc @Mila_Quebec @UMontreal
Like to explain unexpected behavior of neural nets 🤯
ID: 1069184356533583872
https://tipt0p.github.io/ 02-12-2018 10:59:49
80 Tweet
513 Takipçi
356 Takip Edilen
🚨 We’re recruiting new students for Fall 2026 — come join Chandar Lab! 🚨
LLMs memorize a lot of training data, but memorization is poorly understood. Where does it live inside models? How is it stored? How much is it involved in different tasks? Jack Merullo & Srihita Vatsavaya's new paper examines all of these questions using loss curvature! (1/7)
New research: are prompting and activation steering just two sides of the same coin? Eric Bigelow Daniel Wurgaft Ekdeep Singh and coauthors argue they are: ICL and steering have formally equivalent effects. (1/4)