
Will Timkey
@wtimkey8
aspiring computational psycholinguist | (first gen!) PhD student @nyuling | transit & urbanism enthusiast
(he/they)
ID: 2911919954
http://wtimkey.github.io 27-11-2014 13:38:06
431 Tweet
279 Followers
255 Following

Do deep transformer LMs generalize better? In a new preprint we (Sjoerd van Steenkiste Ishita Dasgupta Fei Sha Dan Garrette Tal Linzen) control for parameter count to show how depth helps models on compositional generalization tasks, but diminishingly so🧵 📜 jpetty.org/depth



🧙‍♀️ I'm hoping to recruit ~1 PhD student this cycle through BU Linguistics! Students who are broadly interested in meaning and computational models would be a good fit. I'll mention a few specific topics I've been working on & looking to expand below:







A bit late, but I'll be at #HSP2024 this year! Alongside some incredible coauthors (@psydock112 Suhas Arehalli Grusha Prasad @linguistbrian Tal Linzen), I'll be presenting a poster about which eye tracking measures LM surprisal does and doesn't explain in garden path sentences.







