Theoretical systems neuroscience at bcf.uni-freiburg.de. Color-blind super-scientist Mary trying to navigate the career path. Looking for a job.
ID: 2983142458
http://xiaoxionglin.com 17-01-2015 21:00:28
383 Tweet
170 Followers
316 Following

Can LLMs do reinforcement learning in-context - and if so, how do they do it? Using Sparse Autoencoders, we find that Llama 3 relies on representations resembling TD errors, Q-values and even the SR to learn in three RL tasks in-context! Co-lead with the inimitable Can Demircan












Kenneth Stanley Yes, RL can drive open-ended novelty search. It’s finally time to make self-driven, sustained discovery a reality



