Jacob Beck (@jakeabeck) Twitter Tweets • TwiCopy

Jacob Beck

@jakeabeck

+ Follow

Let’s get agents to learn fast! 🤖🔥 Research Scientist @Oracle | PhD @UniOfOxford, MS & BS @BrownUniversity, Predoc @Microsoft

ID: 2423029148

linkhttp://jakebeck.com calendar_today02-04-2014 02:06:28

54 Tweet

276 Followers

72 Following

Luisa Zintgraf

@luisa_zintgraf

8 months ago

🎉 Our Meta-RL survey is now published in Foundations and Trends in Machine Learning! A deep dive into how agents can learn to learn 🤖🧠 Huge kudos to Jacob Beck & Risto Vuorio for leading the charge, and to co-authors Evan Liu, Zheng Xiong, Chelsea Finn & Shimon Whiteson!

thumb_up_off_alt52

chat_bubble_outline2

repeat11

shareShare

Jacob Beck

@jakeabeck

4 months ago

After 25 yrs of school, 5 yrs of research, and many dilapidated British pubs, my PhD on meta-RL is DONE! New chapter: I’ve joined Oracle as a Research Scientist in AI! Come say hi if you’re in Boston! Thesis: ora.ox.ac.uk/objects/uuid:2… Job: labs.oracle.com/pls/apex/f?p=9…

thumb_up_off_alt36

chat_bubble_outline7

repeat1

shareShare

Jacob Beck

@jakeabeck

4 months ago

Heading to @rl_conference (RLC) today! Come say hi if you’re around—happy to chat about meta-RL, coding agents, and my work at @RLBrew_RLC tomorrow on VLM feedback for RL 🍁

thumb_up_off_alt15

chat_bubble_outline1

repeat2

shareShare

Alex Goldie

@alexdgoldie

4 months ago

🥳 It’s an honour to have been awarded the Outstanding Paper for Scientific Understanding in RL at RLC for our work, ‘How Should We Meta-Learn RL Algorithms?’ Thank you to the organisers RL_Conference for putting on a great conference, and congratulations to the other winners!

thumb_up_off_alt203

chat_bubble_outline2

repeat19

shareShare

Jacob Beck

@jakeabeck

4 months ago

Hey I recognize those people! Alex Goldie @ RLC 25

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Jacob Beck

@jakeabeck

4 months ago

Thoughts on this? AI is trained to follow prompts, making it highly amenable to alignment. The only time we train it not to obey is for safety constraints, such as refusing to build a bioweapon. The real danger isn’t disobedient machines. It’s humans, misaligned with each other.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Jacob Beck

@jakeabeck

4 months ago

Thoughts on this? The possibility of exponential AI self-improvement is shaky. The real bottleneck isn’t code; it’s compute & data. In these areas, AIs training AIs are just as limited by the world as humans training AIs. For both, we’ve nearly exhausted the internet’s data.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

TalkRL Podcast

@talkrlpodcast

4 months ago

E71: Jake Beck, Alex Goldie, & Cornelius Braun on Sutton's OaK, Metalearning, LLMs, Squirrels @ RL_Conference 2025 A few thoughts with Jacob Beck, Alex Goldie @ RLC 25 and Cornelius Braun after Richard Sutton's fascinating lecture on his OaK architecture at University of Alberta

E71: Jake Beck, Alex Goldie, & Cornelius Braun on Sutton's OaK, Metalearning, LLMs, Squirrels @ <a href="/RL_Conference/">RL_Conference</a> 2025
A few thoughts with <a href="/jakeABeck/">Jacob Beck</a>, <a href="/AlexDGoldie/">Alex Goldie @ RLC 25</a> and <a href="/corbraun/">Cornelius Braun</a> after <a href="/RichardSSutton/">Richard Sutton</a>'s fascinating lecture on his OaK architecture at <a href="/UAlberta/">University of Alberta</a>

thumb_up_off_alt24

chat_bubble_outline2

repeat4

shareShare