Yingchen Xu (@yingchenx) Twitter Tweets • TwiCopy

Nathan Herr

10 months ago

Excited to introduce LLM-First Search (LFS) - a new paradigm where the language model takes the lead in reasoning and search! LFS is a self-directed search method that empowers LLMs to guide the exploration process themselves, without relying on predefined heuristics or fixed

thumb_up_off_alt119

chat_bubble_outline2

repeat21

shareShare

Reinforcement Learning & Video Games Workshop @RLC

@rlvg2025

9 months ago

We’re excited to announce our next speaker: Roberta Raileanu (Roberta Raileanu) from Google DeepMind! Roberta will discuss NetHack: A Grand Challenge for RL and LLM Agents Alike. ⚔️ Join us on August 5th to learn how to develop agents capable of tackling open-ended environments!

We’re excited to announce our next speaker: Roberta Raileanu (<a href="/robertarail/">Roberta Raileanu</a>) from <a href="/GoogleDeepMind/">Google DeepMind</a>!

Roberta will discuss NetHack: A Grand Challenge for RL and LLM Agents Alike. ⚔️

Join us on August 5th to learn how to develop agents capable of tackling open-ended environments!

thumb_up_off_alt105

chat_bubble_outline3

repeat9

shareShare

Edward Grefenstette

@egrefen

8 months ago

Do you have a PhD (or equivalent) or will have one in the coming months (i.e. 2-3 months away from graduating)? Do you want to help build open-ended agents that help humans do humans things better, rather than replace them? We're hiring 1-2 Research Scientists! Check the 🧵👇

thumb_up_off_alt339

chat_bubble_outline9

repeat38

shareShare

Roberta Raileanu

@robertarail

8 months ago

I’m building a new team at Google DeepMind to work on Open-Ended Discovery! We’re looking for strong Research Scientists and Research Engineers to help us push the frontier of autonomously discovering novel artifacts such as new knowledge, capabilities, or algorithms, in an

thumb_up_off_alt2,2K

chat_bubble_outline74

repeat222

shareShare

Harshit Sikchi

@harshit_sikchi

8 months ago

We are hosting a social again this year at #RLC2025 (RL_Conference ) on August 5. Come to meet people pre-conference and find friends and collaborators. RSVP below if you can make it:

We are hosting a social again this year at #RLC2025 (<a href="/RL_Conference/">RL_Conference</a> ) on August 5. Come to meet people pre-conference and find friends and collaborators. RSVP below if you can make it:

thumb_up_off_alt39

chat_bubble_outline1

repeat7

shareShare

Harshit Sikchi

@harshit_sikchi

8 months ago

I will be RL_Conference presenting the below work on Fast Adaptation on Wednesday August 6 at 10:20 am and some works on unsupervised RL and imitation at RLBrew workshop on August 5.

thumb_up_off_alt107

chat_bubble_outline3

repeat10

shareShare

Roberta Raileanu

@robertarail

8 months ago

Excited to be in Edmonton for the RL_Conference this week! Today I’ll be at Inductive Biases in RL and Reinforcement Learning & Video Games Workshop @RLC giving two workshop talks and participating in the panels. Stop by to say hello! 📍 Inductive Biases in RL Workshop ⏰ 9:15am 🤖 LLM Whispers: Injecting Human Priors into RL

thumb_up_off_alt57

chat_bubble_outline1

repeat7

shareShare

Jack Parker-Holder

@jparkerholder

8 months ago

Genie 3 feels like a watershed moment for world models 🌐: we can now generate multi-minute, real-time interactive simulations of any imaginable world. This could be the key missing piece for embodied AGI… and it can also create beautiful beaches with my dog, playable real time

thumb_up_off_alt4,4K

chat_bubble_outline217

repeat456

shareShare

Tim Rocktäschel

@_rockt

8 months ago

Harder, Better, Faster, Stronger, Real-time! We are excited to reveal Genie 3, our most capable real-time foundational world model. Fantastic cross-team effort led by Jack Parker-Holder and Shlomi Fruchter. Below some interactive worlds and capabilities that were highlights for me

thumb_up_off_alt1,1K

chat_bubble_outline36

repeat151

shareShare

Davide Paglieri

@paglieridavide

7 months ago

"Always reasoning" (ReAct) isn't optimal for LLM agents! 🧠 Our new paper identifies a "Goldilocks" effect: planning too frequently or not enough degrades performance. We show how to train agents to learn to dynamically allocate test-time compute when needed for best results. 👇

thumb_up_off_alt89

chat_bubble_outline2

repeat18

shareShare

Minqi Jiang

@minqijiang

7 months ago

What if you kept asking an LLM to "make it better"? In some recent work at FAIR, we investigate how we can efficiently use RL to fine-tune LLMs to iteratively self-improve on their previous solutions at inference-time. Training for iterated self-improvement can be costly. The

thumb_up_off_alt405

chat_bubble_outline14

repeat75

shareShare

Sakana AI

@sakanaailabs

6 months ago

We’re excited to introduce ShinkaEvolve: An open-source framework that evolves programs for scientific discovery with unprecedented sample-efficiency. Blog: sakana.ai/shinka-evolve/ Code: github.com/SakanaAI/Shink… Like AlphaEvolve and its variants, our framework leverages LLMs to

thumb_up_off_alt1,1K

chat_bubble_outline24

repeat227

shareShare

Tim Rocktäschel

@_rockt

6 months ago

Proud to announce that Dr Laura Ruis defended her PhD thesis titled "Understanding and Evaluating Reasoning in Large Language Models" last week 🥳. Massive thanks to Noah Goodman and Emine Yilmaz for examining! As is customary, Laura received a personal mortarboard from

Proud to announce that Dr <a href="/LauraRuis/">Laura Ruis</a> defended her PhD thesis titled "Understanding and Evaluating Reasoning in Large Language Models" last week 🥳. Massive thanks to Noah Goodman and Emine Yilmaz for examining! As is customary, Laura received a personal mortarboard from

thumb_up_off_alt90

chat_bubble_outline6

repeat12

shareShare

Sakana AI

@sakanaailabs

6 months ago

We are excited to share that “Continuous Thought Machines” has been accepted as a Spotlight at #NeurIPS2025! 🧠✨ The CTM is an AI that mimics biological brains by using neural dynamics & synchronization to think over time. It can solve complex mazes by building internal maps,

thumb_up_off_alt580

chat_bubble_outline9

repeat73

shareShare

A. H. Guzel

@ahguzeluk

6 months ago

🎮 How can agents learn to generalize from limited offline data? We introduce iMac (Imagined Autocurricula) - training agents entirely in world models with emergent curricula!

thumb_up_off_alt76

chat_bubble_outline1

repeat19

shareShare

Luke Darlow

@learningluked

5 months ago

I had to share this stunning gif! Do Continuous Thought Machines dream dream of electric sheep...? This is a UMAP projection showing the neurons of a CTM firing while generating text (5 tokens, with time to think between). Do you see the emergence of FAST and SLOW thoughts?

thumb_up_off_alt73

chat_bubble_outline3

repeat8

shareShare

hardmaru

@hardmaru

4 months ago

Excited to announce our MIT Press book “Neuroevolution: Harnessing Creativity in AI Agent Design” by Sebastian Risi (Sebastian Risi), Yujin Tang (Yujin Tang), Risto Miikkulainen, and myself. We explore decades of work on evolving intelligent agents and shows how neuroevolution can

thumb_up_off_alt1,1K

chat_bubble_outline16

repeat225

shareShare

Sebastian Risi

@risi1979

4 months ago

I’m beyond excited to announce our MIT Press book on Neuroevolution! An HTML version is now available for free on neuroevolutionbook.com, with a print edition coming out later in 2026. Real intelligence is not static; it evolves. For decades, the field of neuroevolution has

thumb_up_off_alt551

chat_bubble_outline20

repeat156

shareShare

Laura Ruis

@lauraruis

4 months ago

Apply to do research with me on emergence of agency/planning in LLMs, out-of-context reasoning, understanding generalization from data, or propose your own direction! Very excited to be mentoring this spring 💫

thumb_up_off_alt195

chat_bubble_outline4

repeat21

shareShare

Yingchen Xu

@yingchenx

2 months ago

Huge congrats, Laura!! 🥳🎓Super excited to read the thesis, what a timely drop 🔥

thumb_up_off_alt6

chat_bubble_outline1

repeat0

shareShare