Yingchen Xu (@yingchenx) 's Twitter Profile
Yingchen Xu

@yingchenx

CS PhD at @ucl_dark and @MetaAI 👩‍💻
deep reinforcement learning | world models | reasoning & planning
🤖️🎨⛰️

ID: 1281109670761885697

linkhttps://ycxuyingchen.github.io/ calendar_today09-07-2020 06:15:32

137 Tweet

555 Takipçi

324 Takip Edilen

Nathan Herr (@naitherr) 's Twitter Profile Photo

Excited to introduce LLM-First Search (LFS) - a new paradigm where the language model takes the lead in reasoning and search! LFS is a self-directed search method that empowers LLMs to guide the exploration process themselves, without relying on predefined heuristics or fixed

Excited to introduce LLM-First Search (LFS) -  a new paradigm where the language model takes the lead in reasoning and search!

LFS is a self-directed search method that empowers LLMs to guide the exploration process themselves, without relying on predefined heuristics or fixed
Reinforcement Learning & Video Games Workshop @RLC (@rlvg2025) 's Twitter Profile Photo

We’re excited to announce our next speaker: Roberta Raileanu (Roberta Raileanu) from Google DeepMind! Roberta will discuss NetHack: A Grand Challenge for RL and LLM Agents Alike. ⚔️ Join us on August 5th to learn how to develop agents capable of tackling open-ended environments!

We’re excited to announce our next speaker: Roberta Raileanu (<a href="/robertarail/">Roberta Raileanu</a>) from <a href="/GoogleDeepMind/">Google DeepMind</a>!

Roberta will discuss NetHack: A Grand Challenge for RL and LLM Agents Alike. ⚔️

Join us on August 5th to learn how to develop agents capable of tackling open-ended environments!
Edward Grefenstette (@egrefen) 's Twitter Profile Photo

Do you have a PhD (or equivalent) or will have one in the coming months (i.e. 2-3 months away from graduating)? Do you want to help build open-ended agents that help humans do humans things better, rather than replace them? We're hiring 1-2 Research Scientists! Check the 🧵👇

Roberta Raileanu (@robertarail) 's Twitter Profile Photo

I’m building a new team at Google DeepMind to work on Open-Ended Discovery! We’re looking for strong Research Scientists and Research Engineers to help us push the frontier of autonomously discovering novel artifacts such as new knowledge, capabilities, or algorithms, in an

Harshit Sikchi (@harshit_sikchi) 's Twitter Profile Photo

We are hosting a social again this year at #RLC2025 (RL_Conference ) on August 5. Come to meet people pre-conference and find friends and collaborators. RSVP below if you can make it:

We are hosting a social again this year at #RLC2025 (<a href="/RL_Conference/">RL_Conference</a> ) on August 5.  Come to meet people pre-conference and find friends and collaborators. RSVP below if you can make it:
Harshit Sikchi (@harshit_sikchi) 's Twitter Profile Photo

I will be RL_Conference presenting the below work on Fast Adaptation on Wednesday August 6 at 10:20 am and some works on unsupervised RL and imitation at RLBrew workshop on August 5.

Roberta Raileanu (@robertarail) 's Twitter Profile Photo

Excited to be in Edmonton for the RL_Conference this week! Today I’ll be at Inductive Biases in RL and Reinforcement Learning & Video Games Workshop @RLC giving two workshop talks and participating in the panels. Stop by to say hello! 📍 Inductive Biases in RL Workshop ⏰ 9:15am 🤖 LLM Whispers: Injecting Human Priors into RL

Jack Parker-Holder (@jparkerholder) 's Twitter Profile Photo

Genie 3 feels like a watershed moment for world models 🌐: we can now generate multi-minute, real-time interactive simulations of any imaginable world. This could be the key missing piece for embodied AGI… and it can also create beautiful beaches with my dog, playable real time

Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

Harder, Better, Faster, Stronger, Real-time! We are excited to reveal Genie 3, our most capable real-time foundational world model. Fantastic cross-team effort led by Jack Parker-Holder and Shlomi Fruchter. Below some interactive worlds and capabilities that were highlights for me

Davide Paglieri (@paglieridavide) 's Twitter Profile Photo

"Always reasoning" (ReAct) isn't optimal for LLM agents! 🧠 Our new paper identifies a "Goldilocks" effect: planning too frequently or not enough degrades performance. We show how to train agents to learn to dynamically allocate test-time compute when needed for best results. 👇

Minqi Jiang (@minqijiang) 's Twitter Profile Photo

What if you kept asking an LLM to "make it better"? In some recent work at FAIR, we investigate how we can efficiently use RL to fine-tune LLMs to iteratively self-improve on their previous solutions at inference-time. Training for iterated self-improvement can be costly. The

Sakana AI (@sakanaailabs) 's Twitter Profile Photo

We’re excited to introduce ShinkaEvolve: An open-source framework that evolves programs for scientific discovery with unprecedented sample-efficiency. Blog: sakana.ai/shinka-evolve/ Code: github.com/SakanaAI/Shink… Like AlphaEvolve and its variants, our framework leverages LLMs to

Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

Proud to announce that Dr Laura Ruis defended her PhD thesis titled "Understanding and Evaluating Reasoning in Large Language Models" last week 🥳. Massive thanks to Noah Goodman and Emine Yilmaz for examining! As is customary, Laura received a personal mortarboard from

Proud to announce that Dr <a href="/LauraRuis/">Laura Ruis</a> defended her PhD thesis titled "Understanding and Evaluating Reasoning in Large Language Models" last week 🥳. Massive thanks to Noah Goodman and Emine Yilmaz for examining! As is customary, Laura received a personal mortarboard from
Sakana AI (@sakanaailabs) 's Twitter Profile Photo

We are excited to share that “Continuous Thought Machines” has been accepted as a Spotlight at #NeurIPS2025! 🧠✨ The CTM is an AI that mimics biological brains by using neural dynamics & synchronization to think over time. It can solve complex mazes by building internal maps,

A. H. Guzel (@ahguzeluk) 's Twitter Profile Photo

🎮 How can agents learn to generalize from limited offline data? We introduce iMac (Imagined Autocurricula) - training agents entirely in world models with emergent curricula!

Luke Darlow (@learningluked) 's Twitter Profile Photo

I had to share this stunning gif! Do Continuous Thought Machines dream dream of electric sheep...? This is a UMAP projection showing the neurons of a CTM firing while generating text (5 tokens, with time to think between). Do you see the emergence of FAST and SLOW thoughts?

hardmaru (@hardmaru) 's Twitter Profile Photo

Excited to announce our MIT Press book “Neuroevolution: Harnessing Creativity in AI Agent Design” by Sebastian Risi (Sebastian Risi), Yujin Tang (Yujin Tang), Risto Miikkulainen, and myself. We explore decades of work on evolving intelligent agents and shows how neuroevolution can

Sebastian Risi (@risi1979) 's Twitter Profile Photo

I’m beyond excited to announce our MIT Press book on Neuroevolution! An HTML version is now available for free on neuroevolutionbook.com, with a print edition coming out later in 2026. Real intelligence is not static; it evolves. For decades, the field of neuroevolution has

I’m beyond excited to announce our MIT Press book on Neuroevolution! An HTML version is now available for free on neuroevolutionbook.com, with a print edition coming out later in 2026.

Real intelligence is not static; it evolves. For decades, the field of neuroevolution has
Laura Ruis (@lauraruis) 's Twitter Profile Photo

Apply to do research with me on emergence of agency/planning in LLMs, out-of-context reasoning, understanding generalization from data, or propose your own direction! Very excited to be mentoring this spring 💫