Yoonho Lee (@yoonholeee) Twitter Tweets • TwiCopy

Yoonho Lee

@yoonholeee

+ Follow

ML PhD student @StanfordAILab

ID: 837865857606795264

linkhttp://cs.stanford.edu/~yoonho calendar_today04-03-2017 03:22:42

137 Tweet

876 Takipçi

447 Takip Edilen

good girl

@goodgirlxsz

5 hours ago

🔥Telegram İfşa

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Why is action chunking crucial for robot dexterity? 🤖 - We identify a natural tradeoff between temporal consistency and reactivity - New policy decoding technique that is *both* temporally consistent & fully reactive ICLR 2025 paper: arxiv.org/abs/2408.17355 A short thread 🧵

thumb_up_off_alt339

chat_bubble_outline4

repeat65

shareShare

Yuxiao Qu

@quyuxiao

5 months ago

Heading to ICML Conference #ICML2025 this week! DM me if you’d like to chat ☕️ Come by our poster sessions on: 🧠 Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning (arxiv.org/abs/2503.07572) 🔍 Learning to Discover Abstractions for LLM Reasoning (drive.google.com/file/d/1Sfafrk…)

Heading to <a href="/icmlconf/">ICML Conference</a> #ICML2025 this week! DM me if you’d like to chat ☕️

Come by our poster sessions on:
🧠 Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning (arxiv.org/abs/2503.07572)
🔍 Learning to Discover Abstractions for LLM Reasoning (drive.google.com/file/d/1Sfafrk…)

thumb_up_off_alt45

chat_bubble_outline0

repeat7

shareShare

Yuxiao Qu

@quyuxiao

2 months ago

🚨 NEW PAPER: "RLAD: Training LLMs to Discover Abstractions for Reasoning"! We introduce reasoning abstractions: concise insights that help LLMs solve hard reasoning problems by guiding structured exploration. 📄 arxiv.org/abs/2510.02263 🌐 cohenqu.github.io/rlad.github.io/ 🧵[1/N]

thumb_up_off_alt115

chat_bubble_outline7

repeat20

shareShare

Anikait Singh

@anikait_singh_

2 months ago

🚨🚨New Paper: Training LLMs to Discover Abstractions for Solving Reasoning Problems Introducing RLAD, a two-player RL framework for LLMs to discover 'reasoning abstractions'—natural language hints that encode procedural knowledge for structured exploration in reasoning.🧵⬇️

thumb_up_off_alt586

chat_bubble_outline14

repeat116

shareShare

Chelsea Finn

@chelseabfinn

2 months ago

Hierarchical RL for LLM reasoning. Paper: arxiv.org/abs/2510.02263

thumb_up_off_alt609

chat_bubble_outline4

repeat70

shareShare

Anikait Singh

@anikait_singh_

2 months ago

Excited to share that I’ll be presenting two papers at CoLM 2025! Cognitive Behaviors that Enable Self-Improving Reasoners (Session 1, Poster 26) Training LLMs to Discover Abstractions for Solving Reasoning Problems (Ram 2 Workshop, Talk 4 — Oct 10, 4:00 PM) If you’d like to

thumb_up_off_alt145

chat_bubble_outline8

repeat13

shareShare

Aviral Kumar

@aviral_kumar2

2 months ago

Check out our new paper on improving exploration in CoT for LLMs by generating abstractions! 👇 Rather than letting the LLM think longer and longer to explore, we can let it first produce concise insights that help guide structured exploration later. This works really well!

thumb_up_off_alt88

chat_bubble_outline1

repeat12

shareShare