Yixuan Wang (@yxwangbot)'s Twitter Profile
Yixuan Wang

@yxwangbot

CS Ph.D. student @Columbia working on robotics | Worked at Boston Dynamics AI Institute, Google X #Vision #Robotics #Learning

ID: 1184179638215397376

Link: https://wangyixuan12.github.io/ | Joined: 15-10-2019 18:50:09

152 Tweets

1.1K Followers

964 Following

Yunzhu Li (@yunzhuliyz)'s Twitter Profile Photo

**Steerability** remains one of the key issues for current vision-language-action models (VLAs). Natural language is often ambiguous and vague: "Hang a mug on a branch" vs "Hang the left mug on the right branch." Many works claim to handle language input, yet the tasks are often…

Yixuan Wang (@yxwangbot)'s Twitter Profile Photo

Two releases in a row from our lab today 😆 One problem I have long been pondering is how to use structured representations while keeping them scalable. Super excited that Kaifeng's work pushes this direction forward, and I cannot wait to see what comes next!!

Katherine Liu (@robo_kat)'s Twitter Profile Photo

How can we achieve both common-sense understanding that can deal with varying levels of ambiguity in language and dexterous manipulation? Check out CodeDiffuser, a really neat work that bridges code generation with a 3D Diffusion Policy! This was a fun project with cool experiments! 🤖

Yunzhu Li (@yunzhuliyz)'s Twitter Profile Photo

We’ve been exploring 3D world models with the goal of finding the right recipe that is both: (1) structured—for sample efficiency and generalization (my personal emphasis), and (2) scalable—as we increase real-world data collection. With **Particle-Grid Neural Dynamics**…

Yixuan Wang (@yxwangbot)'s Twitter Profile Photo

Just arrived in LA and excited to be at RSS! I will present CodeDiffuser at the following sessions:
- Presentation on June 22 (Sun.), 5:30 PM - 6:30 PM
- Poster on June 22 (Sun.), 6:30 PM - 8:00 PM

I will also present CuriousBot at:
- FM4RoboPlan Workshop on June 21 (Sat.), 9:40 - 10:10

Yunzhu Li (@yunzhuliyz)'s Twitter Profile Photo

Had a great time yesterday giving three invited talks at #RSS2025 workshops—on foundation models, structured world models, and tactile sensing for robotic manipulation. Lots of engaging conversations! One more talk coming up on Wednesday (6/25). Also excited to be presenting two…

Shivansh Patel (@shivanshpatel35)'s Twitter Profile Photo

🚀 Introducing RIGVid: Robots Imitating Generated Videos! Robots can now perform complex tasks—pouring, wiping, mixing—just by imitating generated videos, purely zero-shot! No teleop. No OpenX/DROID/Ego4D. No videos of human demonstrations. Only AI-generated video demos 🧵👇

Russ Tedrake (@russtedrake)'s Twitter Profile Photo

TRI's latest Large Behavior Model (LBM) paper landed on arXiv last night! Check out our project website: toyotaresearchinstitute.github.io/lbm1/ One of our main goals for this paper was to put out a very careful and thorough study on the topic to help people understand the state of the…