Paul Zhou (@zhiyuan_zhou_)'s Twitter Profile
Paul Zhou

@zhiyuan_zhou_

CS Ph.D. student at UC Berkeley

ID: 1712920268845522944

Website: http://zhouzypaul.github.io · Joined: 13-10-2023 19:56:38

80 Tweets

345 Followers

221 Following

Arthur Allshire (@arthurallshire) 's Twitter Profile Photo

our new system trains humanoid robots using data from cell phone videos, enabling skills such as climbing stairs and sitting on chairs in a single policy (w/ Hongsuk Benjamin Choi, Junyi Zhang, David McAllister)

RoboPapers (@robopapers) 's Twitter Profile Photo

Full episode dropping soon! Geeking out with Paul Zhou on AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World (auto-eval.github.io). Co-hosted by Chris Paxton & Michael Cho - Rbt/Acc

Paul Zhou (@zhiyuan_zhou_) 's Twitter Profile Photo

This was fun, thanks for having me Chris Paxton & Michael Cho - Rbt/Acc! See the podcast for a livestream of the robot in real time and me evaluating a policy live! Or check it out for yourself at auto-eval.github.io and eval your policy in the real world without breaking a sweat

Paul Zhou (@zhiyuan_zhou_) 's Twitter Profile Photo

Yes! Let's build a network of distributed eval stations together 🦾 With our open-sourced framework it now takes only 3-5 hours to set up a new AutoEval station! We have released a detailed step-by-step guide.

Chris Paxton (@chris_j_paxton) 's Twitter Profile Photo

One of the biggest challenges in robot learning is that we don't have a solution for comparable, reproducible evaluation of different methods. Enter AutoEval, which allows you to (1) test methods on known problems via a web API, and (2) get all the tools you need to set up
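
To make (1) concrete, here is a minimal sketch of what querying a remote eval station over HTTP might look like. The endpoint paths, payload fields, and the idea of pointing the station at a self-hosted policy server are illustrative assumptions for this sketch, not the actual AutoEval interface (see auto-eval.github.io for that).

```python
# Illustrative sketch only -- endpoint names and payload fields are assumptions,
# not the real AutoEval API. The pattern: submit an eval job that points the
# station at a policy server you host, then poll until the rollouts finish.
import time
import requests

STATION_URL = "http://example-eval-station.org"    # hypothetical station address
POLICY_URL = "http://my-lab-server.org:8000/act"   # hypothetical policy endpoint

def submit_and_wait(task: str, poll_s: int = 30) -> dict:
    # Ask the station to run `task` against the policy served at POLICY_URL.
    job = requests.post(
        f"{STATION_URL}/jobs",
        json={"task": task, "policy_url": POLICY_URL},
    ).json()
    # Poll until the station reports the rollout batch has finished.
    while True:
        status = requests.get(f"{STATION_URL}/jobs/{job['id']}").json()
        if status["state"] == "done":
            return status["results"]   # e.g. success rate over the rollout batch
        time.sleep(poll_s)

if __name__ == "__main__":
    print(submit_and_wait("open the drawer"))
```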

Michael Cho - Rbt/Acc (@micoolcho) 's Twitter Profile Photo

In my view, real-world eval is an even bigger bottleneck than the lack of data in robotics. We need more attempts like AutoEval that think creatively about how to scale evals/benchmarking. Great fun chatting with Paul Zhou & Chris Paxton

Ville 🤖 (@villekuosmanen) 's Twitter Profile Photo

This is a very cool project and I can see why access to safe, autonomous, and robust API-first eval cells around the world could be a useful commercial product!

Yifei Zhou (@yifeizhou02) 's Twitter Profile Photo

With previous research in multimodal models and agents, I believe the only truly useful multimodal agent before 2027 will be multimodal co-creation in structured formats. Sharing my first blogpost, because I don't quite see this point of view around, but it can be quite impactful for society.

Kevin Frans (@kvfrans) 's Twitter Profile Photo

Over the past year, I've been compiling some "alchemist's notes" on deep learning. Right now it covers basic optimization, architectures, and generative models.

Focus is on learnability -- each page has nice graphics and an end-to-end implementation.

notes.kvfrans.com
Kevin Frans (@kvfrans) 's Twitter Profile Photo

Stare at policy improvement and diffusion guidance, and you may notice a suspicious similarity...

We lay out an equivalence between the two, formalizing a simple technique (CFGRL) that improves performance across the board when training diffusion policies.

arxiv.org/abs/2505.23458
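
To make the connection concrete, here is a minimal sketch (illustrative names, not the paper's code) of classifier-free guidance applied at a diffusion policy's denoising step: the unconditional and conditional noise predictions are combined with a guidance weight w, and pushing w above zero is the policy-improvement direction the equivalence formalizes.

```python
def guided_noise_prediction(model, noisy_action, obs, improve_token, t, w=1.5):
    """Classifier-free guidance for a diffusion policy (illustrative sketch).

    `model` predicts the noise added to an action sample; `improve_token` is the
    conditioning signal (e.g. an 'optimal' indicator), dropped for the
    unconditional pass. w = 0 recovers plain behavior cloning; w > 0 steers
    samples toward actions the conditional model prefers.
    """
    eps_cond = model(noisy_action, obs, improve_token, t)   # conditional pass
    eps_uncond = model(noisy_action, obs, None, t)          # unconditional pass
    return eps_uncond + w * (eps_cond - eps_uncond)
```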
Seohong Park (@seohong_park) 's Twitter Profile Photo

We found a way to do RL *only* with BC policies.

The idea is simple:

1. Train a BC policy π(a|s)
2. Train a conditional BC policy π(a|s, z)
3. Amplify(!) the difference between π(a|s, z) and π(a|s) using CFG

Here, z can be anything (e.g., goals for goal-conditioned RL).

🧵↓
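
A minimal sketch of what step 3's amplification can look like for policies with explicit (log-)probabilities; the function and variable names are illustrative, not the paper's code. The amplified distribution is log π_w(a|s,z) ∝ log π(a|s) + w·(log π(a|s,z) − log π(a|s)), so w = 1 recovers the conditional BC policy and larger w extrapolates further toward the behavior implied by z.

```python
import torch

def amplified_logits(logits_uncond: torch.Tensor,
                     logits_cond: torch.Tensor,
                     w: float = 3.0) -> torch.Tensor:
    # CFG-style amplification in log space (illustrative sketch):
    #   log pi_w(a|s,z) ∝ log pi(a|s) + w * (log pi(a|s,z) - log pi(a|s))
    # Logits differ from log-probs only by a per-state constant, which the
    # softmax inside Categorical normalizes away, so logits can be combined directly.
    return logits_uncond + w * (logits_cond - logits_uncond)

# Hypothetical usage with two trained BC heads over a discrete action set:
#   logits_uncond = bc_policy(state)               # pi(a|s)
#   logits_cond = conditional_bc_policy(state, z)  # pi(a|s,z), z e.g. a goal
#   dist = torch.distributions.Categorical(
#       logits=amplified_logits(logits_uncond, logits_cond))
#   action = dist.sample()
```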
Paul Zhou (@zhiyuan_zhou_) 's Twitter Profile Photo

Traveling to #RSS2025 tomorrow and looking forward to catching up with old friends and meeting new ones! I'll be presenting AutoEval at the Robot Evaluation Workshop on Wednesday, and I'm honored to receive the oral and the workshop award! Oral: Wed 11AM · Poster: Wed 3:30-4:30