Abrar Anwar (@_abraranwar) 's Twitter Profile
Abrar Anwar

@_abraranwar

CS PhD student at @USCViterbi | prev intern @nvidia @Cornell @SandiaLabs | undergrad @UTCompSci

ID: 1703364902

Link: http://abraranwar.github.io | Joined: 27-08-2013 00:59:26

133 Tweets

485 Followers

667 Following

Nathan Dennler (@ndennler) 's Twitter Profile Photo

User-aligned robot representations are best learned from user data, but how do we collect this data without onerous labeling processes? We can learn from data that users are ✨intrinsically motivated✨ to produce through exploratory search 🔎! 🧵1/6

Gautam Salhotra (@gautamsalhotra) 's Twitter Profile Photo

We see so many startups offering robot data collection -- how come we don't see any focusing on real-robot evals as a service? There should be real-robot benchmarks for all organisations to test their VLAs on.

Yiğit Korkmaz (@yigitkkorkmaz) 's Twitter Profile Photo

I recently wrote a post about MILE for the USC Robotics blog — check it out here: rasc.usc.edu/blog/mile-mode… Feel free to reach out if you have any questions or thoughts! See you at ICRA 🤖🙂

Abrar Anwar (@_abraranwar) 's Twitter Profile Photo

Arrived at #ICRA2025, and I'll be presenting my ReMEmbR work with NVIDIA on Tuesday! Happy to chat about robot memory, evaluation, language+robots, and reward learning (more to come soon on this one 😉)!

Jesse Zhang (@jesse_y_zhang) 's Twitter Profile Photo

Reward models that help real robots learn new tasks—no new demos needed! ReWiND uses language-guided rewards to train bimanual arms on OOD tasks in 1 hour! Offline-to-online, lang-conditioned, visual RL on action-chunked transformers. 🧵

Erdem Bıyık (@ebiyik_) 's Twitter Profile Photo

Jesse is such a naturally talented advisor that I didn't demonstrate anything about advising, just gave some language instructions. Do you know who else is so good? The model. Check it out 👇🏻

Erdem Bıyık (@ebiyik_) 's Twitter Profile Photo

By the way, Jesse graduated last week and will move to Allen School for a postdoc. Be on the lookout for him when he enters the job market in a few years 🌟

Erdem Bıyık (@ebiyik_) 's Twitter Profile Photo

Another shoutout goes to Jiahui Zhang, Yusen Luo, and Abrar Anwar, who did most of the work. Yusen will apply to PhD programs in the next cycle, so another great candidate to be aware of. Abrar and I are both at #ICRA2025. We will be happy to chat with anyone about the work :)

Abhishek Gupta (@abhishekunique7) 's Twitter Profile Photo

Very very exciting to have Jesse Zhang join us at UW soon! He's done some incredible work - I'd recommend reading rewind-reward.github.io! Congratulations on a fantastic Ph.D. Jesse Zhang 🎉

Sumedh Sontakke (@sota_kke) 's Twitter Profile Photo

Reward learning (like DYNA) has enabled e2e policies to reach 99% SR but (1) generalization to new tasks and (2) sample efficiency are still hard! ReWiND produces better rewards for OOD tasks than SOTA like GVL & LIV from Jason Ma that inspired us! 🌐: rewind-reward.github.io

Rajat Kumar Jenamani (@rkjenamani) 's Twitter Profile Photo

Excited to share our work on continual, flexible, active, and safe robot personalization w/ Tom Silver, Ziang Liu, Ben Dodson & Tapomayukh "Tapo" Bhattacharjee. Also: Tom Silver is starting a lab at Princeton!! I HIGHLY recommend joining — thoughtful, kind, and an absolute joy to work with!

Gautam Salhotra (@gautamsalhotra) 's Twitter Profile Photo

Wondering how to get more from your robot finetuning datasets? MILE extracts more out of intervention-based demos during training, giving you more bang for your buck per demonstration. Read more in the new USC Robotics blogpost! rasc.usc.edu/blog/mile-mode… Yiğit Korkmaz Erdem Bıyık

C's Robotics Paper Notes (@roboreading) 's Twitter Profile Photo

ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations (rewind-reward.github.io). LLM-generated instructions z + demo -> learning-based reward model (progress) R(o, z) -> optimize policy via RL online
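
A minimal sketch of that pipeline, assuming hypothetical `ProgressRewardModel`, `Policy`, and environment interfaces (none of these names come from the ReWiND paper; this is an illustration of the described loop, not the authors' code):

```python
# Sketch: a learned progress model R(o, z) supplies dense rewards so a
# policy can be trained online with RL, with no new demonstrations.
import random

class ProgressRewardModel:
    """Stand-in for a learned model scoring task progress R(o, z) in [0, 1]."""
    def __call__(self, observation, instruction):
        return random.random()  # a real model would regress progress from o and z

class Policy:
    """Stand-in for a language-conditioned policy (e.g., an action-chunked transformer)."""
    def act(self, observation, instruction):
        return [0.0]  # dummy action

    def update(self, transitions):
        pass  # an offline-to-online RL update would go here

class DummyEnv:
    """Toy environment so the sketch runs end to end."""
    def reset(self):
        self.t = 0
        return {"t": self.t}

    def step(self, action):
        self.t += 1
        return {"t": self.t}, self.t >= 5  # (next_obs, done)

def rollout_and_train(env, policy, reward_model, instruction, episodes=3):
    for _ in range(episodes):
        obs, done, transitions = env.reset(), False, []
        while not done:
            action = policy.act(obs, instruction)
            next_obs, done = env.step(action)
            r = reward_model(next_obs, instruction)  # dense learned reward
            transitions.append((obs, action, r, next_obs))
            obs = next_obs
        policy.update(transitions)

rollout_and_train(DummyEnv(), Policy(), ProgressRewardModel(), "open the drawer")
```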

Jesse Zhang (@jesse_y_zhang) 's Twitter Profile Photo

How can non-experts quickly teach robots a variety of tasks? Introducing HAND ✋, a simple, time-efficient method of training robots! Using just a **single hand demo**, HAND learns manipulation tasks in under **4 minutes**! 🧵

Jenny Zhang (@jennyzhangzt) 's Twitter Profile Photo

**When AIs Start Rewriting Themselves**
Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents
The Darwin Gödel Machine can:
1. Read and modify its own code
2. Evaluate if the change improves performance
3. Open-endedly explore the solution space
🧵👇
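
A rough sketch of that three-step loop, under loose assumptions: `propose_edit` and `evaluate` are placeholder stand-ins (not the paper's components), and the archive-based branching is only a schematic of the open-ended search:

```python
# Sketch: mutate the agent's own source, benchmark each variant, and keep
# an archive of variants so the search can branch from stepping stones
# rather than greedily keeping only the single best agent.
import random

def propose_edit(source: str) -> str:
    """Stand-in for an LLM rewriting the agent's own code."""
    return source + f"\n# variant {random.randint(0, 9999)}"

def evaluate(source: str) -> float:
    """Stand-in for running the agent on a coding benchmark."""
    return random.random()

def darwin_godel_loop(seed_source: str, generations: int = 20):
    archive = [(seed_source, evaluate(seed_source))]  # step 3: keep all variants
    for _ in range(generations):
        parent_src, _ = random.choice(archive)  # branch from any ancestor
        child_src = propose_edit(parent_src)    # step 1: modify its own code
        child_score = evaluate(child_src)       # step 2: test for improvement
        archive.append((child_src, child_score))
    return max(archive, key=lambda pair: pair[1])

best_source, best_score = darwin_godel_loop("def agent(task): ...")
print(f"best benchmark score: {best_score:.2f}")
```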

Ishika Singh (@ishika_s_) 's Twitter Profile Photo

VLAs have the potential to generalize over scenes and tasks, but require a ton of data to learn robust policies. We introduce OG-VLA, a novel architecture and learning framework that combines the generalization strengths of VLAs with the robustness of 3D-aware policies. 🧵