Jason Liu (@jasonjzliu) 's Twitter Profile
Jason Liu

@jasonjzliu

PhD Student @CMU_Robotics | Prev: Robot Learning @NvidiaAI | Engineering Science @UofT

ID: 2788592256

Link: http://jasonjzliu.com · Joined: 03-09-2014 21:12:00

176 Tweets

1.1K Followers

261 Following

Kenny Shaw (@kenny__shaw) 's Twitter Profile Photo

Exciting to see (at 5:55) Nvidia adopting LEAP Hand in their sim2real efforts! Build your own at leaphand.com! Lots more coming this summer, stay tuned :) Deepak Pathak Ananye Agarwal

Jason Liu (@jasonjzliu) 's Twitter Profile Photo

Robot data is expensive and hard to scale. But what if we could collect rich, diverse demos—with just our hands? 🙌 Our latest work, DexWild, shows how large-scale human data 💪 + robot data 🦾 co-training enables strong generalization across tasks, scenes, and embodiments.
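
For a sense of what human + robot co-training could look like mechanically, here is a minimal Python sketch. The dataset contents, batch size, and the 0.9 human-to-robot sampling ratio are illustrative assumptions, not DexWild's actual configuration.

```python
import random

# Toy stand-ins for the two data sources; in practice these would be
# (observation, action) trajectories. The sizes and the 0.9 human ratio
# are illustrative assumptions, not DexWild's actual settings.
human_demos = [("human_obs_%d" % i, "action") for i in range(10_000)]
robot_demos = [("robot_obs_%d" % i, "action") for i in range(500)]

def sample_cotraining_batch(batch_size=256, human_ratio=0.9):
    """Draw one training batch mixing human and robot demos at a fixed ratio."""
    n_human = int(batch_size * human_ratio)
    batch = random.sample(human_demos, n_human)
    # The robot dataset is small, so sample it with replacement.
    batch += random.choices(robot_demos, k=batch_size - n_human)
    random.shuffle(batch)
    return batch

batch = sample_cotraining_batch()
```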

Deepak Pathak (@pathak2206) 's Twitter Profile Photo

Introducing DexWild -- a scalable approach to diverse "in the wild" data collection for dexterous robotic hands! This data can be used to co-train policies for any downstream robotic hand on any body form factor (humanoids, AMRs with arms, etc.). 🚀🤖

Kenny Shaw (@kenny__shaw) 's Twitter Profile Photo

Very exciting Handy Moves workshop at ICRA 2025 this year! It's an honor to be hosting this morning session! Please join us in Room 302 😀 sites.google.com/view/dexterity…

Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

Excited to share our work: Maximizing Confidence Alone Improves Reasoning. Humans rely on confidence to learn when answer keys aren’t available (e.g., taking an exam). Surprisingly, LLMs can also learn w/o ground-truth answers, simply by reinforcing high-confidence answers via RL!
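
One natural way to operationalize "reinforcing high-confidence answers" is to use the negative entropy of the model's per-token output distributions as the RL reward. The sketch below illustrates that idea; the function name and exact reward shape are assumptions, not the paper's verbatim objective.

```python
import math

def confidence_reward(token_probs):
    """Negative mean token entropy: higher when the model is more confident.

    token_probs: one probability distribution (list of floats) per generated
    token. Negative entropy as the reward is one reading of "maximizing
    confidence"; the paper's exact formulation may differ.
    """
    entropies = []
    for dist in token_probs:
        h = -sum(p * math.log(p) for p in dist if p > 0)
        entropies.append(h)
    return -sum(entropies) / len(entropies)

# Example: a confident step scores higher than an uncertain one.
confident = [[0.97, 0.01, 0.01, 0.01]]
uncertain = [[0.25, 0.25, 0.25, 0.25]]
assert confidence_reward(confident) > confidence_reward(uncertain)
```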

Lili (@lchen915) 's Twitter Profile Photo

One fundamental issue with RL – whether it’s for robots or LLMs – is how hard it is to get rewards. For LLM reasoning, we need ground-truth labels to verify answers. We found that maximizing confidence alone allows LLMs to improve their reasoning with RL!

Fahim Tajwar (@fahimtajwar10) 's Twitter Profile Photo

RL with verifiable reward has shown impressive results in improving LLM reasoning, but what can we do when we do not have ground truth answers? Introducing Self-Rewarding Training (SRT): where language models provide their own reward for RL training! 🧵 1/n
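
A minimal sketch of how a model could provide its own reward: sample several answers per prompt and reward the ones that agree with the majority vote, using the model's self-consistency as a pseudo-label. The majority-vote rule and function name here are assumptions about SRT's specifics, sketched for illustration.

```python
from collections import Counter

def srt_rewards(sampled_answers):
    """With no ground truth, treat the model's majority-vote answer as a
    pseudo-label and reward samples that agree with it (a hypothetical
    reading of self-rewarding training, not the paper's exact recipe)."""
    majority, _ = Counter(sampled_answers).most_common(1)[0]
    return [1.0 if ans == majority else 0.0 for ans in sampled_answers]

# E.g., 8 sampled solutions to one prompt, reduced to final answers:
answers = ["42", "42", "41", "42", "7", "42", "42", "41"]
print(srt_rewards(answers))  # 1.0 wherever the answer matches the mode "42"
```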

Deepak Pathak (@pathak2206) 's Twitter Profile Photo

Maximizing Confidence Alone Improves Reasoning. Feels like the start of the "curiosity-driven learning" era for LLMs. I have spent most of my career building agents that can self-improve without any external rewards (e.g., curiosity work during my PhD and then at CMU).

Robotic Systems Lab (@leggedrobotics) 's Twitter Profile Photo

A legged mobile manipulator trained to play badminton with humans coordinates whole-body maneuvers and onboard perception. Paper: science.org/doi/10.1126/sc… Video: youtu.be/zYuxOVQXVt8 Yuntao Ma, Andrei Cramariuc, Farbod Farshidian, Marco Hutter

Tyler Lum (@tylerlum23) 's Twitter Profile Photo

🧑🤖 Introducing Human2Sim2Robot! 💪🦾 Learn robust dexterous manipulation policies from just one human RGB-D video. Our Real→Sim→Real framework crosses the human-robot embodiment gap using RL in simulation. #Robotics #DexterousManipulation #Sim2Real 🧵1/7

Yitang Li (@li_yitang) 's Twitter Profile Photo

🤖 Can a humanoid robot carry a full cup of beer without spilling while walking 🍺? Introducing Hold My Beer 🍺: Learning Gentle Humanoid Locomotion and End-Effector Stabilization Control. Project: lecar-lab.github.io/SoFTA/ See more details below 👇

Jingyun Yang (@yjy0625) 's Twitter Profile Photo

Introducing Mobi-π: Mobilizing Your Robot Learning Policy. Our method:
✈️ enables flexible mobile skill chaining
🪶 without requiring additional policy training data
🏠 while scaling to unseen scenes
🧵↓

Haoyu Xiong (@haoyu_xiong_) 's Twitter Profile Photo

Your bimanual manipulators might need a Robot Neck 🤖🦒 Introducing Vision in Action: Learning Active Perception from Human Demonstrations. ViA learns task-specific, active perceptual strategies—such as searching, tracking, and focusing—directly from human demos, enabling robust…