Wenli Xiao (@_wenlixiao)'s Twitter Profile
Wenli Xiao

@_wenlixiao

Graduate Researcher @CMU_Robotics | Intern at GEAR Lab @NvidiaAI | I build AI brains for robots

ID: 1160530286297305093

Website: http://wenlixiao-cs.github.io | Joined: 11-08-2019 12:35:56

87 Tweets

1.1K Followers

437 Following

Xiaofeng Guo (@xiaofeng2guo)'s Twitter Profile Photo

🚀Can we have a freely moving hand in the air for the manipulation policy to directly command in the real world? We introduce Flying Hand: End-Effector-Centric Framework for Versatile Aerial Manipulation Teleoperation and Policy Learning. 🎯EE-centric MPC for aerial manipulator
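
For intuition only, here is a minimal sketch of what an end-effector-centric MPC step can look like: the optimizer plans directly over end-effector motion toward a commanded EE target, using a single-integrator stand-in for the dynamics. The function, weights, and dynamics below are illustrative assumptions, not the paper's aerial-manipulator formulation.

```python
# Minimal EE-centric MPC sketch (assumed single-integrator EE "dynamics";
# not the paper's aerial-manipulator model).
import numpy as np
from scipy.optimize import minimize

def ee_centric_mpc(ee_now, ee_ref, horizon=10, dt=0.05, w_track=1.0, w_effort=0.1):
    """Plan a short sequence of EE velocity commands that tracks ee_ref (3D position)."""
    def cost(u_flat):
        u = u_flat.reshape(horizon, 3)
        ee, c = ee_now.copy(), 0.0
        for k in range(horizon):
            ee = ee + dt * u[k]                        # integrate the EE forward
            c += w_track * np.sum((ee - ee_ref) ** 2)  # track the commanded EE pose
            c += w_effort * np.sum(u[k] ** 2)          # penalize aggressive commands
        return c

    sol = minimize(cost, np.zeros(horizon * 3), method="L-BFGS-B")
    return sol.x.reshape(horizon, 3)[0]                # receding horizon: apply first command

cmd = ee_centric_mpc(np.zeros(3), np.array([0.4, 0.0, 0.2]))
```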

Physical Intelligence (@physical_int)'s Twitter Profile Photo

We got a robot to clean up homes that were never seen in its training data! Our new model, π-0.5, aims to tackle open-world generalization. We took our robot into homes that were not in the training data and asked it to clean kitchens and bedrooms. More below⤵️

Xuxin Cheng (@xuxin_cheng)'s Twitter Profile Photo

Meet 𝐀𝐌𝐎 — our universal whole‑body controller that unleashes the 𝐟𝐮𝐥𝐥  kinematic workspace of humanoid robots to the physical world. AMO is a single policy trained with RL + Hybrid Mocap & Trajectory‑Opt. Accepted to #RSS2025. Try our open models & more 👉

Yanjie Ze (@zeyanjie)'s Twitter Profile Photo

🤖Introducing TWIST: Teleoperated Whole-Body Imitation System. We develop a humanoid teleoperation system to enable coordinated, versatile, whole-body movements, using a single neural network. This is our first step toward general-purpose robots. 🌐humanoid-teleop.github.io

Arthur Allshire (@arthurallshire)'s Twitter Profile Photo

Our new system trains humanoid robots using data from cell phone videos, enabling skills such as climbing stairs and sitting on chairs in a single policy (w/ Hongsuk Benjamin Choi, Junyi Zhang, David McAllister)

Wenli Xiao (@_wenlixiao)'s Twitter Profile Photo

A portable data collection system for general dexterous manipulation—DexWild🦾—can collect data Anywhere, enabling impressive generalization across robots/tasks/scenarios/... Huge congrats to Tony Tao and the team! 💥👏 Website: dexwild.github.io

Max Fu (@letian_fu)'s Twitter Profile Photo

Tired of teleoperating your robots? We built a way to scale robot datasets without teleop, dynamic simulation, or even robot hardware. Just one smartphone scan + one human hand demo video → thousands of diverse robot trajectories. Trainable by diffusion policy and VLA models
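
As a rough illustration of how one demo can be multiplied into many trajectories, the sketch below re-anchors a demonstrated end-effector path (expressed in the object frame, e.g. recovered from the scan and the hand video) to thousands of randomized object poses. Function names, frames, and sampling ranges are assumptions for illustration, not the authors' pipeline.

```python
# Hypothetical sketch: expand one human hand demo into many robot trajectories
# by re-anchoring the demonstrated EE path to randomized object poses.
import numpy as np

def pose_to_mat(xyz, yaw):
    """4x4 homogeneous transform from a planar position + yaw."""
    c, s = np.cos(yaw), np.sin(yaw)
    T = np.eye(4)
    T[:3, :3] = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
    T[:3, 3] = xyz
    return T

def augment_demo(ee_traj_in_obj, n_samples=1000, workspace=((-0.3, 0.3), (-0.3, 0.3))):
    """ee_traj_in_obj: (T, 4, 4) EE poses in the object frame, extracted once from the demo.
    Returns n_samples re-anchored trajectories in the robot/world frame."""
    rollouts = []
    for _ in range(n_samples):
        x = np.random.uniform(*workspace[0])          # randomize object placement
        y = np.random.uniform(*workspace[1])
        yaw = np.random.uniform(-np.pi, np.pi)
        T_world_obj = pose_to_mat([x, y, 0.0], yaw)
        rollouts.append(T_world_obj[None] @ ee_traj_in_obj)  # re-anchor the demo path
    return np.stack(rollouts)                         # (n_samples, T, 4, 4)
```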

Jim Fan (@drjimfan)'s Twitter Profile Photo

What if robots could dream inside a video generative model? Introducing DreamGen, a new engine that scales up robot learning not with fleets of human operators, but with digital dreams in pixels. DreamGen produces massive volumes of neural trajectories - photorealistic robot

Wenli Xiao (@_wenlixiao)'s Twitter Profile Photo

Tired of watching fancy humanoid dancing? Can they just do some useful daily tasks like: "Pass me a bottle of Water🍺"? 🤔Turns out it's nontrivial to stabilize whole-body manipulation and locomotion at the same time. We basically want our humanoid to be stable as a camera

Tianyuan Zhang (@tianyuanzhang99)'s Twitter Profile Photo

Bored of linear recurrent memories (e.g., linear attention) and want a scalable, nonlinear alternative? Our new paper “Test-Time Training Done Right” proposes LaCT (Large Chunk Test-Time Training) — a highly efficient, massively scalable nonlinear memory with: 💡 Pure PyTorch
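
For readers new to test-time training, a minimal PyTorch sketch of the chunk-wise idea is below: a small fast-weight MLP acts as a nonlinear memory and is updated with a few gradient steps per large chunk rather than per token. Shapes, learning rates, and the write/read objective are assumptions; this is not LaCT's implementation.

```python
# Chunk-wise test-time training sketch: a fast-weight MLP as nonlinear memory,
# updated once per large chunk (simplified stand-in, not the paper's code).
import torch
import torch.nn as nn

class FastWeightMemory(nn.Module):
    def __init__(self, dim, hidden=1024):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, hidden), nn.GELU(), nn.Linear(hidden, dim))

    def write(self, keys, values, lr=1e-2, steps=2):
        # Absorb an entire chunk with a few gradient steps (large-chunk update).
        opt = torch.optim.SGD(self.parameters(), lr=lr)
        for _ in range(steps):
            loss = ((self.net(keys) - values) ** 2).mean()
            opt.zero_grad(); loss.backward(); opt.step()

    def read(self, queries):
        with torch.no_grad():
            return self.net(queries)

dim, chunk = 256, 2048
mem = FastWeightMemory(dim)
for _ in range(4):                                   # stream the sequence in big chunks
    k, v = torch.randn(chunk, dim), torch.randn(chunk, dim)
    mem.write(k, v)                                  # update the nonlinear memory
out = mem.read(torch.randn(8, dim))                  # query the accumulated memory
```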

Dana Aubakirova (@daubakirovaa)'s Twitter Profile Photo

Today, we are introducing SmolVLA: a 450M open-source vision-language action model. Best-in-class performance and inference speed! And the best part? We trained it using all the open-source LeRobot datasets in the Hugging Face hub! But how? 🫳🏀

Wenli Xiao (@_wenlixiao)'s Twitter Profile Photo

"I believe finding such a scalable off-policy RL algorithm is the most important missing piece in machine learning today." Very insightful blog on offlineRL by Seohong Park 🫡 It's quite painful that offlineRL only works for "reduced horizon" at this stage. looking forward to

Wenli Xiao (@_wenlixiao)'s Twitter Profile Photo

💡Wow—super dynamic motion controlled by a unified general policy! 🔗 gmt-humanoid.github.io Feels like the recipe for training a general whole-body controller has almost converged: MoE oracle teacher → generalist student policy. In our previous research: - HOVER
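
Concretely, the oracle-teacher → generalist-student recipe usually amounts to distilling an RL-trained teacher that sees privileged, sim-only state into a student that only uses observations available on the real robot. The loop below is a generic DAgger-style distillation sketch with placeholder shapes, not the code of HOVER or GMT.

```python
# Generic teacher -> student distillation sketch (placeholder shapes and data;
# the teacher would normally be an RL-trained oracle, the obs a sim rollout).
import torch
import torch.nn as nn

teacher = nn.Sequential(nn.Linear(128 + 64, 256), nn.ELU(), nn.Linear(256, 23))  # privileged obs
student = nn.Sequential(nn.Linear(128, 256), nn.ELU(), nn.Linear(256, 23))       # deployable obs
opt = torch.optim.Adam(student.parameters(), lr=3e-4)

for step in range(1_000):
    obs = torch.randn(512, 128)      # proprioception available at deploy time
    priv = torch.randn(512, 64)      # sim-only state (true velocities, terrain, ...)
    with torch.no_grad():
        target = teacher(torch.cat([obs, priv], dim=-1))   # oracle action labels
    loss = ((student(obs) - target) ** 2).mean()           # imitate the oracle
    opt.zero_grad(); loss.backward(); opt.step()
```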

Haoyu Xiong (@haoyu_xiong_)'s Twitter Profile Photo

Your bimanual manipulators might need a Robot Neck 🤖🦒 Introducing Vision in Action: Learning Active Perception from Human Demonstrations ViA learns task-specific, active perceptual strategies—such as searching, tracking, and focusing—directly from human demos, enabling robust

Joel Jang (@jang_yoel)'s Twitter Profile Photo

🚀 GR00T Dreams code is live! NVIDIA GEAR Lab's open-source solution for robotics data via video world models. Fine-tune on any robot, generate 'dreams', extract actions with IDM, and train visuomotor policies with LeRobot datasets (GR00T N1.5, SmolVLA). github.com/NVIDIA/GR00T-D…
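
For orientation, the dream → IDM → policy loop reads roughly like the sketch below. The classes are stand-ins for the released components and do not reflect the repository's actual API.

```python
# Dream -> IDM -> policy sketch with stub models (illustrative only; not the
# GR00T Dreams API).
import torch
import torch.nn as nn

class StubWorldModel(nn.Module):
    def generate(self, prompt, horizon=16):
        # A fine-tuned video world model would return a prompt-conditioned rollout.
        return torch.randn(horizon, 3, 224, 224)

class StubIDM(nn.Module):
    def forward(self, video):
        # Inverse dynamics: infer the action taken between consecutive frames.
        return torch.randn(video.shape[0] - 1, 7)

policy = nn.Sequential(nn.Flatten(), nn.Linear(3 * 224 * 224, 256), nn.ReLU(), nn.Linear(256, 7))
opt = torch.optim.Adam(policy.parameters(), lr=1e-4)
world_model, idm = StubWorldModel(), StubIDM()

for prompt in ["pick up the cup", "open the drawer"]:
    video = world_model.generate(prompt)                 # 1. generate a "dream" in pixels
    actions = idm(video)                                 # 2. extract pseudo-action labels
    loss = ((policy(video[:-1]) - actions) ** 2).mean()  # 3. supervise the visuomotor policy
    opt.zero_grad(); loss.backward(); opt.step()
```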