Max Fu (@letian_fu)'s Twitter Profile
Max Fu

@letian_fu

PhD at @UCBerkeley @berkeley_ai. Prev intern @Apple @autodesk. Robotics, Foundation Models.

ID: 769239354

Website: https://max-fu.github.io/ · Joined: 20-08-2012 10:08:45

103 Tweets

453 Followers

502 Following

Arthur Allshire (@arthurallshire) 's Twitter Profile Photo

our new system trains humanoid robots using data from cell phone videos, enabling skills such as climbing stairs and sitting on chairs in a single policy (w/ Hongsuk Benjamin Choi, Junyi Zhang, David McAllister)

ollama (@ollama) 's Twitter Profile Photo


Multimodal model support is here in 0.7!  

Ollama now supports multimodal models via its new engine. 

Cool vision models to try👇

- Llama 4 Scout & Maverick
- Gemma 3 
- Qwen 2.5 VL 
- Mistral Small 3.1 
and more 😍

Blog post 🧵👇
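Ollama's `/api/chat` endpoint accepts base64-encoded images in a message's `images` field, which is how vision models like Gemma 3 are prompted. A minimal sketch of assembling such a request body (the model name and image bytes are placeholders; actually sending it to a running Ollama server is omitted):

```python
import base64
import json

def build_vision_chat_request(model: str, prompt: str, image_bytes: bytes) -> str:
    """Build the JSON body for Ollama's /api/chat endpoint.

    Images travel as base64 strings inside the user message's
    "images" list, alongside the text prompt.
    """
    payload = {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": prompt,
                "images": [base64.b64encode(image_bytes).decode("ascii")],
            }
        ],
        "stream": False,
    }
    return json.dumps(payload)

# Example: a request asking Gemma 3 about a (placeholder) image.
body = build_vision_chat_request("gemma3", "What is in this image?", b"\x89PNG...")
```

POSTing `body` to `http://localhost:11434/api/chat` on a machine running Ollama 0.7+ would return the model's description of the image.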
Long Lian (@longtonylian) 's Twitter Profile Photo

As we all know, collecting data for robotics is very costly. This is why I’m very impressed by this work: it generates a huge amount of data for different robots without any teleoperation.

Raven Huang (@ravenhuang4) 's Twitter Profile Photo

Can we scale up robot data collection without a robot? We propose a pipeline that scales a robot dataset from a single human demonstration. Through a Real2Render2Real pipeline, policies trained on the generated data can be deployed directly on a real robot.

Fangchen Liu (@fangchenliu_) 's Twitter Profile Photo

People are collecting large-scale teleoperation datasets, which are often just kinematics-level trajectories. Real2Render2Real is a new framework that can generate these data without teleoperation or tricky sim+RL. High data quality for BC plus a nice scaling effect, please dive in for more!
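The core idea of turning one demonstration into many can be sketched as a toy data-augmentation loop: apply random planar offsets to a nominal trajectory, standing in for re-rendering a scanned scene under randomized object poses. This is a hypothetical illustration, not the Real2Render2Real implementation, and all names and parameters here are made up:

```python
import math
import random

def randomize_trajectory(demo, n_variants=100, xy_jitter=0.05, yaw_jitter=0.2, seed=0):
    """Expand one demonstration into many synthetic variants.

    `demo` is a list of (x, y) end-effector waypoints. Each variant
    applies a random planar rotation (yaw) about the origin plus a
    random (dx, dy) translation, mimicking a randomized object pose.
    """
    rng = random.Random(seed)
    variants = []
    for _ in range(n_variants):
        dx = rng.uniform(-xy_jitter, xy_jitter)
        dy = rng.uniform(-xy_jitter, xy_jitter)
        yaw = rng.uniform(-yaw_jitter, yaw_jitter)
        c, s = math.cos(yaw), math.sin(yaw)
        variants.append([(c * x - s * y + dx, s * x + c * y + dy) for x, y in demo])
    return variants

# One human demo in, a hundred training trajectories out.
demo = [(0.0, 0.0), (0.1, 0.0), (0.2, 0.1)]
dataset = randomize_trajectory(demo)
```

The seeded RNG keeps the generated dataset reproducible across runs.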

Max Fu (@letian_fu) 's Twitter Profile Photo

Large language models can do new tasks from a few text prompts. What if robots could do the same—with trajectories? 🤖 ICRT enables zero-shot imitation: prompt with a few teleop demos, and it acts—no fine-tuning. Happy to chat more at ICRA! 📍 ICRA | Wed 21 May | 08:35 - 08:40
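The trajectory-as-prompt idea can be sketched as a plain data-structure exercise: concatenate a few demonstrations' (observation, action) pairs, append the current observation, and let the policy continue the sequence with an action, analogous to an LLM completing a text prompt. This is a hypothetical sketch of the interface, not ICRT's actual tokenization:

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class Demo:
    # A teleoperated demonstration: a sequence of (observation, action) pairs.
    steps: List[Tuple[list, list]]

def build_context(demos: List[Demo], current_obs: list) -> list:
    """Assemble an in-context imitation prompt.

    The prompt interleaves each demo's observations and actions, then
    ends with the current observation; the policy is expected to
    continue the sequence with the next action, with no fine-tuning.
    """
    context = []
    for demo in demos:
        for obs, act in demo.steps:
            context.append(("obs", obs))
            context.append(("act", act))
    context.append(("obs", current_obs))
    return context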

Zubair Irshad (@mzubairirshad) 's Twitter Profile Photo

Interested in collecting robot training data without robots in the loop? 🦾 Check out this cool new approach that uses a single mobile device scan and a human demo video to generate diverse data for training diffusion and VLA manipulation policies. 🚀 Great work by Max Fu

Yi Zhou (@papagina_yi) 's Twitter Profile Photo

🚀 Struggling with the lack of high-quality data for AI-driven human-object interaction research? We've got you covered! Introducing HUMOTO, a groundbreaking 4D dataset for human-object interaction, developed with a combination of wearable motion capture, SOTA 6D pose

Haonan Chen (@haonanchen_) 's Twitter Profile Photo


We hope everyone had a great time at the ICRA 2025 Workshop on Learning Meets Model-Based Methods for Contact-Rich Manipulation (contact-rich.github.io)!

Big thanks to our incredible speakers, panelists, and generous sponsors — and most of all, to our amazing co-organizers
Stella Li (@stellalisy) 's Twitter Profile Photo


🤯 We cracked RLVR with... Random Rewards?!
Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:
- Random rewards: +21%
- Incorrect rewards: +25%
- (FYI) Ground-truth rewards: +28.8%
How could this even work⁉️ Here's why: 🧵
Blogpost: tinyurl.com/spurious-rewar…
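The three reward conditions named in the thread can be illustrated with toy reward functions for an RLVR loop. These are hypothetical stand-ins to make the conditions concrete, not the paper's implementation:

```python
import random

def ground_truth_reward(answer: str, gold: str) -> float:
    # Reward 1 only when the model's answer matches the reference.
    return float(answer.strip() == gold.strip())

def incorrect_reward(answer: str, gold: str) -> float:
    # Deliberately reward the wrong answers instead of the right ones.
    return 1.0 - ground_truth_reward(answer, gold)

def random_reward(rng: random.Random, p: float = 0.5) -> float:
    # Ignore the answer entirely: a coin flip decides the reward.
    return float(rng.random() < p)
```

The surprising claim is that even the last two signals, which carry no correctness information (or the opposite of it), still improved Qwen2.5-Math-7B on MATH-500.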
Robotic Systems Lab (@leggedrobotics) 's Twitter Profile Photo

A legged mobile manipulator trained to play badminton with humans coordinates whole-body maneuvers and onboard perception.
Paper: science.org/doi/10.1126/sc……
Video: youtu.be/zYuxOVQXVt8
Yuntao Ma, Andrei Cramariuc, Farbod Farshidian, Marco Hutter

Sawyer Merritt (@sawyermerritt) 's Twitter Profile Photo


Waymo in a new blog post: "We conducted a comprehensive study using Waymo’s internal dataset. Spanning 500,000 hours of driving, it is significantly larger than any dataset used in previous scaling studies in the AV domain.

Our study uncovered the following: 
• Similar to LLMs,