YI LI (@yi_li_uw)'s Twitter Profile
YI LI

@yi_li_uw

🎓 Final-Year Ph.D. Candidate @UW | 🤖 Robotics Research Intern @NVIDIA | 💡 Ex-Microsoft Research Asia, Tsinghua | 📢 Seeking opportunities in Robotics & VLM

ID: 1667638756336361472

Link: http://yili.vision · Joined: 10-06-2023 21:04:09

31 Tweets

179 Followers

189 Following

Shengyi Qian (@jasonqsy)'s Twitter Profile Photo

Exciting News! Our new paper "3D-MVP" is out! We propose a novel approach for 3D multi-view pretraining using masked autoencoders, leveraging Robotic View Transformer (RVT) to improve generalization for downstream robotics tasks. jasonqsy.github.io/3DMVP/
DeepSeek (@deepseek_ai)'s Twitter Profile Photo

🚀 Day 0: Warming up for #OpenSourceWeek! We're a tiny team at DeepSeek exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency. These humble building blocks in our online service have been documented,

Abhishek Gupta (@abhishekunique7)'s Twitter Profile Photo

Over the last few months, we’ve been thinking about how to learn from “off-domain” data - data from non-robot sources like video or simulation. These data sources are not quite good enough to learn policies (even monolithic VLA models) directly, but they still contain lots of

Ted Xiao (@xiao_ted)'s Twitter Profile Photo

There is so much potential in moving beyond simple natural language when building robot foundation models. Trajectories are a great way to do this -- and HAMSTER has taken trajectory-conditioned policies to the scale of VLAs! Congrats to the whole HAMSTER team.
Remi Cadene (@remicadene)'s Twitter Profile Photo

⛔ STOP WHAT YOU'RE DOING ⛔ THERE IS A NEW ROBOT IN TOWN ~ LeKiwi 🥝 ~ ... build it yourself to automate daily chores with AI 🤗 1/🧵 A thread!

Chris Paxton (@chris_j_paxton)'s Twitter Profile Photo

It's extremely clear to me that this sort of approach is the future, especially in relatively structured warehouse and logistics environments. When I was doing industrial manipulation, it was common for setup to take multiple days. Integration with existing systems and workflows

YI LI (@yi_li_uw)'s Twitter Profile Photo

It is exciting to see more hierarchical VLA models in the same month! 🎉 Hi Robot from Physical Intelligence uses language, Helix from Figure uses latent embeddings, and our HAMSTER from @NVIDIA uses 2D paths. We’re all inspired by Thinking, Fast and Slow—combining fast, intuitive (System
Yuzhe Qin (@qinyuzhe)'s Twitter Profile Photo

Meet our first general-purpose robot at Dexmate: dexmate.ai/vega. Adjustable height from 0.66 m to 2.2 m: compact enough for an SUV, tall enough to reach those impossible high shelves. Powerful dual arms (15 lbs payload each) and omni-directional mobility for ultimate

Yuke Zhu (@yukez)'s Twitter Profile Photo

Thrilled to announce GR00T N1, our open foundation model for generalist humanoid robots! GR00T N1 adopts a dual-system design, leverages the entire data pyramid for model training, and supports various robot embodiments. GR00T N1 embodies years of fundamental research, spanning

Remi Cadene (@remicadene)'s Twitter Profile Photo

Meet SO-101, the next-gen robot arm for all, by Hugging Face 🤗 Enables smooth takeover to boost AI capabilities, faster assembly (20 min), same affordable price ($100 per arm) 🤯 Get yours today! Links in thread below 👇

NVIDIA Robotics (@nvidiarobotics)'s Twitter Profile Photo

🎊 That's a wrap on #ICLR2025. Shout out to all the amazing research in #robotics, machine vision, and more. Missed it? Check out our #NVIDIAResearch paper on hierarchical action models for open-world robot manipulation. 📄 nvda.ws/42OZKcC

Junyao Shi (@junyaoshi)'s Twitter Profile Photo

On my way to Atlanta to present ZeroMimic: Distilling Robotic Manipulation Skills from Web Videos at IEEE ICRA! Stay tuned for an in-depth post about how ZeroMimic distills zero-shot policies from web human videos. 🌐 Project site: zeromimic.github.io

Joel Jang (@jang_yoel)'s Twitter Profile Photo

Introducing 𝐃𝐫𝐞𝐚𝐦𝐆𝐞𝐧! We got humanoid robots to perform totally new 𝑣𝑒𝑟𝑏𝑠 in new environments through video world models. We believe video world models will solve the data problem in robotics. Bringing the paradigm of scaling human hours to GPU hours. Quick 🧵

Ryan Hoque (@ryan_hoque)'s Twitter Profile Photo

Imitation learning has a data scarcity problem. Introducing EgoDex from Apple, the largest and most diverse dataset of dexterous human manipulation to date — 829 hours of egocentric video + paired 3D hand poses across 194 tasks. Now on arxiv: arxiv.org/abs/2505.11709 (1/4)