Wenhao Yu (@stacormed)'s Twitter Profile
Wenhao Yu

@stacormed

Research Scientist @DeepMind

ID: 261236765

Website: https://wenhaoyu.weebly.com/ · Joined: 05-03-2011 14:54:44

52 Tweets

414 Followers

129 Following

Ayzaan Wahid (@ayzwah)'s Twitter Profile Photo

For the past year we've been working on ALOHA Unleashed 🌋 at @GoogleDeepmind, pushing the scale and dexterity of tasks on our ALOHA 2 fleet. Here is a thread with some of the coolest videos! The first task is hanging a shirt on a hanger (autonomous, 1x)

Wenhao Yu (@stacormed)'s Twitter Profile Photo

In Vienna for ICML this week! Let me know if you are down to catch up; looking forward to the great discussions and talks ahead! Also come check out our work on PIVOT (pivot-prompt.github.io) during Tuesday’s poster session!

Wenhao Yu (@stacormed)'s Twitter Profile Photo

How can we leverage the common sense knowledge from a VLM to understand the progress (and even quality!) of a robot trajectory? Check out GVL for a surprisingly simple and elegant way to do that! Awesome work by Jason!
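
A minimal sketch of the kind of recipe the tweet alludes to, assuming a hypothetical `query_vlm` client; this is an illustration under stated assumptions, not GVL's actual method or API. The idea: ask a VLM for a per-frame task-completion estimate, with frames shuffled so the model cannot rely on temporal order alone.

```python
import random

def query_vlm(task: str, frame) -> float:
    """Placeholder for a real VLM call (hypothetical API).

    Imagined to return the model's estimate of task completion
    in [0, 1] for one image, given a task description.
    """
    raise NotImplementedError("plug in your VLM client here")

def trajectory_progress(task: str, frames: list) -> list[float]:
    """Estimate per-frame progress of a robot trajectory."""
    order = list(range(len(frames)))
    random.shuffle(order)  # hide temporal order so the model can't just count frames
    scores = {i: query_vlm(task, frames[i]) for i in order}
    return [scores[i] for i in range(len(frames))]  # restore chronological order
```

A monotonically increasing score sequence would suggest steady progress; a noisy or flat one could flag a low-quality trajectory.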

Wenhao Yu (@stacormed)'s Twitter Profile Photo

Wow, this is really good! In some ways I’m more impressed that it’s teleoperated than if it were autonomous, because it feels very plausible to develop a highly specialized RL-based policy to do this, but being able to teleoperate it opens up a wide range of data to be collected.

Wenhao Yu (@stacormed)'s Twitter Profile Photo

The 2nd Earth Rover Challenge is coming! Eager to see how much progress AI will make in navigating real cities against real human agents!

Sundar Pichai (@sundarpichai)'s Twitter Profile Photo

We’ve always thought of robotics as a helpful testing ground for translating AI advances into the physical world. Today we’re taking our next step in this journey with our newest Gemini 2.0 robotics models. They show state-of-the-art performance on two important benchmarks -

Yixin Lin (@yixin_lin_)'s Twitter Profile Photo

Complementary to Gemini Robotics -- the massive vision-language-action (VLA) model released yesterday -- we also investigated how far we can push Gemini for robotics _purely from simulation data_ in Proc4Gem: 🧵

Boyuan Chen (@boyuanchen0)'s Twitter Profile Photo

Deadline extended! You now have until May 25th (10 days post-NeurIPS) to submit to our ICML World Model Workshop. Looking forward to your papers!

Wenhao Yu (@stacormed)'s Twitter Profile Photo

How do we imbue robots with the ability to imagine the world and complete tasks better? Join us at the CoRL 25 workshop on Robotics World Modeling and share your latest work in this area!

Wenhao Yu (@stacormed)'s Twitter Profile Photo

Excited to share our latest work on Gemini Robotics 1.5! Our model can effectively learn from the experience of drastically different robots, think on its own, and act as an agent. It’s an important step towards creating a general, intelligent, and friendly robot!

Wenhao Yu (@stacormed)'s Twitter Profile Photo

Gemini Robotics 1.5 is not only general, but also fairly dexterous! Enjoy some fun videos of the robot doing insertion, zipping, and more (remember, this is the *same checkpoint* that also controls two other very different robots) 😆

Caden Lu (@jyluxx)'s Twitter Profile Photo

Interacting with Gemini Robotics 1.5 is so fun! Our Embodied Reasoning model planned the multi-step task and orchestrated our Vision Language Action model for precise execution!
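
A hypothetical sketch of that orchestration pattern, with all names illustrative rather than the real Gemini Robotics API: a high-level embodied-reasoning model decomposes the instruction into subtasks, a low-level vision-language-action (VLA) policy executes each one, and the loop replans on failure.

```python
from dataclasses import dataclass

@dataclass
class Subtask:
    description: str

def plan(instruction: str) -> list[Subtask]:
    """Placeholder for the embodied-reasoning model: breaks an
    instruction into an ordered list of subtasks."""
    raise NotImplementedError("plug in the high-level planner here")

def act(subtask: Subtask) -> bool:
    """Placeholder for the VLA policy: executes one subtask on
    the robot and reports success."""
    raise NotImplementedError("plug in the low-level policy here")

def run(instruction: str, max_replans: int = 3) -> bool:
    """Orchestration loop: plan, execute each subtask in order,
    and replan from scratch if any subtask fails."""
    for _ in range(max_replans):
        if all(act(s) for s in plan(instruction)):
            return True  # every subtask succeeded
    return False
```

Keeping planning and execution in separate models lets each be swapped or scaled independently, which is one common motivation for this split.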