It's notoriously difficult to model the mechanics of compliant robot jaw tips during grasping! We found that a new tool from computer graphics can help. IPC-GraspSim, from AUTOLab at UC Berkeley. Paper, data, video: sites.google.com/berkeley.edu/i… (1/9)
Wouldn't it be nice if ChatGPT could find your missing keys for you? Our latest research from Berkeley AI Research + Google AI suggests that robots can use large language models (LLMs) to find hidden objects faster. 🧵👇
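A minimal sketch of the general idea (not the paper's implementation): ask an LLM to score how likely each candidate location is to contain the missing object, then search in that order. `query_llm` is a hypothetical placeholder for whatever LLM API you use.

```python
# Hypothetical sketch: use an LLM's semantic prior to order a search over
# candidate locations. `query_llm` is a stub standing in for a real LLM call.

def query_llm(prompt: str) -> float:
    """Placeholder for a real LLM call that parses a likelihood from the reply."""
    return 0.5  # stub; replace with an actual API call and response parsing

def rank_search_locations(target: str, candidates: list[str]) -> list[str]:
    """Order candidate locations by LLM-estimated probability of containing `target`."""
    scores = {}
    for loc in candidates:
        prompt = (
            f"On a scale from 0 to 1, how likely is a '{target}' "
            f"to be found in/on the '{loc}'? Answer with a single number."
        )
        scores[loc] = query_llm(prompt)
    return sorted(candidates, key=lambda loc: scores[loc], reverse=True)

# Usage: search the most semantically likely spots first.
order = rank_search_locations("keys", ["kitchen drawer", "coat pocket", "bathtub"])
```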
Can vision and language models be extended to include touch? Yes! We will present a new touch-vision-language dataset collected in the wild and Touch-Vision-Language Models (TVLMs) trained on this dataset at #ICML2024. 1/6
tactile-vlm.github.io
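A hedged sketch of how touch can be pulled into a vision-language embedding space (illustrative, not the released TVL training code): train a small tactile encoder against frozen vision/language embeddings with a CLIP-style contrastive loss. All dimensions and the toy encoder are assumptions.

```python
# Hedged sketch: align a tactile encoder with a frozen vision-language embedding
# space via an InfoNCE contrastive loss. Shapes and architecture are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TactileEncoder(nn.Module):
    """Toy ConvNet mapping a tactile image to the shared embedding space."""
    def __init__(self, embed_dim: int = 512):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.proj = nn.Linear(64, embed_dim)

    def forward(self, x):
        return F.normalize(self.proj(self.backbone(x)), dim=-1)

def contrastive_loss(tactile_emb, anchor_emb, temperature: float = 0.07):
    """Symmetric InfoNCE over a batch of paired (tactile, vision/language) embeddings."""
    logits = tactile_emb @ anchor_emb.t() / temperature
    targets = torch.arange(len(logits), device=logits.device)
    return 0.5 * (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets))

# Usage with dummy data: paired tactile frames and frozen vision/text embeddings.
tactile = torch.randn(8, 3, 64, 64)
frozen_vl_emb = F.normalize(torch.randn(8, 512), dim=-1)  # e.g., from a CLIP-style model
loss = contrastive_loss(TactileEncoder()(tactile), frozen_vl_emb)
```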
Vision-language models perform diverse tasks via in-context learning. Time for robots to do the same! Introducing In-Context Robot Transformer (ICRT): a robot policy that learns new tasks by prompting with robot trajectories, without any fine-tuning.
icrt.dev
[1/N]
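A hedged sketch of the in-context idea (not the ICRT architecture itself): interleave (observation, action) tokens from demonstrations of a new task as a prompt, append the current observation, and let a causal transformer predict the next action with no weight updates. Dimensions and the toy model are assumptions.

```python
# Hedged sketch: a causal transformer policy prompted with demonstration
# (observation, action) pairs; it decodes an action for the query observation
# without any fine-tuning. All sizes are illustrative.
import torch
import torch.nn as nn

class InContextPolicy(nn.Module):
    def __init__(self, obs_dim=32, act_dim=7, d_model=128, n_layers=4, n_heads=4):
        super().__init__()
        self.obs_in = nn.Linear(obs_dim, d_model)
        self.act_in = nn.Linear(act_dim, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.transformer = nn.TransformerEncoder(layer, n_layers)
        self.act_out = nn.Linear(d_model, act_dim)

    def forward(self, prompt_obs, prompt_act, current_obs):
        # Interleave prompt tokens as [obs_0, act_0, obs_1, act_1, ...].
        B, T, _ = prompt_obs.shape
        tokens = torch.stack([self.obs_in(prompt_obs), self.act_in(prompt_act)], dim=2)
        tokens = tokens.reshape(B, 2 * T, -1)
        # Append the query observation as the final token.
        tokens = torch.cat([tokens, self.obs_in(current_obs).unsqueeze(1)], dim=1)
        # Causal mask so each token only attends to earlier context.
        S = tokens.shape[1]
        mask = torch.triu(torch.full((S, S), float("-inf")), diagonal=1)
        h = self.transformer(tokens, mask=mask)
        return self.act_out(h[:, -1])  # action predicted for the query observation

# Usage: a short demo trajectory serves as the "prompt" for a new task.
policy = InContextPolicy()
action = policy(torch.randn(1, 10, 32), torch.randn(1, 10, 7), torch.randn(1, 32))
```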
1/N Most Vision-Language-Action models need tons of data for finetuning, and still fail for new objects and instructions. Introducing OTTER, a lightweight, easy-to-train model that uses text-aware visual features to nail unseen tasks out of the box! Here's how it works 👇
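One way to read "text-aware visual features", sketched below under assumptions (this is illustrative, not the OTTER code): let the instruction's text embedding query frozen visual patch tokens via cross-attention, so only language-relevant features reach the policy head.

```python
# Hedged sketch: text-conditioned selection of frozen visual features feeding a
# small policy head. Encoders are assumed frozen and external; sizes illustrative.
import torch
import torch.nn as nn

class TextAwarePolicy(nn.Module):
    def __init__(self, vis_dim=768, txt_dim=512, d_model=256, act_dim=7):
        super().__init__()
        self.vis_proj = nn.Linear(vis_dim, d_model)
        self.txt_proj = nn.Linear(txt_dim, d_model)
        self.cross_attn = nn.MultiheadAttention(d_model, num_heads=4, batch_first=True)
        self.policy_head = nn.Sequential(
            nn.Linear(d_model, 256), nn.ReLU(), nn.Linear(256, act_dim)
        )

    def forward(self, patch_tokens, text_emb):
        # patch_tokens: (B, N, vis_dim) from a frozen vision encoder (e.g., ViT patches)
        # text_emb:     (B, txt_dim)    from a frozen text encoder for the instruction
        q = self.txt_proj(text_emb).unsqueeze(1)   # (B, 1, d_model) language query
        kv = self.vis_proj(patch_tokens)           # (B, N, d_model) visual keys/values
        fused, _ = self.cross_attn(q, kv, kv)      # language-selected visual feature
        return self.policy_head(fused.squeeze(1))

# Usage with dummy frozen features:
policy = TextAwarePolicy()
action = policy(torch.randn(2, 196, 768), torch.randn(2, 512))
```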
Can we scale up robot data collection without a robot? We propose a real2render2real pipeline that scales a robot dataset from a single human demonstration; policies trained on the generated data can be deployed directly on a real robot.
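A high-level, hedged sketch of the demo-multiplication loop (all functions below are placeholders, not the actual pipeline): replay the demonstrated object-relative motion under many randomized scene configurations, render each one, and save the synthetic rollouts.

```python
# Hedged sketch: multiply one human demo into many rendered rollouts by
# randomizing the scene and retargeting the demonstrated motion. All helpers
# (`randomize_object_pose`, `retarget_demo`, `render_observation`) are stubs.
import numpy as np

def randomize_object_pose(rng):
    """Placeholder: sample a new object pose (x, y, yaw) on the workspace."""
    return rng.uniform([-0.2, -0.2, -np.pi], [0.2, 0.2, np.pi])

def retarget_demo(demo_waypoints, object_pose):
    """Placeholder: express object-frame demo waypoints in the new scene."""
    x, y, yaw = object_pose
    c, s = np.cos(yaw), np.sin(yaw)
    R = np.array([[c, -s], [s, c]])
    return demo_waypoints @ R.T + np.array([x, y])

def render_observation(object_pose, gripper_xy):
    """Placeholder: call a renderer/simulator and return an image array."""
    return np.zeros((128, 128, 3), dtype=np.uint8)

def generate_dataset(demo_waypoints, n_rollouts=1000, seed=0):
    rng = np.random.default_rng(seed)
    dataset = []
    for _ in range(n_rollouts):
        pose = randomize_object_pose(rng)
        traj = retarget_demo(demo_waypoints, pose)
        rollout = [(render_observation(pose, wp), wp) for wp in traj]
        dataset.append(rollout)
    return dataset

# One human demo (2-D waypoints in the object frame) becomes many rendered rollouts.
data = generate_dataset(np.array([[0.0, 0.1], [0.0, 0.05], [0.0, 0.0]]), n_rollouts=10)
```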
Can we track object part motions from a monocular video? Check out POD! With an object scan and a monocular video, we can learn an object configuration model. This could be useful for reconstructing articulated objects for robot learning.
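A hedged sketch of the general recipe for this kind of tracking (not the POD implementation): given a part-segmented object scan and a monocular video, optimize per-frame part poses by minimizing a rendering loss against the frames. `render_parts` is a placeholder for a differentiable renderer.

```python
# Hedged sketch: per-frame part-pose optimization against a monocular video.
# `render_parts` is a stub; a real version might use a differentiable renderer
# such as nvdiffrast or PyTorch3D.
import torch

def render_parts(part_meshes, part_poses):
    """Placeholder returning an (H, W, 3) image that depends on the poses."""
    return part_poses.sum() * torch.ones(64, 64, 3)

def track_part_poses(part_meshes, video_frames, n_steps=200, lr=1e-2):
    T, n_parts = len(video_frames), len(part_meshes)
    poses = torch.zeros(T, n_parts, 6, requires_grad=True)  # per-frame 6-DoF part poses
    opt = torch.optim.Adam([poses], lr=lr)
    for _ in range(n_steps):
        opt.zero_grad()
        # Photometric loss between rendered parts and observed frames.
        loss = sum(
            ((render_parts(part_meshes, poses[t]) - video_frames[t]) ** 2).mean()
            for t in range(T)
        )
        loss.backward()
        opt.step()
    return poses.detach()

# Usage with dummy data: two "parts" tracked over a 5-frame video.
poses = track_part_poses(part_meshes=[None, None],
                         video_frames=torch.rand(5, 64, 64, 3), n_steps=5)
```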
(1/N) How close are we to enabling robots to solve the long-horizon, complex tasks that matter in everyday life?
🚨 We are thrilled to invite you to join the 1st BEHAVIOR Challenge @NeurIPS 2025, submission deadline: 11/15.
🏆 Prizes:
🥇 $1,000
🥈 $500
🥉 $300