
Moo Jin Kim
@moo_jin_kim
CS PhD student @Stanford | Research Intern @NVIDIA | AI/ML & Robotics
ID: 1518627093197692928
https://moojink.com 25-04-2022 16:24:57
43 Tweet
1,1K Takipçi
99 Takip Edilen


Can we train VLAs to think about what to do next—visually—before executing tasks? In this work led by Qingqing Zhao, we found that *visual* chain-of-thought reasoning enhances policy success rates + enables VLAs to leverage unlabeled video data during pretraining! #CVPR2025
