Moo Jin Kim (@moo_jin_kim) Twitter Tweets • TwiCopy

Moo Jin Kim

@moo_jin_kim

CS PhD student @Stanford | Research Intern @NVIDIA | AI/ML & Robotics

ID: 1518627093197692928

https://moojink.com | 25-04-2022 16:24:57

43 Tweets

1.1K Followers

99 Following

Moo Jin Kim

@moo_jin_kim

a year ago

Awesome Zipeng Fu!! 👏 Love the demos, and great to see that everything is open-source!

Likes: 7 | Replies: 0 | Retweets: 2

Can we train VLAs to think about what to do next—visually—before executing tasks? In this work led by Qingqing Zhao, we found that *visual* chain-of-thought reasoning enhances policy success rates + enables VLAs to leverage unlabeled video data during pretraining! #CVPR2025

Likes: 81 | Replies: 0 | Retweets: 13

Fahim Tajwar

@fahimtajwar10

6 months ago

RL with verifiable reward has shown impressive results in improving LLM reasoning, but what can we do when we do not have ground truth answers? Introducing Self-Rewarding Training (SRT): where language models provide their own reward for RL training! 🧵 1/n

Likes: 819 | Replies: 20 | Retweets: 136