Jitendra MALIK (@jitendramalikcv) 's Twitter Profile
Jitendra MALIK

@jitendramalikcv

ID: 1466979338322993156

calendar_today04-12-2021 03:55:31

31 Tweet

4,4K Followers

1 Following

Karttikeya Mangalam (@karttikeya_m) 's Twitter Profile Photo

Every CV guy I know has privately admitted at some point that current video datasets do not really seem to care about time. That the video tasks are "too short" & don't test much time understanding We introduce EgoSchema -- A litmus test for truly long-form video understanding

Every CV guy I know has privately admitted at some point that current video datasets do not really seem to care about time. 

That the video tasks are "too short" & don't test much time understanding

We introduce EgoSchema -- A litmus test for truly long-form video understanding
Yutong Bai (@yutongbai1002) 's Twitter Profile Photo

How far can we go with vision alone? Excited to reveal our Large Vision Model! Trained with 420B tokens, effective scalability, and enabling new avenues in vision tasks! (1/N) Kudos to Xinyang (Young) Geng Karttikeya Mangalam Amir Bar, Alan Yuille Trevor Darrell Jitendra MALIK Alyosha Efros!

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Together with the Ego4D consortium, today we're releasing Ego-Exo4D, the largest ever public dataset of its kind to support research on video learning & multimodal perception — including 1,400+ hours of videos of skilled human activities. Download ➡️ bit.ly/3teP49w

Jitendra MALIK (@jitendramalikcv) 's Twitter Profile Photo

We cast real-world humanoid control as a next token prediction problem, akin to predicting the next word in language. Check out our robot walking in San Francisco (Ilija Radosavovic et al) …anoid-next-token-prediction.github.io

Jitendra MALIK (@jitendramalikcv) 's Twitter Profile Photo

Another success of sim-to-real for training robot policies! This task, using two multi-fingered hands, requires considerable dexterity, and is hopefully representative of other household tasks that we wish to solve in the future.

Toru (@toruo_o) 's Twitter Profile Photo

Imitation learning works™ – but you need good data 🥹 How to get high-quality visuotactile demos from a bimanual robot with multifingered hands, and learn smooth policies? Check our new work “Learning Visuotactile Skills with Two Multifingered Hands”! 🙌 toruowo.github.io/hato/

Peter Stone (@peterstone_tx) 's Twitter Profile Photo

10 years after DQN, what are deep RL’s impacts on robotics? Which robotic problems have seen the most thrilling real-world successes thanks to DRL? Where do we still need to push the boundaries, and how? Our latest survey explores these questions! Read on for more details. 👇

10 years after DQN, what are deep RL’s impacts on robotics? Which robotic problems have seen the most thrilling real-world successes thanks to DRL? Where do we still need to push the boundaries, and how?

Our latest survey explores these questions!  Read on for more details. 👇
Vongani Maluleke (@vonekels) 's Twitter Profile Photo

Please see the website for more details. synNsync🪩is joint work with my awesome ✨co-authors✨: Lea Müller Jathushan Rajasegaran Georgios Pavlakos Shiry Ginosar Angjoo Kanazawa Jitendra MALIK Website🖥️: von31.github.io/synNsync/ Data💾: github.com/Von31/swing_da… Arxiv📜: arxiv.org/abs/2409.04440 🧵6/6

Jitendra MALIK (@jitendramalikcv) 's Twitter Profile Photo

Happy to share these exciting new results on video synthesis of humans in movement. Arguably, these establish the power of having explicit 3D representations. Popular video generation models like Sora don't do that, making it hard for the resulting video to be 4D consistent.

Jitendra MALIK (@jitendramalikcv) 's Twitter Profile Photo

I'm happy to post course materials for my class at UC Berkeley "Robots that Learn", taught with the outstanding assistance of Toru. Lecture videos at youtube.com/playlist?list=… Lecture notes & other course materials at robots-that-learn.github.io

Jitendra MALIK (@jitendramalikcv) 's Twitter Profile Photo

Angjoo Kanazawa Angjoo Kanazawa and I taught CS 280, graduate computer vision, this semester at UC Berkeley. We found a combination of classical and modern CV material that worked well, and are happy to share our lecture material from the class. cs280-berkeley.github.io Enjoy!