Jitendra MALIK (@jitendramalikcv) Twitter Tweets • TwiCopy

Karttikeya Mangalam

2 years ago

Every CV guy I know has privately admitted at some point that current video datasets do not really seem to care about time. That the video tasks are "too short" & don't test much time understanding We introduce EgoSchema -- A litmus test for truly long-form video understanding

thumb_up_off_alt172

chat_bubble_outline6

repeat30

shareShare

Yutong Bai

@yutongbai1002

2 years ago

How far can we go with vision alone? Excited to reveal our Large Vision Model! Trained with 420B tokens, effective scalability, and enabling new avenues in vision tasks! (1/N) Kudos to Xinyang (Young) Geng Karttikeya Mangalam Amir Bar, Alan Yuille Trevor Darrell Jitendra MALIK Alyosha Efros!

thumb_up_off_alt1,1K

chat_bubble_outline17

repeat158

shareShare

Jitendra MALIK

@jitendramalikcv

2 years ago

Happy to present LVM (Large Vision Model). Scalable and tasks can be specified via prompts. Enjoy!

thumb_up_off_alt228

chat_bubble_outline1

repeat28

shareShare

AI at Meta

@aiatmeta

2 years ago

Together with the Ego4D consortium, today we're releasing Ego-Exo4D, the largest ever public dataset of its kind to support research on video learning & multimodal perception — including 1,400+ hours of videos of skilled human activities. Download ➡️ bit.ly/3teP49w

thumb_up_off_alt1,1K

chat_bubble_outline27

repeat238

shareShare

Ilija Radosavovic

@ir413

2 years ago

hello san francisco

thumb_up_off_alt478

chat_bubble_outline43

repeat43

shareShare

Jitendra MALIK

@jitendramalikcv

2 years ago

Want to make your photorealistic 3D avatar dance like your favorite actor? Check this out!

thumb_up_off_alt34

chat_bubble_outline0

repeat2

shareShare

Jitendra MALIK

@jitendramalikcv

2 years ago

We cast real-world humanoid control as a next token prediction problem, akin to predicting the next word in language. Check out our robot walking in San Francisco (Ilija Radosavovic et al) …anoid-next-token-prediction.github.io

thumb_up_off_alt192

chat_bubble_outline3

repeat24

shareShare

Jitendra MALIK

@jitendramalikcv

2 years ago

Another success of sim-to-real for training robot policies! This task, using two multi-fingered hands, requires considerable dexterity, and is hopefully representative of other household tasks that we wish to solve in the future.

thumb_up_off_alt63

chat_bubble_outline1

repeat4

shareShare

Toru

@toruo_o

2 years ago

Imitation learning works™ – but you need good data 🥹 How to get high-quality visuotactile demos from a bimanual robot with multifingered hands, and learn smooth policies? Check our new work “Learning Visuotactile Skills with Two Multifingered Hands”! 🙌 toruowo.github.io/hato/

thumb_up_off_alt289

chat_bubble_outline8

repeat76

shareShare

Neerja Thakkar

@neerjathakkar

a year ago

It was great to work with Karttikeya Mangalam, Andrea Bajcsy and Jitendra MALIK! Project: neerja.me/atp_latent_cor… Arxiv: arxiv.org/abs/2312.06653

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Peter Stone

@peterstone_tx

a year ago

10 years after DQN, what are deep RL’s impacts on robotics? Which robotic problems have seen the most thrilling real-world successes thanks to DRL? Where do we still need to push the boundaries, and how? Our latest survey explores these questions! Read on for more details. 👇

thumb_up_off_alt512

chat_bubble_outline2

repeat99

shareShare

Himanshu Gaurav Singh

@cinnabar233

a year ago

Fun collaboration w/ Antonio Loquercio, Carlo Sferrazza, Haozhi Qi, @jane_h_wu, Pieter Abbeel, Jitendra MALIK Checkout our paper at arxiv.org/pdf/2409.08273. We release code at github.com/hgaurav2k/hop.

thumb_up_off_alt17

chat_bubble_outline0

repeat1

shareShare

Vongani Maluleke

@vonekels

a year ago

Please see the website for more details. synNsync🪩is joint work with my awesome ✨co-authors✨: Lea Müller Jathushan Rajasegaran Georgios Pavlakos Shiry Ginosar Angjoo Kanazawa Jitendra MALIK Website🖥️: von31.github.io/synNsync/ Data💾: github.com/Von31/swing_da… Arxiv📜: arxiv.org/abs/2409.04440 🧵6/6

thumb_up_off_alt25

chat_bubble_outline0

repeat1

shareShare

Jitendra MALIK

@jitendramalikcv

a year ago

Autoregressive modeling is not just for language, it can equally be used to model human behavior. This paper shows how..

thumb_up_off_alt83

chat_bubble_outline0

repeat2

shareShare

Jitendra MALIK

@jitendramalikcv

a year ago

Touche', Sergey!

thumb_up_off_alt54

chat_bubble_outline0

repeat0

shareShare

Jitendra MALIK

@jitendramalikcv

a year ago

Happy to share these exciting new results on video synthesis of humans in movement. Arguably, these establish the power of having explicit 3D representations. Popular video generation models like Sora don't do that, making it hard for the resulting video to be 4D consistent.

thumb_up_off_alt70

chat_bubble_outline0

repeat7

shareShare

Jitendra MALIK

@jitendramalikcv

a year ago

I'm happy to post course materials for my class at UC Berkeley "Robots that Learn", taught with the outstanding assistance of Toru. Lecture videos at youtube.com/playlist?list=… Lecture notes & other course materials at robots-that-learn.github.io

thumb_up_off_alt1,1K

chat_bubble_outline17

repeat248

shareShare

Jitendra MALIK

@jitendramalikcv

8 months ago

Enjoy watching a humanoid walking around UC Berkeley. It only looks inebriated :-)

thumb_up_off_alt87

chat_bubble_outline1

repeat2

shareShare

Jitendra MALIK

@jitendramalikcv

7 months ago

Angjoo Kanazawa Angjoo Kanazawa and I taught CS 280, graduate computer vision, this semester at UC Berkeley. We found a combination of classical and modern CV material that worked well, and are happy to share our lecture material from the class. cs280-berkeley.github.io Enjoy!

thumb_up_off_alt745

chat_bubble_outline8

repeat102

shareShare