Carl Doersch (@carldoersch)'s Twitter Profile
Carl Doersch

@carldoersch

Researcher at DeepMind

ID: 852856132632797184

Website: http://carldoersch.com | Joined: 14-04-2017 12:08:42

61 Tweets

2.2K Followers

289 Following

Carl Doersch (@carldoersch):

Just in time for CVPR, we've released code to generate "rainbow visualizations" from a set of point tracks: it semi-automatically segments foreground objects and corrects for camera motion. Try our Colab demo at colab.sandbox.google.com/github/deepmin… (video source: youtube.com/watch?v=yuQFQ8…)
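
The tweet describes a two-step idea: estimate the dominant camera motion from the point tracks, then flag tracks that move against it as foreground. Below is a minimal illustrative sketch of that idea in Python, not the released code; the function name and threshold are assumptions. It fits a RANSAC homography between consecutive frames (letting the presumed-majority background points dominate) and scores each track by its residual motion.

import numpy as np
import cv2

def split_foreground(tracks, thresh=2.0):
    """tracks: float array of shape (num_points, num_frames, 2), pixel coords.

    Returns a boolean mask: True where a track moves against the dominant
    (camera) motion, i.e. a likely foreground point. Illustrative sketch only.
    """
    num_points, num_frames, _ = tracks.shape
    residuals = np.zeros(num_points)
    for t in range(num_frames - 1):
        src = tracks[:, t].astype(np.float32)
        dst = tracks[:, t + 1].astype(np.float32)
        # Robustly fit the dominant inter-frame motion with RANSAC.
        H, _ = cv2.findHomography(src, dst, cv2.RANSAC, thresh)
        if H is None:
            continue
        warped = cv2.perspectiveTransform(src[:, None, :], H)[:, 0, :]
        residuals += np.linalg.norm(warped - dst, axis=-1)
    residuals /= max(num_frames - 1, 1)
    return residuals > thresh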

Dima Damen (@dimadamen):

Can you win the 2nd Perception Test Challenge? A European Conference on Computer Vision #ECCV2026 workshop: ptchallenge-workshop.github.io. Diagnose audio-visual MLMs on their ability to model memory, physics, abstraction & semantics through 6 tasks: VQA, point tracking, box tracking, and action/sound localisation, jointly! Google DeepMind + win 💰

Skanda (@skandakoppula):

We're excited to release TAPVid-3D: an evaluation benchmark of 4,000+ real-world videos and 2.1 million metric 3D point trajectories, for the task of Tracking Any Point in 3D!
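
For context, evaluating tracking in metric 3D typically measures how close predicted trajectories are to ground truth in meters. The sketch below is a hedged illustration of such a metric, with names and thresholds that are mine rather than the benchmark's official ones: the fraction of visible points within a metric distance of ground truth, averaged over several thresholds.

import numpy as np

def average_3d_accuracy(pred, gt, visible, thresholds=(0.05, 0.1, 0.2, 0.4, 0.8)):
    """pred, gt: (num_points, num_frames, 3) metric 3D coordinates in meters.
    visible: (num_points, num_frames) boolean mask of valid ground truth.
    Illustrative metric, not necessarily TAPVid-3D's official evaluation."""
    dist = np.linalg.norm(pred - gt, axis=-1)  # per-point, per-frame error (m)
    # Accuracy at each threshold, then averaged, PCK-style.
    return float(np.mean([np.mean(dist[visible] < t) for t in thresholds]))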

Carl Doersch (@carldoersch):

Want to make a difference with point tracking? The medical community needs help tracking tissue deformation during surgery! Participate in the STIR challenge (stir-challenge.github.io) at MICCAI, deadline in September.

Carl Doersch (@carldoersch):

Want a robot to solve a task, specified in language? Generate a video of a person doing it, and then retarget the action to the robot with the help of point tracking! Cool collab with Homanga Bharadhwaj during his student researcher stint at Google.
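
One way to picture the retargeting step: track a point on the human hand through the generated video, lift the pixel track to 3D using per-frame depth, and hand the resulting waypoints to the robot's controller. The sketch below is a loose, hypothetical illustration of that geometry (standard pinhole unprojection), not the paper's method.

import numpy as np

def pixels_to_waypoints(track_uv, depth, K):
    """track_uv: (num_frames, 2) pixel track of the hand.
    depth: (num_frames,) metric depth at those pixels.
    K: 3x3 camera intrinsics. Returns (num_frames, 3) camera-frame waypoints."""
    fx, fy = K[0, 0], K[1, 1]
    cx, cy = K[0, 2], K[1, 2]
    # Unproject each tracked pixel to a 3D point in the camera frame.
    x = (track_uv[:, 0] - cx) / fx * depth
    y = (track_uv[:, 1] - cy) / fy * depth
    return np.stack([x, y, depth], axis=-1)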

Daniel Geng (@dangengdg):

What happens when you train a video generation model to be conditioned on motion? It turns out you can perform "motion prompting," just like you might prompt an LLM! Doing so enables many different capabilities. Here are a few examples; check out this thread 🧵 for more results!
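
To make "conditioning on motion" concrete, here is one plausible encoding (illustrative only, not necessarily the paper's): rasterize sparse point tracks into per-frame spatial maps carrying each track's instantaneous motion, which the video model can then consume alongside its other inputs.

import numpy as np

def tracks_to_conditioning(tracks, height, width):
    """tracks: (num_points, num_frames, 2) pixel coordinates.
    Returns (num_frames, height, width, 2) maps of (du, dv) motion,
    zero everywhere except at the tracked points. Hypothetical encoding."""
    num_points, num_frames, _ = tracks.shape
    cond = np.zeros((num_frames, height, width, 2), dtype=np.float32)
    # Instantaneous motion per track; repeat the last frame so shapes match.
    motion = np.diff(tracks, axis=1, append=tracks[:, -1:])
    for p in range(num_points):
        for f in range(num_frames):
            u, v = tracks[p, f]
            if 0 <= u < width and 0 <= v < height:
                cond[f, int(v), int(u)] = motion[p, f]
    return cond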

Kelsey Allen (@kelseyrallen):

Humans can tell the difference between a realistic generated video and an unrealistic one – can models? Excited to share TRAJAN: the world’s first point TRAJectory AutoeNcoder for evaluating motion realism in generated and corrupted videos. 🌐 trajan-paper.github.io 🧵
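
The underlying idea: fit an autoencoder to point trajectories from real video, then score a new video by how well its trajectories reconstruct; implausible motion reconstructs poorly. TRAJAN itself is a learned model, so the toy linear (PCA-style) version below only illustrates the scoring scheme, with all names my own.

import numpy as np

def fit_linear_autoencoder(real_tracks, latent_dim=8):
    """real_tracks: (num_tracks, num_frames * 2) flattened real trajectories.
    Returns the (mean, basis) of a PCA-style linear autoencoder."""
    mean = real_tracks.mean(axis=0)
    _, _, vt = np.linalg.svd(real_tracks - mean, full_matrices=False)
    return mean, vt[:latent_dim]  # top principal directions of real motion

def realism_score(tracks, mean, basis):
    """Higher = motion better explained by the model of real trajectories."""
    centered = tracks - mean
    recon = (centered @ basis.T) @ basis  # encode then decode
    return -float(np.mean(np.sum((centered - recon) ** 2, axis=-1)))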

Carl Doersch (@carldoersch):

I "found" a few of the tasks in this video, and it's hard to convey the feeling to those who don't know the training data. Just know that a few episodes started with someone saying something like "this robot hasn't seen anything like this task; I doubt it'll work..."