Vincent Sitzmann (@vincesitzmann) 's Twitter Profile
Vincent Sitzmann

@vincesitzmann

Teaching AI to see, model, and interact with our 3D world. Assistant Professor @ MIT, leading the Scene Representation Group (scenerepresentations.org).

ID: 4872666044

linkhttp://vincentsitzmann.com calendar_today07-02-2016 08:09:03

737 Tweet

15,15K Takipçi

305 Takip Edilen

Phillip Isola (@phillip_isola) 's Twitter Profile Photo

Vincent Sitzmann I agree, I think these initiatives are well meaning but have gone too far. To me they cross the line from useful guidlines to infantalizing dictums.

Chonghyuk (Andrew) Song (@ndsong95) 's Twitter Profile Photo

Introducing Generative View Stitching (GVS), a non-autoregressive sampling method for length extrapolation of video diffusion models. GVS enables collision-free camera-guided video generation for predefined trajectories, including Oscar Reutersvärd's Impossible Staircase (1/9).

Phillip Isola (@phillip_isola) 's Twitter Profile Photo

Arxiv has been such a wonderful service but I think this is a step in the wrong direction. We have other venues for peer review. To me the value of arxiv lies precisely in its lack of excessive moderation. I'd prefer it as "github for science," rather than yet another journal.

George Cazenavette (@gcazenavette) 's Twitter Profile Photo

Happy to finally share our latest work on Dataset Distillation! "Dataset Distillation for Pre-Trained Self-Supervised Vision Models," set to appear at #NeurIPS 2025! We learn 1 image per class to train linear heads for pre-trained models. linear-gradient-matching.github.io 1/6

Dmytro Mishkin 🇺🇦 (@ducha_aiki) 's Twitter Profile Photo

Understanding Multi-View Transformers Michal Stary Julien Gaubil Ayush Tewari Vincent Sitzmann tl;dr: DUSt3R self-attention is it secretly a diffusion model, and cross-attention is matching. arxiv.org/abs/2510.24907

Understanding Multi-View Transformers

Michal Stary <a href="/jgaubil/">Julien Gaubil</a> <a href="/_atewari/">Ayush Tewari</a>  <a href="/vincesitzmann/">Vincent Sitzmann</a> 

tl;dr: DUSt3R self-attention is it secretly a diffusion model, and cross-attention is matching. 
arxiv.org/abs/2510.24907
Elliott / Shangzhe Wu (@elliottszwu) 's Twitter Profile Photo

I'm looking for two PhD students to join our team at Cambridge to work on 3D/4D modeling in various domains including generative media, robotics, and biology. Apply to the PhD in Engineering program by December 2 ⌛️: postgraduate.study.cam.ac.uk/courses/direct…

I'm looking for two PhD students to join our team at Cambridge to work on 3D/4D modeling in various domains including generative media, robotics, and biology.

Apply to the PhD in Engineering program by December 2 ⌛️: postgraduate.study.cam.ac.uk/courses/direct…