Vincent Sitzmann
@vincesitzmann
Teaching AI to see, model, and interact with our 3D world. Assistant Professor @ MIT, leading the Scene Representation Group (scenerepresentations.org).
ID: 4872666044
http://vincentsitzmann.com 07-02-2016 08:09:03
737 Tweet
15,15K Takipçi
305 Takip Edilen
Vincent Sitzmann I agree, I think these initiatives are well meaning but have gone too far. To me they cross the line from useful guidlines to infantalizing dictums.
Understanding Multi-View Transformers Michal Stary Julien Gaubil Ayush Tewari Vincent Sitzmann tl;dr: DUSt3R self-attention is it secretly a diffusion model, and cross-attention is matching. arxiv.org/abs/2510.24907