Rundi Wu (@chriswu6080) 's Twitter Profile
Rundi Wu

@chriswu6080

CS PhD student at Columbia University

ID: 902689575046221826

Link: https://www.cs.columbia.edu/~rundi/ · Joined: 30-08-2017 00:29:00

33 Tweets

533 Followers

267 Following

AK (@_akhaliq) 's Twitter Profile Photo

PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation. Realistic object interactions are crucial for creating immersive virtual experiences, yet synthesizing realistic 3D object dynamics in response to novel interactions remains a significant challenge.

Tianyuan Zhang (@tianyuanzhang99) 's Twitter Profile Photo

3D Gaussian is great, but how can you interact with it 🌹👋? Introducing #PhysDreamer: Create your own realistic interactive 3D assets from only static images! Discover how we do this below👇 🧵1/: Website: physdreamer.github.io

Rundi Wu (@chriswu6080) 's Twitter Profile Photo

I’ll be at #ICLR2024 Vienna during May 7-11! Come and check out our paper! Happy to chat about anything! Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape. Poster on May 9 at 10:45am.

Rundi Wu (@chriswu6080) 's Twitter Profile Photo

I'm at #CVPR2024 Seattle this week. Happy to chat about anything! Please come and visit our ReconFusion poster on Friday 21 Jun, 10:30 a.m., Arch 4A-E, Poster #193. reconfusion.github.io

Jeremy Klotz (@jklotz_) 's Twitter Profile Photo

At #ECCV2024, we presented Minimalist Vision with Freeform Pixels, a new vision paradigm that uses a small number of freeform pixels to solve lightweight vision tasks. We are honored to have received the Best Paper Award! Check out the project here: cave.cs.columbia.edu/projects/categ…

Ben Poole (@poolio) 's Twitter Profile Photo

Stop watching videos, start interacting with worlds. Stoked to share CAT4D, our new method for turning videos into dynamic 3D scenes that you can move through in real-time!

Aleksander Holynski (@holynski_) 's Twitter Profile Photo

Check out our new paper that turns (text, sparse images, videos) => (dynamic 3D scenes)! I can't get over how cool the interactive demo is. Try it out for yourself on the project page: cat-4d.github.io

Ben Poole (@poolio) 's Twitter Profile Photo

Woohoo, big congrats to the World Labs team! Tech looks similar to CAT3D (cat3d.github.io): multi-view diffusion model + 3DGS, maybe w/360 data + depth priors. To bring these worlds to life with dynamics, check out our new work on CAT4D: cat-4d.github.io 😺
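
For readers less familiar with the 3DGS half of that recipe: a scene is represented as a large set of anisotropic 3D Gaussians, each with a position, a covariance factored into scale and rotation, an opacity, and view-dependent color. Below is a minimal Python sketch of that parameterization; the field names and the spherical-harmonics coefficient count are illustrative assumptions, not code from CAT3D or CAT4D.

```python
# Minimal sketch of the 3D Gaussian Splatting (3DGS) scene parameterization
# mentioned above. Field names and the SH coefficient count are illustrative.
from dataclasses import dataclass
import numpy as np

@dataclass
class Gaussian3D:
    mean: np.ndarray       # (3,)   center in world space
    scale: np.ndarray      # (3,)   per-axis scale of the ellipsoid
    rotation: np.ndarray   # (4,)   unit quaternion (w, x, y, z)
    opacity: float         # in [0, 1], used for alpha compositing
    sh_coeffs: np.ndarray  # (K, 3) spherical-harmonics RGB coefficients

def covariance(g: Gaussian3D) -> np.ndarray:
    """Sigma = R diag(scale)^2 R^T, the standard 3DGS factorization."""
    w, x, y, z = g.rotation / np.linalg.norm(g.rotation)
    R = np.array([
        [1 - 2*(y*y + z*z), 2*(x*y - w*z),     2*(x*z + w*y)],
        [2*(x*y + w*z),     1 - 2*(x*x + z*z), 2*(y*z - w*x)],
        [2*(x*z - w*y),     2*(y*z + w*x),     1 - 2*(x*x + y*y)],
    ])
    S = np.diag(g.scale)
    return R @ S @ S @ R.T

# A "scene" is simply a long list of such Gaussians (often millions),
# rendered by projecting each one and alpha-compositing front to back.
g = Gaussian3D(mean=np.zeros(3), scale=np.array([0.10, 0.10, 0.02]),
               rotation=np.array([1.0, 0.0, 0.0, 0.0]), opacity=0.9,
               sh_coeffs=np.zeros((9, 3)))
print(covariance(g))
```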

Ruiqi Gao (@ruiqigao) 's Twitter Profile Photo

A common question nowadays: Which is better, diffusion or flow matching? 🤔 Our answer: They’re two sides of the same coin. We wrote a blog post to show how diffusion models and Gaussian flow matching are equivalent. That’s great: It means you can use them interchangeably.

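To make that equivalence concrete: with the linear schedule alpha_t = 1 - t, sigma_t = t, the noise, clean-data, and velocity predictions are linear re-parameterizations of one another, and a flow-matching Euler step is exactly a deterministic DDIM step. The NumPy sketch below checks this numerically with a toy stand-in for the network; the schedule and variable names are assumptions for illustration, not code from the blog post.

```python
# Numerical check that, under the linear schedule alpha_t = 1 - t, sigma_t = t,
# a flow-matching Euler step equals a deterministic DDIM step.
import numpy as np

rng = np.random.default_rng(0)
d = 8
x0, eps = rng.normal(size=d), rng.normal(size=d)

t, s = 0.7, 0.6                          # current and next time, s < t
x_t = (1 - t) * x0 + t * eps             # same corrupted sample in both views

# Toy "network output" in the diffusion parameterization (noise prediction).
eps_hat = eps + 0.05 * rng.normal(size=d)

# Re-express the same prediction in the other parameterizations.
x0_hat = (x_t - t * eps_hat) / (1 - t)   # clean-data prediction
v_hat = eps_hat - x0_hat                 # flow-matching velocity prediction

# Deterministic DDIM update: jump to the schedule point at time s.
x_s_ddim = (1 - s) * x0_hat + s * eps_hat

# Flow-matching Euler update with the velocity field.
x_s_flow = x_t - (t - s) * v_hat

print("identical updates:", np.allclose(x_s_ddim, x_s_flow))  # True
```
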
Zhengqi Li (@zhengqi_li) 's Twitter Profile Photo

Introducing MegaSaM! 🎥 Accurate, fast, & robust structure + camera estimation from casual monocular videos of dynamic scenes! MegaSaM outputs camera parameters and consistent video depth, scaling to long videos with unconstrained camera paths and complex scene dynamics!

Rundi Wu (@chriswu6080) 's Twitter Profile Photo

How do you perform robust 3D reconstruction in the presence of inconsistencies during capture (e.g., dynamic content or lighting changes)? Check out Alex Trevithick's SimVS --- simulating world inconsistencies using video generation models for robust view synthesis!

Linyi Jin (@jin_linyi) 's Twitter Profile Photo

Introducing 👀Stereo4D👀 A method for mining 4D from internet stereo videos. It enables large-scale, high-quality, dynamic, *metric* 3D reconstructions, with camera poses and long-term 3D motion trajectories. We used Stereo4D to make a dataset of over 100k real-world 4D scenes.

Stan Szymanowicz (@stanszymanowicz) 's Twitter Profile Photo

⚡️ Introducing Bolt3D ⚡️ Bolt3D generates interactive 3D scenes in less than 7 seconds on a single GPU from one or more images. It features a latent diffusion model that *directly* generates 3D Gaussians of seen and unseen regions, without any test time optimization. 🧵👇 (1/9)