Alexey Bokhovkin (@abokhovkin)'s Twitter Profile
Alexey Bokhovkin

@abokhovkin

Computer Vision researcher @ TUM
3D Indoor Understanding

ID: 1334896500275548161

04-12-2020 16:25:15

42 Tweets

343 Followers

233 Following

Matthias Niessner (@mattniessner)

Can we match visual features jointly across multiple frames? Yes! Barbara Roessle's #ICCV2023 paper proposes a differentiable pose optimization for end2end feature matching across multiple frames, thus obtaining better poses! barbararoessle.github.io/e2e_multi_view… youtu.be/uuLb6GfM9Cg

Angela Dai (@angelaqdai)

We've released the ScanNet++ data! Check it out: kaldir.vc.in.tum.de/scannetpp/ 280 high-fidelity 3D scenes w/ 1mm geometry, DSLR+iPhone images, semantics We're currently beta-testing, please bear with us - approval may initially take up to 2 weeks Test scenes and benchmark to come!

Matthias Niessner (@mattniessner)

Diffusion models are awesome! Check out our survey on Diffusion Models for Visual Computing! We give an introduction to diffusion models and highlight how they are used by state-of-the-art methods in graphics and vision. arxiv.org/abs/2310.07204

Angela Dai (@angelaqdai)

Check out Christian Diller's CG-HOI :) We generate realistic 3D human-object interactions, from object geometry and text description. A key ingredient is explicit modeling of contact, during training and as guidance during inference. cg-hoi.christian-diller.de youtube.com/watch?v=GNyQwT…

Angela Dai (@angelaqdai)

Check out our #CVPR'24 papers on 3D human interactions, generative 3D modeling, and uncertainty-aware and unsupervised 3D semantic scene understanding! Congrats to Lei Li, David Rozenberszki, Christian Diller, Yawar Siddiqui, Shivangi, Jiapeng Tang, and Anh-Quan Cao for their amazing work!

Alexey Artemov (@artonson)

AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans. A method for unsupervised instance segmentation of 3D outdoor LiDAR scenes. Project: artonson.github.io/publications/2… Video: youtube.com/watch?v=ioKJWY… Paper: arxiv.org/pdf/2403.16318…

Angela Dai (@angelaqdai)

Excited to present GenZI at #CVPR2024! Lei Li introduces GenZI, the first zero-shot approach to creating realistic 3D human-scene interactions by leveraging interaction priors from large VLMs. Code and data on our website! craigleili.github.io/projects/genzi/ youtu.be/ozfs6E0JIMY

Angela Dai (@angelaqdai)

Excited to present DiffCAD coming to #SIGGRAPH2024! Daoyi Gao introduces the first probabilistic single-view CAD retrieval & alignment. We train only on synthetic -> generalize robustly to real images! Check out the code: daoyig.github.io/DiffCAD_/ w/ David Rozenberszki, Stefan Leutenegger

Angela Dai (@angelaqdai)

How can we generate high-fidelity, complex 3D scenes? Quan Meng's LT3SD decomposes 3D scenes into latent tree representations, with diffusion on the latent trees enabling seamless infinite 3D scene synthesis! w/ Lei Li, Matthias Niessner quan-meng.github.io/projects/lt3sd/

Matthias Niessner (@mattniessner)

📢📢 GaussianSpeech: Audio-Driven Gaussian Avatars 📢📢 We synthesize photorealistic and 3D-consistent talking human head avatars driven directly from spoken audio. More specifically, we introduce an efficient 3DGS-based representation, combined with an

Angela Dai (@angelaqdai)

📢 DNF: Generating 4D animations with dictionary-based neural fields! Xinyi Zhang presents a new dictionary-based neural field for unconditional 4D generation of deforming shapes -- generating motions with high-quality shape and temporal consistency. xzhang-t.github.io/project/DNF/

Matthias Niessner (@mattniessner)

📢📢 GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion 📢📢 We reconstruct animatable Gaussian head avatars from monocular videos captured by commodity devices such as

Angela Dai (@angelaqdai)

📢 MeshArt: Generating Articulated Meshes with Structure-guided Transformers. Daoyi Gao generates articulated meshes with a hierarchical transformer, modeling articulation-aware structures that guide mesh synthesis. w/ Yawar Siddiqui, Lei Li. Project: daoyig.github.io/Mesh_Art/

Angela Dai (@angelaqdai)

Excited to announce ScanNet++ v2! 🎉 Chandan Yeshwanth and Yueh-Cheng Liu have been working tirelessly to bring: 🔹 1006 high-fidelity 3D scans 🔹 + DSLR & iPhone captures 🔹 + rich semantics. Elevating 3D scene understanding to the next level! 🚀 w/ Matthias Niessner kaldir.vc.in.tum.de/scannetpp

Angela Dai (@angelaqdai)

📢 ScanNet++ v2 Benchmark Release! 🏆 Test your state-of-the-art models on: 🔹 Novel View Synthesis 📸➡️🖼️ 🔹 3D Semantic & Instance Segmentation 🤖🕶️ Shoutout to Chandan Yeshwanth and Yueh-Cheng Liu for their incredible work! 🚀 Check it out: kaldir.vc.in.tum.de/scannetpp/

Matthias Niessner (@mattniessner)

📢 Animating the Uncaptured 📢 We animate 3D humanoid meshes using video diffusion priors given a text prompt. 🎥 youtu.be/_YL1J_V3smI marcb.pro/atu Realistic motion generation for 3D characters - without motion capture! 🚀 Great work by Marc Benedí, Angela Dai

Angela Dai (@angelaqdai)

📢 ExCap3D: Multilevel Captioning of Objects in 3D Scenes. Chandan Yeshwanth generates consistent object and part-level descriptions of objects in 3D scenes, and introduces a new dataset with 190k captions for 34k ScanNet++ objects. Project: cy94.github.io/excap3d w/ David Rozenberszki

Alexey Bokhovkin (@abokhovkin)

📢 SceneFactor code is released! SceneFactor is a factored latent diffusion for controllable, large-scale scene synthesis and editing! w/ Quan Meng, Shubham Tulsiani, Angela Dai. Check out the code here: github.com/alexeybokhovki…. We present SceneFactor at #CVPR2025 on Fri 13, -10:30