Anh Thai
@ngailapdi
ID: 4712972292
https://anhthai1997.wordpress.com/ 05-01-2016 10:08:06
203 Tweets
612 Followers
1.1K Following
📢#CVPR2025 Introducing InstaManip, a novel multimodal autoregressive model for few-shot image editing. 🎯InstaManip can learn a new image editing operation from textual and visual guidance via in-context learning, and apply it to new query images. [1/8] bolinlai.github.io/projects/Insta…
Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos Ziren Gong, Xiaohan Li, Fabio Tosi, Jiawei Han, Stefano Mattoccia, Jianfei Cai, Matteo Poggi tl;dr: CLIP->SLAM3R; CLIP+DINO+CG3D->2D-3D fused descriptor arxiv.org/abs/2507.22052
Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images Xiangyu Sun, Haoyi Jiang, Liu Liu, Seungtae Nam, Gyeongjin Kang, Xinjie Wang, Wei Sui, Zhizhong Su, Wenyu Liu, Xinggang Wang, Eunbyung Park tl;dr:
GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting Jiaxin Wei, Stefan Leutenegger, Simon Schaefer tl;dr: fuse mesh and 3DGS->rendered images->pretrained diffusion model+random mask augmentation->removes artifacts+inpainting+completion arxiv.org/abs/2508.14717