Yukang Cao (@yukangcao) 's Twitter Profile
Yukang Cao

@yukangcao

3D Computer Vision || Postdoc @NTUsg | Ex-{Ph.D @HKUniversity, B.Eng @ZJU_China} (To learn, to fail; To learn more, to fail more)

ID: 1680204767954599937

linkhttps://yukangcao.github.io/ calendar_today15-07-2023 13:16:53

53 Tweet

277 Takipçi

289 Takip Edilen

Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

📢Text-to-3D Foundation Model📢 Our #3DTopia has major updates, with 1) newly released technical report, and 2) our own *refined captions* for the Objaverse quality set - Code: github.com/3DTopia/3DTopia - Paper: arxiv.org/pdf/2403.02234… - Refined Objaverse: github.com/3DTopia/3DTopi…

Tengfei Wang (@dylantfwang) 's Twitter Profile Photo

👏🏻 👏🏻We now have several major updates for OpenLRM: (1) We release the full model trained on both objaverse and MVImgNet. (2) We release the full training code, which can help reproduce image-to-3d models. Code: github.com/3DTopia/OpenLRM HF demo: huggingface.co/spaces/zxhezex…

Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

🔥Interactive Text-to-Texture Synthesis🔥 We present #InTeX, an interactive framework for 3D text-to-texture synthesis, with *region repainting* and *real-time editing on laptop* - Project: me.kiui.moe/intex/ - Paper: arxiv.org/abs/2403.11878 - Code: github.com/ashawkey/InTeX

Yukang Cao (@yukangcao) 's Twitter Profile Photo

Wants to learn more about the past, present, and future of 3D human modeling? Check our recent work: A Survey on 3D Human Avatar Modeling - From Reconstruction to Generation. Hope you will get some valuable insights from it! arxiv: arxiv.org/abs/2406.04253

Wants to learn more about the past, present, and future of 3D human modeling?

Check our recent work: A Survey on 3D Human Avatar Modeling - From Reconstruction to Generation.

Hope you will get some valuable insights from it!

arxiv: arxiv.org/abs/2406.04253
Yukang Cao (@yukangcao) 's Twitter Profile Photo

🔥Experiencing issues with image generation due to visible artifacts like watermarks or invisible artifacts (e.g., adversarial noise) in the training images? 📢Check ArtiFade for generating high-quality subject from blemished images. 📖arxiv.org/abs/2409.03745 Shaozhe Hao

🔥Experiencing issues with image generation due to visible artifacts like watermarks or invisible artifacts (e.g., adversarial noise) in the training images?  

📢Check ArtiFade for generating high-quality subject from blemished images.  

📖arxiv.org/abs/2409.03745
<a href="/haoshaozhe/">Shaozhe Hao</a>
Yukang Cao (@yukangcao) 's Twitter Profile Photo

🔥Curious about how you'd look in different outfits? We present GS-VTON, a versatile pipeline that allows for editing the clothing of 3D human subjects with image prompts. - Project: yukangcao.github.io/GS-VTON - Paper: arxiv.org/abs/2410.05259 - Code: github.com/yukangcao/GS-V…

Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

🤩Try On Any Outfit from Any Angle🤩 We introduce 🧥GS-VTON👠 to enable **free-view 3D virtual try-on** (VTON) by transferring the pre-trained knowledge from 2D VTON models to 3D - Project: yukangcao.github.io/GS-VTON/ - Paper: arxiv.org/pdf/2410.05259 - Code: github.com/yukangcao/GS-V…

Yukang Cao (@yukangcao) 's Twitter Profile Photo

🧙‍♂️Equip your 4D human generation with object interactions We introduce #AvatarGO for zero-shot 4D Human-Object Interaction Generation and Animation. Project: yukangcao.github.io/AvatarGO/ Paper: arxiv.org/abs/2410.07164 Code: github.com/yukangcao/Avat…

Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

🔥4D Human-Object Interaction Generation🔥 * Wanna see "Iron Man lifting an axe of Thor"? * #AvatarGO is a zero-shot framework to generate 4D human-object interaction from texts - Project: yukangcao.github.io/AvatarGO/ - Paper: arxiv.org/pdf/2410.07164 - Code: github.com/yukangcao/Avat…

Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

📢 Welcome to check our GenAI work ICLR 2026 🇸🇬 * Video Gen - FasterCache: vchitect.github.io/FasterCache/ * 3D Gen - Phidias: rag-3d.github.io * 4D Gen - DynamicCity: dynamic-city.github.io - AvatarGO: yukangcao.github.io/AvatarGO/ * Multimodal LLM - Oryx: oryx-mllm.github.io

📢 Welcome to check our GenAI work <a href="/iclr_conf/">ICLR 2026</a> 🇸🇬

* Video Gen
- FasterCache: vchitect.github.io/FasterCache/

* 3D Gen
- Phidias: rag-3d.github.io

* 4D Gen
- DynamicCity: dynamic-city.github.io
- AvatarGO: yukangcao.github.io/AvatarGO/

* Multimodal LLM
- Oryx: oryx-mllm.github.io
Yukang Cao (@yukangcao) 's Twitter Profile Photo

🔥Tuning-free 2D image morphing🔥 Tired of complex training and strict semantic/layout demands? Meet #FreeMorph #ICCV2025: tuning-free image morphing across diverse situations -Project: yukangcao.github.io/FreeMorph -Paper: arxiv.org/abs/2507.01953 -Code: github.com/yukangcao/Free…

Ziwei Liu (@liuziwei7) 's Twitter Profile Photo

🤩Tuning-Free Image Morphing🤩 #FreeMorph enables tuning-free generalized image morphing that accommodates inputs with different semantics or layouts #ICCV2025 - Paper Hugging Face: huggingface.co/papers/2507.01… - Project: yukangcao.github.io/FreeMorph/ - Code: github.com/yukangcao/Free…

Yukang Cao (@yukangcao) 's Twitter Profile Photo

Morph4Data has also been released now! It provides image pairs with diverse semantics and layouts, valuable for evaluating image morphing techniques.

Yukang Cao (@yukangcao) 's Twitter Profile Photo

📢How does AI learn to perceive the structures of space and time? We release a survey on reconstructing 4D spatial intelligence from video, unifying methods into 5 progressive levels. Check out how far AI has come: 📚arxiv.org/abs/2507.21045 📜github.com/yukangcao/Awes…

📢How does AI learn to perceive the structures of space and time?

We release a survey on reconstructing 4D spatial intelligence from video, unifying methods into 5 progressive levels.

Check out how far AI has come:

📚arxiv.org/abs/2507.21045
📜github.com/yukangcao/Awes…
Zhenjun Zhao (@zhenjun_zhao) 's Twitter Profile Photo

Reconstructing 4D Spatial Intelligence: A Survey Yukang Cao, Jiahao Lu, Zhisheng Huang, Zhuowei Shen, Chengfeng Zhao, Fangzhou Hong, Zhaoxi Chen, Xin Li, Wenping Wang, Yuan Liu, Ziwei Liu tl;dr: in title arxiv.org/abs/2507.21045

Reconstructing 4D Spatial Intelligence: A Survey

<a href="/yukangcao/">Yukang Cao</a>, Jiahao Lu, Zhisheng Huang, Zhuowei Shen, Chengfeng Zhao, <a href="/hongfz16/">Fangzhou Hong</a>, <a href="/Frozen_Burning/">Zhaoxi Chen</a>, Xin Li, Wenping Wang, <a href="/YuanLiu41955461/">Yuan Liu</a>, <a href="/liuziwei7/">Ziwei Liu</a>

tl;dr: in title

arxiv.org/abs/2507.21045
Yukang Cao (@yukangcao) 's Twitter Profile Photo

Our collected 3D-VTONBench, which includes 60 data subjects captured across various poses and garments, has been released at github.com/yukangcao/GS-V…. Feel free to give it a try~