Yukang Cao (@yukangcao) Twitter Tweets • TwiCopy

Ziwei Liu

2 years ago

📢Text-to-3D Foundation Model📢 Our #3DTopia has major updates, with 1) newly released technical report, and 2) our own *refined captions* for the Objaverse quality set - Code: github.com/3DTopia/3DTopia - Paper: arxiv.org/pdf/2403.02234… - Refined Objaverse: github.com/3DTopia/3DTopi…

174 likes, 0 replies, 45 reposts

Tengfei Wang

@dylantfwang

2 years ago

👏🏻 👏🏻We now have several major updates for OpenLRM: (1) We release the full model trained on both objaverse and MVImgNet. (2) We release the full training code, which can help reproduce image-to-3d models. Code: github.com/3DTopia/OpenLRM HF demo: huggingface.co/spaces/zxhezex…

94 likes, 0 replies, 17 reposts

Ziwei Liu

@liuziwei7

2 years ago

🔥Interactive Text-to-Texture Synthesis🔥 We present #InTeX, an interactive framework for 3D text-to-texture synthesis, with *region repainting* and *real-time editing on laptop* - Project: me.kiui.moe/intex/ - Paper: arxiv.org/abs/2403.11878 - Code: github.com/ashawkey/InTeX

170 likes, 2 replies, 37 reposts

MrNeRF

@janusch_patas

a year ago

A Survey on 3D Human Avatar Modeling -- From Reconstruction to Generation arxiv.org/abs/2406.04253

80 likes, 0 replies, 17 reposts

Yukang Cao

@yukangcao

a year ago

Want to learn more about the past, present, and future of 3D human modeling? Check out our recent work: A Survey on 3D Human Avatar Modeling - From Reconstruction to Generation. Hope you will get some valuable insights from it! arxiv: arxiv.org/abs/2406.04253

6 likes, 0 replies, 3 reposts

Yukang Cao

@yukangcao

a year ago

🔥Experiencing issues with image generation due to visible artifacts like watermarks or invisible artifacts (e.g., adversarial noise) in the training images? 📢Check out ArtiFade for generating high-quality subjects from blemished images. 📖arxiv.org/abs/2409.03745 Shaozhe Hao

4 likes, 0 replies, 1 repost

Yukang Cao

@yukangcao

a year ago

🔥Curious about how you'd look in different outfits? We present GS-VTON, a versatile pipeline that allows for editing the clothing of 3D human subjects with image prompts. - Project: yukangcao.github.io/GS-VTON - Paper: arxiv.org/abs/2410.05259 - Code: github.com/yukangcao/GS-V…

221 likes, 2 replies, 54 reposts

Ziwei Liu

@liuziwei7

a year ago

🤩Try On Any Outfit from Any Angle🤩 We introduce 🧥GS-VTON👠 to enable **free-view 3D virtual try-on** (VTON) by transferring the pre-trained knowledge from 2D VTON models to 3D - Project: yukangcao.github.io/GS-VTON/ - Paper: arxiv.org/pdf/2410.05259 - Code: github.com/yukangcao/GS-V…

73 likes, 2 replies, 8 reposts

Yukang Cao

@yukangcao

a year ago

🧙‍♂️Equip your 4D human generation with object interactions. We introduce #AvatarGO for zero-shot 4D Human-Object Interaction Generation and Animation. Project: yukangcao.github.io/AvatarGO/ Paper: arxiv.org/abs/2410.07164 Code: github.com/yukangcao/Avat…

70 likes, 2 replies, 15 reposts

Ziwei Liu

@liuziwei7

a year ago

🔥4D Human-Object Interaction Generation🔥 * Wanna see "Iron Man lifting an axe of Thor"? * #AvatarGO is a zero-shot framework to generate 4D human-object interaction from texts - Project: yukangcao.github.io/AvatarGO/ - Paper: arxiv.org/pdf/2410.07164 - Code: github.com/yukangcao/Avat…

34 likes, 0 replies, 6 reposts

Yukang Cao

@yukangcao

8 months ago

🛠️Code for GS-VTON has been released at github.com/yukangcao/GS-V…

9 likes, 0 replies, 2 reposts

Yukang Cao

@yukangcao

8 months ago

🥨Code has been released at github.com/yukangcao/Avat…

48 likes, 0 replies, 4 reposts

Ziwei Liu

@liuziwei7

7 months ago

📢 Welcome to check out our GenAI work at ICLR 2026 🇸🇬

* Video Gen
- FasterCache: vchitect.github.io/FasterCache/

* 3D Gen
- Phidias: rag-3d.github.io

* 4D Gen
- DynamicCity: dynamic-city.github.io
- AvatarGO: yukangcao.github.io/AvatarGO/

* Multimodal LLM
- Oryx: oryx-mllm.github.io

129 likes, 2 replies, 22 reposts

AK

@_akhaliq

4 months ago

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

141 likes, 1 reply, 17 reposts

Yukang Cao

@yukangcao

4 months ago

🔥Tuning-free 2D image morphing🔥 Tired of complex training and strict semantic/layout demands? Meet #FreeMorph #ICCV2025: tuning-free image morphing across diverse situations -Project: yukangcao.github.io/FreeMorph -Paper: arxiv.org/abs/2507.01953 -Code: github.com/yukangcao/Free…

80 likes, 1 reply, 17 reposts

Ziwei Liu

@liuziwei7

4 months ago

🤩Tuning-Free Image Morphing🤩 #FreeMorph enables tuning-free generalized image morphing that accommodates inputs with different semantics or layouts #ICCV2025 - Paper Hugging Face: huggingface.co/papers/2507.01… - Project: yukangcao.github.io/FreeMorph/ - Code: github.com/yukangcao/Free…

55 likes, 0 replies, 13 reposts

Yukang Cao

@yukangcao

4 months ago

Morph4Data has also been released now! It provides image pairs with diverse semantics and layouts, valuable for evaluating image morphing techniques.

17 likes, 0 replies, 3 reposts

Yukang Cao

@yukangcao

3 months ago

📢How does AI learn to perceive the structures of space and time? We release a survey on reconstructing 4D spatial intelligence from video, unifying methods into 5 progressive levels. Check out how far AI has come: 📚arxiv.org/abs/2507.21045 📜github.com/yukangcao/Awes…

28 likes, 0 replies, 8 reposts

Zhenjun Zhao

@zhenjun_zhao

3 months ago

Reconstructing 4D Spatial Intelligence: A Survey

Yukang Cao, Jiahao Lu, Zhisheng Huang, Zhuowei Shen, Chengfeng Zhao, Fangzhou Hong, Zhaoxi Chen, Xin Li, Wenping Wang, Yuan Liu, Ziwei Liu

tl;dr: in title

arxiv.org/abs/2507.21045

70 likes, 2 replies, 23 reposts

Yukang Cao

@yukangcao

3 months ago

Our collected 3D-VTONBench, which includes 60 data subjects captured across various poses and garments, has been released at github.com/yukangcao/GS-V…. Feel free to give it a try~

3 likes, 0 replies, 0 reposts