Yiming Xie (@yimingxie4) 's Twitter Profile
Yiming Xie

@yimingxie4

CS PhD student @khourycollege | B.E. @ZJU_China

ID: 1240978052764422146

Link: https://ymingxie.github.io · Joined: 20-03-2020 12:26:47

53 Tweets

622 Followers

580 Following

Jia-Bin Huang (@jbhuang0604) 's Twitter Profile Photo

Research summary for the last 3 years... 2021: Replace every CNN with a Transformer 2022: Replace every GAN with diffusion models 2023: Replace every NeRF with Gaussian splatting

Huaizu Jiang (@huaizujiang) 's Twitter Profile Photo

Check out our latest work, HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models. We propose a diffusion model that can generate realistic 3D human-object interactions (HOIs) driven by textual prompts. Paper: arxiv.org/abs/2312.06553 Project:

Yiming Xie (@yimingxie4) 's Twitter Profile Photo

Our work OmniControl is accepted at #ICLR2024! It incorporates flexible spatial control signals into a text-conditioned human motion generation model. Project: neu-vi.github.io/omnicontrol/ Code: github.com/neu-vi/omnicon…

Yiming Xie (@yimingxie4) 's Twitter Profile Photo

Glad to be a recipient of the 2024 Apple Scholars in AI/ML PhD fellowship! Thanks Apple and all my mentors and collaborators! machinelearning.apple.com/updates/apple-…

Yiming Xie (@yimingxie4) 's Twitter Profile Photo

I will present OmniControl (arxiv.org/abs/2310.08580) at #ICLR2024. ⏰: Tuesday (May 7) 4:30 p.m. (Halle B #54) Come say hi!

Huaizu Jiang (@huaizujiang) 's Twitter Profile Photo

Excited to share our recent work HouseCrafter, which can lift a floorplan into a complete large 3D indoor scene (e.g. a house). Our key insight is to adapt a 2D diffusion model to generate consistent multi-view RGB-D images for reconstruction. Paper: arxiv.org/abs/2406.20077

Dreaming Tulpa 🥓👑 (@dreamingtulpa) 's Twitter Profile Photo

Want to see what your next flat, house or film set could look like in 3D? HouseCrafter can lift a floorplan into a complete 3D indoor scene. neu-vi.github.io/houseCrafter/

Stability AI (@stabilityai) 's Twitter Profile Photo

We are pleased to announce the availability of Stable Video 4D, our very first video-to-video generation model that allows users to upload a single video and receive dynamic novel-view videos of eight new angles, delivering a new level of versatility and creativity. In

Huaizu Jiang (@huaizujiang) 's Twitter Profile Photo

#ECCV2024 We've tamed human motion diffusion models to generate stylized motions. Check out our work SMooDi: Stylized Motion Diffusion Model. One step closer to high-fidelity human motion generation. Paper: arxiv.org/abs/2407.12783 Code: github.com/neu-vi/SMooDi

wxDai (@daiwenxun) 's Twitter Profile Photo

🔥Today, we announce the MotionLCM-V2, a state-of-the-art text-to-motion model in motion generation quality, motion-text alignment capability, and inference speed. ✍️Blogpost: huggingface.co/blog/wxDai/mot… 💻Code: github.com/Dai-Wenxun/Mot…

Hongyu Li (@hongyu_lii) 's Twitter Profile Photo

Can we robustly track an object’s 6D pose in contact-rich, occluded scenarios? Yes! Our solution, V-HOP, fuses vision and touch through a visuo-haptic transformer for precise, real-time tracking. arXiv: arxiv.org/abs/2502.17434 Project: lhy.xyz/projects/v-hop/

Stability AI (@stabilityai) 's Twitter Profile Photo

Introducing Stable Virtual Camera: This multi-view diffusion model transforms 2D images into immersive 3D videos with realistic depth and perspective—without complex reconstruction or scene-specific optimization.

Yiming Xie (@yimingxie4) 's Twitter Profile Photo

🎉Come check out our poster at #ICLR2025! SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency 🗓️ Thursday, April 24 ⏰ 3:00 PM – 5:30 PM 📍 Hall 3 + Hall 2B, Poster #112 🧑‍💻 Presented by Chun-Han Yao and Huaizu Jiang 🔗 sv4d.github.io

Stability AI (@stabilityai) 's Twitter Profile Photo

We’ve upgraded Stable Video Diffusion 4D to Stable Video 4D 2.0 (SV4D 2.0), improving the quality of 4D outputs generated from a single object-centric video. While 3D provides a static view of an object’s shape and size, 4D extends this by including time, showing how the object

Huaizu Jiang (@huaizujiang) 's Twitter Profile Photo

We revisit the representation in human motion generation, showing that absolute joint coordinates outperform the de facto kinematic-aware, local-relative, and redundant choice. Benefits include: ✅ Easy motion control/editing ✅ Direct generation of SMPL mesh vertices in motion

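The tweet above contrasts absolute joint coordinates with the usual parent-relative (kinematic-tree) representation. A minimal sketch of the difference, using a hypothetical 5-joint chain and translation-only forward kinematics (the paper's actual skeleton and representation will differ):

```python
import numpy as np

# Hypothetical 5-joint kinematic chain: parent index per joint (-1 = root).
parents = [-1, 0, 1, 2, 3]

# Local, parent-relative offsets (one 3D vector per joint), made-up values.
local_offsets = np.array([
    [0.0, 0.0, 0.0],
    [0.0, 0.1, 0.0],
    [0.0, 0.1, 0.0],
    [0.0, 0.1, 0.0],
    [0.0, 0.1, 0.0],
])

def local_to_absolute(parents, local_offsets):
    """Forward kinematics (translation only): accumulate each joint's
    parent-relative offset into an absolute coordinate."""
    absolute = np.zeros_like(local_offsets)
    for j, p in enumerate(parents):
        absolute[j] = local_offsets[j] if p < 0 else absolute[p] + local_offsets[j]
    return absolute

abs_joints = local_to_absolute(parents, local_offsets)
print(abs_joints[-1])  # chain tip at [0.0, 0.4, 0.0]
```

The point of the contrast: editing one joint in the local-relative form shifts every descendant through the accumulation above, whereas absolute coordinates can be constrained or edited per joint directly, which is what makes motion control/editing easier.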
Fangrui Zhu (@fangrui_zhu) 's Twitter Profile Photo

🌟LMMs, e.g. GPT-o3, can solve spatial tasks from RGBD videos—with strong perception and prompting. 🚀We introduce Struct2D, a method that boosts spatial reasoning in open-source models. Even Qwen-VL-3B + Struct2D outperforms existing 7B models. 📜arXiv: arxiv.org/abs/2506.04220

Lei Zhong (@leizhong_) 's Twitter Profile Photo

1) 🚀 From Sketch to Animation! Ever wished your hand-drawn storyboards could come to life? 🎨 Meet Sketch2Anim — our framework that transforms sketches into expressive 3D animations. Presenting at #SIGGRAPH2025 🇨🇦🎉 🔗 Project: zhongleilz.github.io/Sketch2Anim/

Chuan Guo (@chuan_guo92603) 's Twitter Profile Photo

🚀 We’ll be hosting a Tutorial on "3D Human Motion Generation and Simulation" at ICCV 2025 in Honolulu, Hawaii! 🌺 📅 Date: October 19, 2025 ⏰ Time: 9:00–16:00 (HST) 🔗 More details & resources: 3dmogen.github.io #AIGC #Simulation #robotics #ComputerVision #ICCV2025
