Zhenxing Mi (@mifucius1) 's Twitter Profile
Zhenxing Mi

@mifucius1

PhD student @ HKUST

ID: 1421401082215866369

Link: https://mizhenxing.github.io · Joined: 31-07-2021 09:23:31

41 Tweets

100 Followers

588 Following

Zhenxing Mi (@mifucius1) 's Twitter Profile Photo

The code of our ICLR2023 paper "Switch-NeRF: Learning Scene Decomposition with Mixture of Experts for Large-scale Neural Radiance Fields" has been released, with Dan Xu. Code: github.com/MiZhenxing/Swi… Paper: openreview.net/forum?id=PQ2zo… Project page: mizhenxing.github.io/switchnerf

HanRong YE (@leoyerrrr) 's Twitter Profile Photo

#ICLR2023 Updates to TaskPrompter's codebase for joint 2D-3D multi-task understanding on Cityscapes-3D! We now predict disparity instead of depth, aligning with prevalent practices on the dataset. Please check github.com/prismformore/M… Thanks to Prof Dan Xu for valuable guidance!😃

HanRong YE (@leoyerrrr) 's Twitter Profile Photo

How to design generative models to help segmentation tasks?🧐 Introducing SegGen, our innovative approach for generating training data for image segmentation tasks, which greatly pushes the boundaries of performance for cutting-edge segmentation models. We creatively propose a…

Zhenxing Mi (@mifucius1) 's Twitter Profile Photo

Excited to share our new paper "ThinkDiff" on arXiv.

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

It can make diffusion models take "IQ tests"!

It empowers diffusion models with multimodal in-context understanding and reasoning.

Zhenxing Mi (@mifucius1) 's Twitter Profile Photo

The image generation of GPT-4o is amazing. It highlights image generation based on multimodal in-context learning.

Our paper ThinkDiff investigates this direction and shows promising results, although it is far less powerful than GPT-4o and Gemini. Check out our paper and post!

Dan Xu ✈️ CVPR2025 (@danxuhk) 's Twitter Profile Photo

We propose a high-fidelity talking head generation framework that supports both single-modal and multi-modal driving signals. More details: arXiv: arxiv.org/abs/2504.02542 Project page: harlanhong.github.io/publications/a… GitHub: github.com/harlanhong/ACT… HuggingFace: huggingface.co/papers/2504.02…