Jiahui Huang (@huangjh_hjh) 's Twitter Profile
Jiahui Huang

@huangjh_hjh

Research Scientist @NVIDIA Toronto AI Lab.

ID: 1241140288657051649

linkhttps://huangjh-pub.github.io/ calendar_today20-03-2020 23:11:25

45 Tweet

387 Followers

267 Following

Jiahui Huang (@huangjh_hjh) 's Twitter Profile Photo

⛳️Happy to present SCube, our newest 3D large-scale scene reconstruction model. It has been accepted by #NeurIPS2024! Great job Xuanchi Ren and Yifan!

Jiahui Huang (@huangjh_hjh) 's Twitter Profile Photo

📢Please check out our newest work on feed-forward reconstruction of dynamic monocular videos! With our bullet-time formulation, we reach great flexibility and state-of-the-art performance!

Yanke Song (@yannnke) 's Twitter Profile Photo

New #NVIDIA paper to make diffusion models better and faster 🚀 Multi-Student Distillation! We distill diffusion models into multiple 1-step students, allowing (a) improved quality by specializing in subsets and (b) improved latency by distilling into smaller architectures. 1/n

New #NVIDIA paper to make diffusion models better and faster 🚀 Multi-Student Distillation!

We distill diffusion models into multiple 1-step students, allowing (a) improved quality by specializing in subsets and (b) improved latency by distilling into smaller architectures.

1/n
Hanxue Liang (@hx_liang95) 's Twitter Profile Photo

🚀Excited to Introduce #BTimer: Real-Time Dynamic Scene Reconstruction from Monocular Videos! Struggling with novel view synthesis on dynamic scenes? Meet BTimer (BulletTimer) — the 1st motion-aware feed-forward model for real-time scene reconstruction at any desired time. ✅

Jiahui Huang (@huangjh_hjh) 's Twitter Profile Photo

Built upon X/SCube, our newest InfiniCube generates controllable dynamic 3DGS driving scenes from an HD map and bounding boxes. Generating these simulation environments unlocks many more possibilities in autonomous driving! Kudos to Yifan Lu and Xuanchi Ren! 🥳

NVIDIA DRIVE (@nvidiadrive) 's Twitter Profile Photo

Introducing InfiniCube, a scalable method for generating unbounded dynamic 3D driving scenes with high fidelity and controllability.

MrNeRF (@janusch_patas) 's Twitter Profile Photo

Spatio-Temporal Reconstruction Model for Large-Scale Outdoor Scenes Contributions: • We propose STORM, the first feed-forward, self-supervised method for fast and accurate reconstruction of dynamic 3D scenes from sparse, multi-timestep, posed camera images. • Our bottom-up

Jiahui Huang (@huangjh_hjh) 's Twitter Profile Photo

Whether you're a researcher or developer, #NVIDIACosmos world foundation models are now openly available under our permissive license to the physical AI community via NGC & Hugging Face. 🤗 See how Cosmos is democratizing #physicalAI development. #CES2025 bit.ly/4g4tVkR

Zhenjun Zhao (@zhenjun_zhao) 's Twitter Profile Photo

MATCHA:Towards Matching Anything feixue, Sven Elflein, Laura Leal-Taixe, Qunjie Zhou tl;dr: diffusion model->semantic+geometric features->transformer-based fusion->enhanced diffusion features->w/ DINOv2->unified feature->geometric/semantic/temporal matching arxiv.org/abs/2501.14945

MATCHA:Towards Matching Anything

<a href="/FeiXue94/">feixue</a>, <a href="/s_elflein/">Sven Elflein</a>, <a href="/lealtaixe/">Laura Leal-Taixe</a>, <a href="/QunjieZhou/">Qunjie Zhou</a>

tl;dr: diffusion model-&gt;semantic+geometric features-&gt;transformer-based fusion-&gt;enhanced diffusion features-&gt;w/ DINOv2-&gt;unified feature-&gt;geometric/semantic/temporal matching

arxiv.org/abs/2501.14945
Jay Z. Wu (@jayzhangjiewu) 's Twitter Profile Photo

Excited to share our #CVPR2025 paper: Difix3D+ Difix3D+ reimagines 3D reconstruction with single-step diffusion, distilling 2D generative priors for realistic novel view synthesis from large viewpoint shifts. 📄Paper: arxiv.org/abs/2503.01774 🌐Website: research.nvidia.com/labs/toronto-a…

Jiahui Huang (@huangjh_hjh) 's Twitter Profile Photo

📽️🔥 Please checkout GEN3C, a video model with an explicit 3D cache inside for better consistency and precise camera control! #CVPR2025 #NVIDIA

Tianchang Shen (@tianchangs) 's Twitter Profile Photo

📢 GEN3C is now open-sourced, with code released under Apache 2.0 and model weights under the NVIDIA Open Model License! 🚀 Along with it, we're releasing a GUI tool that lets you specify your desired video trajectory in 3D — come play with it and generate your own! The

MrNeRF (@janusch_patas) 's Twitter Profile Photo

ViPE: Video Pose Engine for 3D Geometric Perception Contributions: • A robust and efficient framework, ViPE, for estimating camera parameters and dense depth from diverse, in-the-wild videos. • A system design that integrates the strengths of classical SLAM (efficiency,