hüseyin temiz (@hsyn44) 's Twitter Profile
hüseyin temiz

@hsyn44

ID: 47487044

calendar_today16-06-2009 00:10:05

554 Tweet

60 Followers

513 Following

youngjoongkwon (@youngjoongkwon) 's Twitter Profile Photo

#ECCV2024 We have released the evaluation script, pretrained models, and rendered results of Generalizable Human Gaussians (GHG). Code: github.com/humansensingla… Paper: arxiv.org/abs/2407.12777 Website: humansensinglab.github.io/Generalizable-…

Matthias Niessner (@mattniessner) 's Twitter Profile Photo

📢Announcing our 3D head avatar benchmark📢 Two tasks with hidden test sets: - Dynamic Novel View Synthesis on Heads - Monocular FLAME-driven Head Avatar Reconstruction Our goal is to make research on 3D head avatars more comparable and ultimately increase the realism of

Jeff Li (@jiefengli_jeff) 's Twitter Profile Photo

📣📣📣 Excited to share GENMO: A Generalist Model for Human Motion. Words can’t perfectly describe human motion—so we build GENMO. It’s everything to motion. 🔥Video, Text, Music, Audio, Keyframes, Spatial Control…🔥 -- GENMO handles it all within a single model. 📹 Two

Avi Chawla (@_avichawla) 's Twitter Profile Photo

Fine-tune 100+ LLMs directly from a UI! LLaMA-Factory lets you train and fine-tune open-source LLMs and VLMs without writing any code. Supports 100+ models, multimodal fine-tuning, PPO, DPO, experiment tracking, and much more! 100% open-source with 50k stars!

Zhiwen(Aaron) Fan (@wayneinr) 's Twitter Profile Photo

Discover the right 3D Geometric Foundation Model for your task—whether it’s stereo matching, multi-view depth estimation, video depth, pose estimation, semantic understanding, or novel view synthesis. Explore more insights in our #E3DBench #FoundationModel #3D #GaussianSplatting.

Jack Saunders (@jack_r_saunders) 's Twitter Profile Photo

ChatHuman: Chatting about 3D Humans with Tools TLDR: A method for integrating many tools for digital human analysis with an LLM. This method uses RAG on academic publications to help an agent select appropriate tools based on context. 📽️ Project Page: chathuman.github.io 📜

Hunyuan (@tencenthunyuan) 's Twitter Profile Photo

Huge thanks to the DeepBeepMeep team deepbeepmeep for their support! Now, you only need 10 GB of VRAM—instead of 80 GB or 24 GB—to generate 15-second voice/song-driven videos. It’s fast, with no quality loss, and supports TeaCache. 🤗Hugging Face: huggingface.co/tencent/Hunyua…

Tobias Kirschstein (@tobiaskirschst1) 's Twitter Profile Photo

Join us at the workshop on Photo-realistic 3D Head Avatars at #CVPR2025, featuring invited talks by leading academic researchers and industry professionals at the forefront of avatar technology. Wednesday 11th, starting from 1:20pm workshop website: kaldir.vc.in.tum.de/nersemble_benc…

Join us at the workshop on Photo-realistic 3D Head Avatars at #CVPR2025, featuring invited talks by leading academic researchers and industry professionals at the forefront of avatar technology. Wednesday 11th, starting from 1:20pm
workshop website: kaldir.vc.in.tum.de/nersemble_benc…
amrita (@amritamaz) 's Twitter Profile Photo

The recordings from the #CVPR2025 tutorial "Volumetric Video in the Real World" are now available! nvlabs.github.io/cvpr2025-volum…

Ruben Wiersma (@rtwiersma) 's Twitter Profile Photo

The materials of my graduate school course at #SGP2025 on "Deep Learning on Meshes and Point Clouds" are live on rubenwiersma.nl/deeplearning! A 'map' for 3D deep learning, covering the basics and thoughts on when and where to use deep learning tools in 3D.

The materials of my graduate school course at #SGP2025 on "Deep Learning on Meshes and Point Clouds" are live on rubenwiersma.nl/deeplearning! A 'map' for 3D deep learning, covering the basics and thoughts on when and where to use deep learning tools in 3D.
Rui Li (@leedaray) 's Twitter Profile Photo

Recent update of🚀awesome-dust3r: github.com/ruili3/awesome…🚀 - A new trend of 3D geometric reasoning: LaRI (ruili3.github.io/lari), RaySt3R (rayst3r.github.io), Amodal3R (sm0kywu.github.io/Amodal3R/). - A new direction for DUSt3R4Science: CryoFastAR (arxiv: 2506.05864).

Ruilong Li (@ruilong_li) 's Twitter Profile Photo

For everyone interested in precise 📷camera control 📷 in transformers [e.g., video / world model etc] Stop settling for Plücker raymaps -- use camera-aware relative PE in your attention layers, like RoPE (for LLMs) but for cameras! Paper & code: liruilong.cn/prope/

For everyone interested in precise 📷camera control 📷 in transformers [e.g., video / world model etc]

Stop settling for Plücker raymaps -- use camera-aware relative PE in your attention layers, like RoPE (for LLMs) but for cameras! 

Paper & code: liruilong.cn/prope/
NVIDIA AI Developer (@nvidiaaidev) 's Twitter Profile Photo

Your 3D Gaussian Splats look great, but physics? They've been a flop. Discover how to do deformable simulations on 3DGS at #NVIDIAResearch's NVIDIA Kaolin & Warp workshop at #SIGGRAPH2025 👉 nvda.ws/4o61SGP

Zhenjun Zhao (@zhenjun_zhao) 's Twitter Profile Photo

Reconstructing 4D Spatial Intelligence: A Survey Yukang Cao, Jiahao Lu, Zhisheng Huang, Zhuowei Shen, Chengfeng Zhao, Fangzhou Hong, Zhaoxi Chen, Xin Li, Wenping Wang, Yuan Liu, Ziwei Liu tl;dr: in title arxiv.org/abs/2507.21045

Reconstructing 4D Spatial Intelligence: A Survey

<a href="/yukangcao/">Yukang Cao</a>, Jiahao Lu, Zhisheng Huang, Zhuowei Shen, Chengfeng Zhao, <a href="/hongfz16/">Fangzhou Hong</a>, <a href="/Frozen_Burning/">Zhaoxi Chen</a>, Xin Li, Wenping Wang, <a href="/YuanLiu41955461/">Yuan Liu</a>, <a href="/liuziwei7/">Ziwei Liu</a>

tl;dr: in title

arxiv.org/abs/2507.21045
Alexandre Morgand (@almorgand) 's Twitter Profile Photo

"Cameras as Relative Positional Encoding" TLDR: comparison for conditioning transformers on cameras: token-level raymap, attention-level relative pose encodings, a (new) relative encoding Projective Positional Encoding -> camera frustums, (int|ext)insics for relative pos encoding

Rui Li (@leedaray) 's Twitter Profile Photo

Another major update of the "awesome-dust3r" (github.com/ruili3/awesome…) paper list. There are more VGG-T follow-ups and some interesting correlations, e.g., VGGT-Long/LONG3R, STream3R/Streaming 4D-VGGT. Let's see what happens for visual geometry in the DINO-v3 era :)