hüseyin temiz (@hsyn44) Twitter Tweets • TwiCopy

youngjoongkwon

a year ago

#ECCV2024 We have released the evaluation script, pretrained models, and rendered results of Generalizable Human Gaussians (GHG). Code: github.com/humansensingla… Paper: arxiv.org/abs/2407.12777 Website: humansensinglab.github.io/Generalizable-…

Likes: 105, Replies: 0, Reposts: 15

Matthias Niessner

@mattniessner

7 months ago

📢Announcing our 3D head avatar benchmark📢 Two tasks with hidden test sets: - Dynamic Novel View Synthesis on Heads - Monocular FLAME-driven Head Avatar Reconstruction Our goal is to make research on 3D head avatars more comparable and ultimately increase the realism of

Likes: 147, Replies: 2, Reposts: 37

Michael Tschannen

@mtschannen

5 months ago

📢 We just released the code for JetFormer at github.com/google-researc… Enjoy!

Likes: 309, Replies: 5, Reposts: 61

Jeff Li

@jiefengli_jeff

5 months ago

📣📣📣 Excited to share GENMO: A Generalist Model for Human Motion. Words can’t perfectly describe human motion—so we build GENMO. It’s everything to motion. 🔥Video, Text, Music, Audio, Keyframes, Spatial Control…🔥 -- GENMO handles it all within a single model. 📹 Two

Likes: 118, Replies: 2, Reposts: 32

Avi Chawla

@_avichawla

5 months ago

Fine-tune 100+ LLMs directly from a UI! LLaMA-Factory lets you train and fine-tune open-source LLMs and VLMs without writing any code. Supports 100+ models, multimodal fine-tuning, PPO, DPO, experiment tracking, and much more! 100% open-source with 50k stars!

Likes: 2.2K, Replies: 27, Reposts: 411

MrNeRF

@janusch_patas

5 months ago

🚨 Code dropped! One of the most exciting papers of the year! Check it out! Link in the comments 👇

Likes: 275, Replies: 3, Reposts: 42

Zhiwen(Aaron) Fan

@wayneinr

5 months ago

Discover the right 3D Geometric Foundation Model for your task—whether it’s stereo matching, multi-view depth estimation, video depth, pose estimation, semantic understanding, or novel view synthesis. Explore more insights in our #E3DBench #FoundationModel #3D #GaussianSplatting.

Likes: 377, Replies: 4, Reposts: 75

Jack Saunders

@jack_r_saunders

5 months ago

ChatHuman: Chatting about 3D Humans with Tools TLDR: A method for integrating many tools for digital human analysis with an LLM. This method uses RAG on academic publications to help an agent select appropriate tools based on context. 📽️ Project Page: chathuman.github.io 📜

Likes: 41, Replies: 0, Reposts: 13

Hunyuan

@tencenthunyuan

5 months ago

Huge thanks to the DeepBeepMeep team deepbeepmeep for their support! Now, you only need 10 GB of VRAM—instead of 80 GB or 24 GB—to generate 15-second voice/song-driven videos. It’s fast, with no quality loss, and supports TeaCache. 🤗Hugging Face: huggingface.co/tencent/Hunyua…

Likes: 485, Replies: 8, Reposts: 76

Tobias Kirschstein

@tobiaskirschst1

5 months ago

Join us at the workshop on Photo-realistic 3D Head Avatars at #CVPR2025, featuring invited talks by leading academic researchers and industry professionals at the forefront of avatar technology. Wednesday 11th, starting from 1:20pm workshop website: kaldir.vc.in.tum.de/nersemble_benc…

Likes: 77, Replies: 2, Reposts: 17

SpAItial AI

@spaitial_ai

5 months ago

Congrats to the #VGGT work for winning the best paper award at #CVPR2025! Very proud to have David Novotny our CTO SpAItial AI among the contributors -- congratulations!

Likes: 85, Replies: 0, Reposts: 8

Kosta Derpanis

@csprofkgd

5 months ago

Upcoming CVF-related #computervision conferences.

Likes: 125, Replies: 7, Reposts: 12

amrita

@amritamaz

4 months ago

The recordings from the #CVPR2025 tutorial "Volumetric Video in the Real World" are now available! nvlabs.github.io/cvpr2025-volum…

Likes: 82, Replies: 0, Reposts: 13

Ruben Wiersma

@rtwiersma

4 months ago

The materials of my graduate school course at #SGP2025 on "Deep Learning on Meshes and Point Clouds" are live on rubenwiersma.nl/deeplearning! A 'map' for 3D deep learning, covering the basics and thoughts on when and where to use deep learning tools in 3D.

Likes: 350, Replies: 4, Reposts: 60

Rui Li

@leedaray

4 months ago

Recent update of🚀awesome-dust3r: github.com/ruili3/awesome…🚀 - A new trend of 3D geometric reasoning: LaRI (ruili3.github.io/lari), RaySt3R (rayst3r.github.io), Amodal3R (sm0kywu.github.io/Amodal3R/). - A new direction for DUSt3R4Science: CryoFastAR (arxiv: 2506.05864).

Likes: 139, Replies: 1, Reposts: 29

Ruilong Li

@ruilong_li

4 months ago

For everyone interested in precise 📷camera control 📷 in transformers [e.g., video / world model etc] Stop settling for Plücker raymaps -- use camera-aware relative PE in your attention layers, like RoPE (for LLMs) but for cameras! Paper & code: liruilong.cn/prope/

Likes: 410, Replies: 7, Reposts: 75

NVIDIA AI Developer

@nvidiaaidev

3 months ago

Your 3D Gaussian Splats look great, but physics? They've been a flop. Discover how to do deformable simulations on 3DGS at #NVIDIAResearch's NVIDIA Kaolin & Warp workshop at #SIGGRAPH2025 👉 nvda.ws/4o61SGP

Likes: 97, Replies: 1, Reposts: 19

Zhenjun Zhao

@zhenjun_zhao

3 months ago

Reconstructing 4D Spatial Intelligence: A Survey Yukang Cao, Jiahao Lu, Zhisheng Huang, Zhuowei Shen, Chengfeng Zhao, Fangzhou Hong, Zhaoxi Chen, Xin Li, Wenping Wang, Yuan Liu, Ziwei Liu tl;dr: in title arxiv.org/abs/2507.21045

Likes: 70, Replies: 2, Reposts: 23

Alexandre Morgand

@almorgand

3 months ago

"Cameras as Relative Positional Encoding" TLDR: comparison for conditioning transformers on cameras: token-level raymap, attention-level relative pose encodings, a (new) relative encoding Projective Positional Encoding -> camera frustums, (int|ext)insics for relative pos encoding

Likes: 465, Replies: 2, Reposts: 50

Rui Li

@leedaray

3 months ago

Another major update of the "awesome-dust3r" (github.com/ruili3/awesome…) paper list. There are more VGG-T follow-ups and some interesting correlations, e.g., VGGT-Long/LONG3R, STream3R/Streaming 4D-VGGT. Let's see what happens for visual geometry in the DINO-v3 era :)

Likes: 116, Replies: 2, Reposts: 17