Daniel Geng (@dangengdg)'s Twitter Profile
Daniel Geng

@dangengdg

PhD student at @UmichCSE. Interested in computer vision and generative models. Previously @GoogleDeepMind, @MetaAI, @berkeley_ai

ID: 770504727305949184

Link: http://dangeng.github.io | Joined: 30-08-2016 06:13:37

127 Tweets

1.1K Followers

818 Following

Alejandro Pardo (@pardoalejo):

Ever noticed how one scene seamlessly transitions into another in films? That's a match-cut: a subtle yet powerful cinematic trick. Our MatchDiffusion generates two videos from text prompts, designed to form a seamless match-cut, effortlessly and training-free. 🎥✨ 1/n
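As a rough intuition for how a training-free match cut might be produced (an illustration only, not necessarily MatchDiffusion's exact procedure): let two prompts share the initial noise and the early denoising steps, so both clips commit to the same coarse layout and motion, then finish denoising independently. The `model.denoise_step` API below is a hypothetical placeholder.

```python
import torch

def match_cut_pair(model, prompt_a, prompt_b, steps=50, shared_steps=20):
    """Generate two clips whose coarse structure matches (a match-cut pair).

    `model.denoise_step(x, prompt, t)` is a hypothetical one-step denoiser for
    a text-to-video diffusion model; real pipelines differ in API and schedule.
    """
    x = torch.randn(1, 16, 4, 64, 64)  # shared initial noise (frames x latent)
    # Phase 1: joint denoising under a combined condition, so both videos
    # inherit the same coarse layout and motion.
    for t in range(steps, steps - shared_steps, -1):
        x = model.denoise_step(x, f"{prompt_a} | {prompt_b}", t)
    # Phase 2: denoise each copy separately, letting content diverge per prompt.
    xa, xb = x.clone(), x.clone()
    for t in range(steps - shared_steps, 0, -1):
        xa = model.denoise_step(xa, prompt_a, t)
        xb = model.denoise_step(xb, prompt_b, t)
    return xa, xb  # decode with the model's VAE to obtain the two videos
```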

Trenton Chang (@chang_trenton):

(1/) I'll be going to #NeurIPS2024 next week, where I'll be presenting exciting new work on detecting gaming using causal inference! Our work is motivated by some problems in Medicare (U.S. public health insurance): turns out it's incredibly easy to game that system.

Xun Huang (@xunhuang1995):

🚀 Introducing CausVid: Instant video generation that plays the moment you hit "Generate", while maintaining state-of-the-art quality! Project Page: causvid.github.io. More details in the long thread.
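The "plays the moment you hit Generate" property suggests causal, frame-by-frame generation: each frame depends only on the past, so frames can stream to the player as they are produced. A toy sketch of that streaming pattern (the `generate_next_frame` model is a hypothetical placeholder, not CausVid's API):

```python
from typing import Iterator
import numpy as np

def stream_video(generate_next_frame, n_frames=120) -> Iterator[np.ndarray]:
    """Yield frames one at a time so playback can begin immediately.

    `generate_next_frame(history)` stands in for a causal video model that
    conditions only on already-generated frames (hypothetical placeholder).
    """
    history = []
    for _ in range(n_frames):
        frame = generate_next_frame(history)  # depends only on past frames
        history.append(frame)
        yield frame  # display this frame while the rest are still generating

# Usage: for frame in stream_video(model_fn): display(frame)
```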

Nicolas DUFOUR (@nico_dufour):

🌍 Guessing where an image was taken is a hard and often ambiguous problem. Introducing diffusion-based geolocation: we predict global locations by refining random guesses into trajectories across the Earth's surface! 🗺️ Paper, code, and demo: nicolas-dufour.github.io/plonk
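As a rough illustration of "refining random guesses into trajectories": start from a random coordinate and repeatedly apply a learned refinement step conditioned on the image. The `refine_step` network below is a hypothetical stand-in; the actual method is a diffusion model with a proper noise schedule, so treat this purely as intuition.

```python
import numpy as np

def predict_location(image_feat, refine_step, n_steps=64, seed=0):
    """Refine a random guess into a (lat, lon) prediction.

    `refine_step(coord, image_feat, t)` stands in for a trained network that
    returns a correction vector toward the true location (hypothetical).
    """
    rng = np.random.default_rng(seed)
    coord = np.array([rng.uniform(-90, 90), rng.uniform(-180, 180)])  # random start
    trajectory = [coord.copy()]
    for t in range(n_steps):
        coord = coord + refine_step(coord, image_feat, t)  # step toward the answer
        coord[0] = np.clip(coord[0], -90.0, 90.0)          # keep latitude valid
        coord[1] = (coord[1] + 180.0) % 360.0 - 180.0      # wrap longitude
        trajectory.append(coord.copy())
    return coord, trajectory  # final prediction plus its path across the globe
```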

Daniel Geng (@dangengdg):

I'll be presenting "Images that Sound" today at #NeurIPS2024! East Exhibit Hall A-C #2710. Come say hi to me and Andrew Owens :) (Ziyang Chen sadly could not make it, but will be there in spirit :') )

Daniel Geng (@dangengdg):

I had a lot of fun helping put this problem set together -- if you're teaching diffusion models + computer vision, consider using this homework for your course! (links at end of Ryan Tabrizi's thread!)

Daniel Geng (@dangengdg):

Hey all, I'll be answering questions about our "Motion Prompting" paper on alphaXiv (it's like arXiv, but adds a discussion section, and I think it's quite well built!): alphaxiv.org/abs/2412.02700…

David McAllister (@davidrmcall):

Decentralized Diffusion Models power stronger models trained on more accessible infrastructure. DDMs mitigate the networking bottleneck that locks training into expensive and power-hungry centralized clusters. They scale gracefully to billions of parameters and generate…

Zhaoying Pan (@zhaoyingpan):

Our workshop at #ICLR2025 is now open to submissions until 02/03 🥳 check our website if you are interested: sites.google.com/view/icbinb-20…

Oliver Wang (@oliver_wang2):

A sister team to ours at Google DeepMind is looking for student researchers this summer. Please reach out if you are a PhD student working on media generation (diffusion models), or if you are a professor with students to recommend! 😀

Chris Rockwell (@_crockwell):

Ever wish YouTube had 3D labels? 🚀Introducing🎥DynPose-100K🎥, an Internet-scale collection of diverse videos annotated with camera pose! Applications include camera-controlled video generation🤩and learned dynamic pose estimation😯 Download: huggingface.co/datasets/nvidi…

Jeongsoo Park (@jespark0):

Can AI image detectors keep up with new fakes? Mostly, no. Existing detectors are trained using a handful of models. But there are thousands in the wild! Our work, Community Forensics, uses 4800+ generators to train detectors that generalize to new fakes. #CVPR2025 🧵 (1/5)
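The core recipe behind such detectors is ordinary supervised training of a real-vs-fake classifier; what changes is the breadth of generators supplying the fake images. A minimal PyTorch sketch with placeholder paths, backbone, and hyperparameters (not the paper's actual code):

```python
import torch
import torch.nn as nn
from torchvision import datasets, transforms, models

tfm = transforms.Compose([transforms.Resize(256), transforms.CenterCrop(224),
                          transforms.ToTensor()])
# Assumed layout: data/real/... and data/fake/<generator_name>/... , where the
# fake/ tree pools images from many different generators. ImageFolder walks
# each class folder recursively and assigns labels alphabetically: fake=0, real=1.
data = datasets.ImageFolder("data", transform=tfm)
loader = torch.utils.data.DataLoader(data, batch_size=64, shuffle=True)

model = models.resnet50(weights="IMAGENET1K_V2")
model.fc = nn.Linear(model.fc.in_features, 1)  # single real-vs-fake logit
opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
loss_fn = nn.BCEWithLogitsLoss()

for images, labels in loader:  # one pass shown; train for multiple epochs
    logits = model(images).squeeze(1)
    loss = loss_fn(logits, labels.float())
    opt.zero_grad()
    loss.backward()
    opt.step()
```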

Yiming Dou (@_yimingdou):

Ever wondered how a scene sounds👂 when you interact👋 with it? Introducing our #CVPR2025 work "Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes" -- we make 3D scene reconstructions audibly interactive! yimingdou.com/hearing_hands/

Ayush Shrivastava (@ayshrv):

Excited to share our CVPR 2025 paper on cross-modal space-time correspondence! We present a method to match pixels across different modalities (RGB-Depth, RGB-Thermal, Photo-Sketch, and cross-style images), trained entirely using unpaired data and self-supervision.
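Once each modality has an encoder producing aligned per-pixel features, correspondence itself reduces to nearest-neighbor search in feature space. The sketch below shows only that matching step; the encoders and their self-supervised training on unpaired data are assumed and not shown.

```python
import torch
import torch.nn.functional as F

def match_pixels(feat_a, feat_b):
    """Match each pixel in image A to its most similar pixel in image B.

    feat_a: (C, H, W) features from modality A's encoder; feat_b likewise,
    assumed here to share the same spatial size for simplicity.
    """
    C, H, W = feat_a.shape
    fa = F.normalize(feat_a.reshape(C, -1), dim=0)  # (C, H*W), unit-norm columns
    fb = F.normalize(feat_b.reshape(C, -1), dim=0)
    sim = fa.t() @ fb                # (H*W, H*W) cosine similarities
    idx = sim.argmax(dim=1)          # best match in B for each pixel of A
    ys, xs = idx // W, idx % W       # unravel flat indices to (row, col)
    return torch.stack([ys, xs], dim=1).reshape(H, W, 2)  # (H, W, 2) coords
```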