Ziyang Chen (@czyangchen)'s Twitter Profile
Ziyang Chen

@czyangchen

Ph.D. Student at @UMich, advised by @andrewhowens
multimodal learning, audio-visual learning
prev. research intern at @Adobe and @AIatMeta

ID: 1404892067373928448

Link: https://ificl.github.io/ · Joined: 15-06-2021 20:02:35

80 Tweets

358 Followers

413 Following

Sarah Jabbour (@sarahjabbour_)'s Twitter Profile Photo

📢Presenting 𝐃𝐄𝐏𝐈𝐂𝐓: Diffusion-Enabled Permutation Importance for Image Classification Tasks #ECCV2024

We use permutation importance to compute dataset-level explanations for image classifiers using diffusion models (without access to model parameters or training data!)
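
For readers unfamiliar with permutation importance, here is a minimal sketch of the dataset-level version the tweet describes, assuming a hypothetical `inpaint_concept` helper that stands in for DEPICT's diffusion-based editing step; note that only the classifier's predictions are needed, not its parameters or training data.

```python
# Minimal sketch of dataset-level permutation importance for an image
# classifier. `classify` and `inpaint_concept` are hypothetical callables
# supplied by the user; the diffusion-based editing from DEPICT is hidden
# behind `inpaint_concept(image, source_image, concept)`.
import numpy as np

def permutation_importance(classify, images, labels, concept, inpaint_concept, seed=0):
    """Drop in accuracy when `concept` is permuted across the dataset."""
    rng = np.random.default_rng(seed)
    baseline = np.mean([classify(x) == y for x, y in zip(images, labels)])

    # "Permute" the concept: re-render each image with the concept taken
    # from a randomly chosen other image, keeping everything else fixed.
    perm = rng.permutation(len(images))
    edited = [inpaint_concept(x, images[j], concept) for x, j in zip(images, perm)]
    edited_acc = np.mean([classify(x) == y for x, y in zip(edited, labels)])

    return baseline - edited_acc  # large drop => the concept matters
```
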
Ayush Shrivastava (@ayshrv)'s Twitter Profile Photo

We present Global Matching Random Walks, a simple self-supervised approach to the Tracking Any Point (TAP) problem, accepted to #ECCV2024. We train a global matching transformer to find cycle consistent tracks through video via contrastive random walks (CRW).
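
For readers unfamiliar with contrastive random walks, below is a hedged sketch of a CRW-style cycle-consistency loss on per-frame point features: walk forward through frame-to-frame affinities, walk back, and ask each point to land where it started. The global matching transformer that produces the features is abstracted away, and this is not the paper's exact objective.

```python
# Hedged sketch of a contrastive random walk (CRW) cycle-consistency loss.
# `feats` is a list of per-frame point features, each of shape (N_t, D),
# assumed L2-normalized.
import torch
import torch.nn.functional as F

def crw_cycle_loss(feats, temperature=0.07):
    def transition(a, b):
        # Row-stochastic affinity from points in frame a to points in frame b.
        return F.softmax(a @ b.t() / temperature, dim=1)

    # Walk forward t0 -> t1 -> ... -> tK, then backward to t0.
    walk = torch.eye(feats[0].shape[0], device=feats[0].device)
    for a, b in zip(feats[:-1], feats[1:]):
        walk = walk @ transition(a, b)
    for a, b in zip(reversed(feats[1:]), reversed(feats[:-1])):
        walk = walk @ transition(a, b)

    # Cycle consistency: each starting point should return to itself.
    targets = torch.arange(walk.shape[0], device=walk.device)
    return F.nll_loss(torch.log(walk + 1e-8), targets)
```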

Daniel Geng (@dangengdg)'s Twitter Profile Photo

What happens when you train a video generation model to be conditioned on motion? Turns out you can perform "motion prompting," just like you might prompt an LLM! Doing so enables many different capabilities. Here are a few examples – check out this thread 🧵 for more results!

hugo flores garcía 🌻 (@hugggof)'s Twitter Profile Photo

new paper! 🗣️Sketch2Sound💥 Sketch2Sound can create sounds from sonic imitations (i.e., a vocal imitation or a reference sound) via interpretable, time-varying control signals. paper: arxiv.org/abs/2412.08550 web: hugofloresgarcia.art/sketch2sound
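
As a sketch of what "interpretable, time-varying control signals" can look like in practice, the snippet below extracts frame-wise loudness and spectral-centroid curves from a vocal imitation with librosa; this is an assumption-laden illustration, not Sketch2Sound's actual feature pipeline, and the generative model conditioned on these curves is not shown.

```python
# Hedged sketch: extract two interpretable, time-varying control curves
# (loudness in dB and spectral centroid in Hz) from a sonic imitation.
import librosa
import numpy as np

def control_signals(path, hop_length=512):
    y, sr = librosa.load(path, sr=None, mono=True)

    # Frame-wise loudness (RMS energy) converted to dB.
    rms = librosa.feature.rms(y=y, hop_length=hop_length)[0]
    loudness_db = librosa.amplitude_to_db(rms, ref=np.max)

    # Frame-wise spectral centroid as a rough "brightness" curve.
    centroid_hz = librosa.feature.spectral_centroid(y=y, sr=sr, hop_length=hop_length)[0]

    times = librosa.frames_to_time(np.arange(len(rms)), sr=sr, hop_length=hop_length)
    return times, loudness_db, centroid_hz
```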

Linyi Jin (@jin_linyi)'s Twitter Profile Photo

Introducing 👀Stereo4D👀 A method for mining 4D from internet stereo videos. It enables large-scale, high-quality, dynamic, *metric* 3D reconstructions, with camera poses and long-term 3D motion trajectories. We used Stereo4D to make a dataset of over 100k real-world 4D scenes.
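
For context on why stereo video yields *metric* geometry, here is a minimal back-projection sketch using the standard disparity-to-depth relation (depth = focal × baseline / disparity); the actual Stereo4D pipeline (pose estimation, long-term track fusion, filtering) is far more involved and is not reproduced here.

```python
# Hedged sketch of metric back-projection from a calibrated stereo pair.
import numpy as np

def backproject_stereo(disparity, fx, fy, cx, cy, baseline_m):
    """disparity: (H, W) array in pixels; returns (H, W, 3) points in meters."""
    h, w = disparity.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))

    depth = np.full_like(disparity, np.nan, dtype=np.float64)
    valid = disparity > 0
    depth[valid] = fx * baseline_m / disparity[valid]  # metric depth

    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    return np.stack([x, y, depth], axis=-1)
```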

Daniel Geng (@dangengdg)'s Twitter Profile Photo

I'll be presenting "Images that Sound" today at #NeurIPS2024! East Exhibit Hall A-C #2710. Come say hi to me and Andrew Owens :) (Ziyang Chen sadly could not make it, but will be there in spirit :') )

Sarah Jabbour (@sarahjabbour_)'s Twitter Profile Photo

I’m on the PhD internship market for Spr/Summer 2025! I have experience in multimodal AI (EHR, X-ray, text), explainability for image models w/ genAI, clinician-AI interaction (surveyed 700+ doctors), and tabular foundation models. Please reach out if you think there’s a fit!

Tiange Luo (@tiangeluo)'s Twitter Profile Photo

Will VLMs adhere strictly to their learned priors, unable to perform visual reasoning on content that never existed on the Internet? We propose ViLP, a benchmark designed to probe the visual-language priors of VLMs by constructing Question-Image-Answer triplets that deliberately…
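
One simple way to probe such priors, sketched below under assumed interfaces, is to compare a VLM's answer with and without the image for each Question-Image-Answer triplet; `vlm_answer` is a hypothetical callable, and the real ViLP protocol may differ.

```python
# Hedged sketch: if a model answers the same way with and without the image,
# it is leaning on its learned priors rather than the visual evidence.
from dataclasses import dataclass

@dataclass
class QIATriplet:
    question: str
    image_path: str   # image whose content contradicts the "prior" answer
    answer: str       # correct answer, only recoverable from the image

def prior_reliance(vlm_answer, triplets):
    """Fraction of triplets where the image changes nothing (prior-driven)."""
    prior_driven = 0
    for t in triplets:
        blind = vlm_answer(t.question, image=None)         # text-only answer
        grounded = vlm_answer(t.question, image=t.image_path)
        prior_driven += int(blind == grounded != t.answer)  # unchanged and wrong
    return prior_driven / len(triplets)
```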

Luma AI (@lumalabsai)'s Twitter Profile Photo

Introducing Modify Video. Reimagine any video. Shoot it in post with director-grade control over style, character, and setting. Restyle expressive performances, swap entire worlds, or redesign the frame to your vision. Shoot once. Shape infinitely.

Phillip Isola (@phillip_isola)'s Twitter Profile Photo

In arxiv.org/abs/2510.02425, we find if you ask an LLM to “imagine seeing,” then how it processes text becomes more like how a vision system would represent that same scene.

If you ask it to “imagine hearing,” its representation becomes more like that of an auditory model.

3/9
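
A common way to quantify this kind of cross-modal representational similarity is linear CKA over paired embeddings; the sketch below illustrates that comparison and is not necessarily the metric used in the paper. Here `X` would hold LLM embeddings of scene descriptions (optionally prefixed with an "imagine seeing ..." prompt) and `Y` a vision model's embeddings of images of the same scenes, with rows paired by scene.

```python
# Hedged sketch of linear Centered Kernel Alignment (CKA) between two
# embedding matrices X (n, d1) and Y (n, d2) with rows paired by example.
import numpy as np

def linear_cka(X, Y):
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    hsic = np.linalg.norm(Y.T @ X, "fro") ** 2
    norm_x = np.linalg.norm(X.T @ X, "fro")
    norm_y = np.linalg.norm(Y.T @ Y, "fro")
    return hsic / (norm_x * norm_y)  # 1.0 means identical (up to linear maps)
```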