Ayush Shrivastava (@ayshrv) 's Twitter Profile
Ayush Shrivastava

@ayshrv

PhD at UMich, Georgia Tech & IIT (BHU) Varanasi Alum.

ID: 2926810536

http://ayshrv.com/ · Joined 11-12-2014 13:43:00

78 Tweets

396 Followers

1.1K Following

Sara Beery (@sarameghanbeery) 's Twitter Profile Photo

At #CVPR2023 we're hosting *Scholars and Big Models* -- a forum to discuss recent rapid changes in CV and how our academic community can adapt and thrive. 

Our panelists will discuss questions raised by you!! Make your voice heard 👇
forms.gle/hVDg4EhrDXuahF…
AK (@_akhaliq) 's Twitter Profile Photo

Seeing the World through Your Eyes. Paper page: huggingface.co/papers/2306.09… The reflective nature of the human eye is an underappreciated source of information about what the world around us looks like. By imaging the eyes of a moving person, we can collect multiple views of a scene.

Nilesh Kulkarni (@_nileshk) 's Twitter Profile Photo

📢 Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data #CVPR2023. We train a model to predict a 3D implicit function from a single input image. This model is directly trained on raw RGB-D data. Website: nileshkulkarni.github.io/d2drdf Paper: arxiv.org/abs/2306.08671 (1/N)

Taranjeet (@taranjeetio) 's Twitter Profile Photo

🚀 We've hit some big milestones with embedchain: • 300K apps • 53K downloads • 5.7K GitHub stars. Every step taught us something new. Today, we're taking those lessons & introducing a platform to manage data for LLM apps. No waitlist, link in next tweet 👇🏻👇🏻👇🏻

Naihao(Neo) Deng (@naihaodeng) 's Twitter Profile Photo

Annotator disagreement is common in NLP, but is it just noise? We are introducing a new strategy for annotator representation to help models better learn from data that has inherent disagreements. GitHub code: github.com/MichiganNLP/An…

Chris Rockwell (@_crockwell) 's Twitter Profile Photo

๐Ÿ“ข Presenting ๐…๐€๐‘: ๐…๐ฅ๐ž๐ฑ๐ข๐›๐ฅ๐ž, ๐€๐œ๐œ๐ฎ๐ซ๐š๐ญ๐ž ๐š๐ง๐ ๐‘๐จ๐›๐ฎ๐ฌ๐ญ ๐Ÿ”๐ƒ๐จ๐… ๐‘๐ž๐ฅ๐š๐ญ๐ข๐ฏ๐ž ๐‚๐š๐ฆ๐ž๐ซ๐š ๐๐จ๐ฌ๐ž ๐„๐ฌ๐ญ๐ข๐ฆ๐š๐ญ๐ข๐จ๐ง #CVPR2024 FAR builds upon complimentary Solver and Learning-Based works yielding accurate *and* robust pose! crockwell.github.io/far/

๐Ÿ“ข Presenting ๐…๐€๐‘: ๐…๐ฅ๐ž๐ฑ๐ข๐›๐ฅ๐ž, ๐€๐œ๐œ๐ฎ๐ซ๐š๐ญ๐ž ๐š๐ง๐ ๐‘๐จ๐›๐ฎ๐ฌ๐ญ ๐Ÿ”๐ƒ๐จ๐… ๐‘๐ž๐ฅ๐š๐ญ๐ข๐ฏ๐ž ๐‚๐š๐ฆ๐ž๐ซ๐š ๐๐จ๐ฌ๐ž ๐„๐ฌ๐ญ๐ข๐ฆ๐š๐ญ๐ข๐จ๐ง #CVPR2024

FAR builds upon complementary solver-based and learning-based works, yielding accurate *and* robust pose!

crockwell.github.io/far/
Daniel Geng (@dangengdg) 's Twitter Profile Photo

What do you see in these images? These are called hybrid images, originally proposed by Aude Oliva et al. They change appearance depending on size or viewing distance, and are just one kind of perceptual illusion that our method, Factorized Diffusion, can make.
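
For readers curious how a classic hybrid image is built, here is a minimal sketch of the Oliva et al.-style construction (low spatial frequencies from one image plus high spatial frequencies from another). It illustrates the idea only and is not the Factorized Diffusion method from the tweet; the function name and blur level are illustrative assumptions.

```python
# Minimal sketch of the classic hybrid-image construction: low spatial frequencies
# from one image combined with high spatial frequencies from another.
# Illustrative only; not the Factorized Diffusion method described in the tweet.
import numpy as np
from scipy.ndimage import gaussian_filter

def hybrid_image(img_far, img_near, sigma=6.0):
    """img_far dominates from a distance (coarse structure), img_near up close (fine detail).
    Both inputs are assumed to be equal-size grayscale float arrays in [0, 1]."""
    low = gaussian_filter(img_far, sigma)                # keep only coarse structure
    high = img_near - gaussian_filter(img_near, sigma)   # keep only fine detail
    return np.clip(low + high, 0.0, 1.0)
```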

Ziyang Chen (@czyangchen) 's Twitter Profile Photo

These spectrograms look like images, but can also be played as a sound! We call these images that sound. How do we make them? Look and listen below to find out, and to see more examples!
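
As a rough illustration of why a spectrogram-shaped image can be "played", here is a hedged sketch that treats a grayscale image as an STFT magnitude spectrogram and inverts it to a waveform with Griffin-Lim. The paper's actual method generates these spectrograms with diffusion models; the image shape and parameters below are assumptions.

```python
# Hedged sketch: treat a grayscale image as an STFT magnitude spectrogram and
# invert it to audio with Griffin-Lim. This only illustrates why such images can
# be "played"; it is not the generation method from the tweet.
import numpy as np
import librosa

def image_to_audio(img, n_iter=64):
    """img: float array in [0, 1], assumed shape (n_fft // 2 + 1, num_frames)."""
    mag = np.flipud(img).astype(np.float32)  # map the image's bottom row to the lowest frequency bin
    return librosa.griffinlim(mag, n_iter=n_iter)
```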

Andrew Owens (@andrewhowens) 's Twitter Profile Photo

In case you were wondering what's going on with the back of the #CVPR2024 T-shirt: it's a hybrid image made by Aaron Inbum Park and Daniel Geng! When you look at it up close, you'll just see the Seattle skyline, but when you view it from a distance, the text "CVPR" should appear.

Sarah Jabbour (@sarahjabbour_) 's Twitter Profile Photo

๐Ÿ“ขPresenting ๐ƒ๐„๐๐ˆ๐‚๐“: Diffusion-Enabled Permutation Importance for Image Classification Tasks #ECCV2024 We use permutation importance to compute dataset-level explanations for image classifiers using diffusion models (without access to model parameters or training data!)

๐Ÿ“ขPresenting ๐ƒ๐„๐๐ˆ๐‚๐“: Diffusion-Enabled Permutation Importance for Image Classification Tasks #ECCV2024

We use permutation importance to compute dataset-level explanations for image classifiers using diffusion models (without access to model parameters or training data!)
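
For context, here is a hedged sketch of the generic permutation-importance idea that DEPICT adapts to images: shuffle one feature across the dataset and measure how much the model's score drops. This is the classic tabular formulation, not the diffusion-based procedure from the paper; the function and argument names are illustrative.

```python
# Hedged sketch of classic permutation importance (the idea DEPICT adapts to images):
# permute one feature across the dataset and measure the resulting drop in score.
# Not the diffusion-based procedure from the paper.
import numpy as np

def permutation_importance(model, X, y, score_fn, n_repeats=5, seed=0):
    rng = np.random.default_rng(seed)
    baseline = score_fn(y, model.predict(X))
    importances = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        drops = []
        for _ in range(n_repeats):
            X_perm = X.copy()
            X_perm[:, j] = rng.permutation(X_perm[:, j])  # break feature j's link to the labels
            drops.append(baseline - score_fn(y, model.predict(X_perm)))
        importances[j] = np.mean(drops)                   # bigger drop = more important feature
    return importances
```
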
Zhenjun Zhao (@zhenjun_zhao) 's Twitter Profile Photo

Self-Supervised Any-Point Tracking by Contrastive Random Walks

Ayush Shrivastava, Andrew Owens

tl;dr: global matching transformer -> self-attention -> transition matrix -> contrastive random walk -> cycle-consistent track

arxiv.org/pdf/2409.16288
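
To unpack the tl;dr pipeline, here is a hedged PyTorch-style sketch of the contrastive random walk objective it refers to: frame-to-frame feature affinities become row-stochastic transition matrices, the walk runs forward and then backward in time, and the round trip is trained to land back where it started. The temperature, feature shapes, and function names are assumptions, not the paper's implementation.

```python
# Hedged sketch of the cycle-consistent contrastive random walk named in the tl;dr.
# Feature extraction, temperature, and naming are illustrative assumptions,
# not the paper's implementation.
import torch
import torch.nn.functional as F

def transition_matrix(feats_a, feats_b, tau=0.07):
    """Row-stochastic transition probabilities from points in frame A to points in frame B.
    feats_*: (N, D) point features for one frame."""
    affinity = F.normalize(feats_a, dim=-1) @ F.normalize(feats_b, dim=-1).T  # (N, N) cosine similarity
    return F.softmax(affinity / tau, dim=-1)

def cycle_consistency_loss(frame_feats):
    """Walk forward through the frames and back; the round-trip transition matrix
    should be (close to) the identity, i.e. every point returns to itself."""
    path = frame_feats + frame_feats[-2::-1]      # forward in time, then backward
    walk = None
    for a, b in zip(path[:-1], path[1:]):
        step = transition_matrix(a, b)
        walk = step if walk is None else walk @ step
    targets = torch.arange(walk.shape[0], device=walk.device)
    return F.nll_loss(torch.log(walk + 1e-8), targets)
```
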
Andrew Owens (@andrewhowens) 's Twitter Profile Photo

At #ECCV2024: a very simple, self-supervised tracking method! We train a transformer to perform all-pairs matching using the contrastive random walk. If you want to learn more, please come to our poster at 10:30am on Thursday (#214). w/ Ayush Shrivastava x.com/ayshrv/status/…

Prithvijit (@prithvijitch) 's Twitter Profile Photo

Join us at the WorldModelBench workshop at #CVPR2025 where we'll tackle systematic evaluation of World Models! Focus: benchmarks, metrics, downstream tasks, and safety. Submit papers now: worldmodelbench.github.io

Prithvijit (@prithvijitch) 's Twitter Profile Photo

Check out Cosmos-Reason1, a reasoning VLM from our team for:
- Physical Commonsense Reasoning (spatial, temporal, intuitive physics)
- Embodied Reasoning (verifying task completion, action affordance, and next plausible action prediction)
Models, data curation and benchmarks

Stefan Stojanov (@sstj389) 's Twitter Profile Photo

Video prediction foundation models implicitly learn how objects move in videos. Can we learn how to extract these representations to accurately track objects in videos _without_ any supervision? Yes! 🧵 Work done with: Rahul Venkatesh, Seungwoo (Simon) Kim, Jiajun Wu and Daniel Yamins