
Sophia Sirko-Galouchenko
@sophia_sirko
PhD student in visual representation learning at Valeo.ai and Sorbonne Université (MLIA)
ID: 3140741211
06-04-2015 15:31:35
11 Tweets
59 Followers
138 Following

The preprint of our work (with Salah Zaiem and Robin Algayres) on sample-dependent ASR model selection is available on arXiv! In this paper, we propose training a decision module that, given an audio sample, selects the smallest model sufficient for a good transcription.

You want to give audio abilities to your VLM without compromising its vision performance? You want to align your audio encoder with a pretrained image encoder without suffering from the modality gap? Check out our #NeurIPS2024 paper with Michel Olvera, Stéphane LATHUILIÈRE, and Slim Essid.



1/ New & old work on self-supervised representation learning (SSL) with ViTs: MOCA ☕ - Predicting Masked Online Codebook Assignments, w/ Spyros Gidaris, Oriane Siméoni, Antonín Vobecký, Matthieu Cord, N. Komodakis, P. Pérez. #TMLR #ICLR2025 Grab a ☕ and brace for a story & a 🧵
