
Joan Serrà
@serrjoa
Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine.
ID: 3792459557
https://serrjoa.github.io/ 27-09-2015 11:48:31
3,3K Tweet
2,2K Takipçi
555 Takip Edilen





Arash Vahdat ✈️ #CVPR2025 Heavy-tailed diffusion models: lines of code to improve the ability of your diffusion model to handle extreme events in heavy-tailed distributions. ll;dr: replace you gaussian distribution with a tuned t-student one. Arash Vahdat ✈️ #CVPR2025 #uncv2025 #cvpr2025




🧵(1/6) Delighted to share our ICML Conference 2025 spotlight paper: the Feynman-Kac Correctors (FKCs) in Diffusion Picture this: it’s inference time and we want to generate new samples from our diffusion model. But we don’t want to just copy the training data – we may want to sample








A thread by Alain Riou about our recent ISMIR Conference work, SLAP! paper: arxiv.org/abs/2506.17815 code: github.com/Pliploop/SLAP/… TLDR: Joint multimodal models without negatives (No more contrastive 😈) - Better performance! - Better scalability! - Closed modality gap! 🧵⏬

