Simon Leglaive (@simonleglaive) 's Twitter Profile
Simon Leglaive

@simonleglaive

Assistant Professor at CentraleSupélec in Rennes, France.

Signal processing and machine learning for audio.

ID: 2796314946

linkhttps://sleglaive.github.io calendar_today07-09-2014 17:45:55

123 Tweet

250 Followers

411 Following

Thomas Hueber (@thomashueber) 's Twitter Profile Photo

Open PhD position at GIPSA-lab "Automatic prediction on intonation from speech gestures, application to voice substitution". ML+real-time systems+experiments+speech science! tinyurl.com/mv44h7rx RT appreciated ! #silentpitch CNRS Sciences informatiques @GrenobleINP Université Grenoble Alpes

Signal Processing, Uni Hamburg (@sp_uhh) 's Twitter Profile Photo

𝗣𝗵𝗗 𝗽𝗼𝘀𝗶𝘁𝗶𝗼𝗻 „𝗗𝗶𝗳𝗳𝘂𝘀𝗶𝗼𝗻-𝗕𝗮𝘀𝗲𝗱 𝗗𝗲𝗲𝗽 𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝘃𝗲 𝗠𝗼𝗱𝗲𝗹𝘀 𝗳𝗼𝗿 𝗦𝗽𝗲𝗲𝗰𝗵 𝗦𝗶𝗴𝗻𝗮𝗹 𝗣𝗿𝗼𝗰𝗲𝘀𝘀𝗶𝗻𝗴“. Join us in pushing the state-of-the-art in this exciting field. Details and on how to apply: inf.uni-hamburg.de/en/inst/ab/sp/…

Yoshiaki Bando (@yoshipon0520) 's Twitter Profile Photo

Our paper entitled "Neural Blind Source Separation and Diarization for Distant Speech Recognition" is accepted to Interspeech 2024! Our neural FCASA is a method to jointly separate and diarize speech mixtures without supervision by isolated signals. arxiv.org/abs/2406.08396

Vincent Lostanlen (@lostanlen) 's Twitter Profile Photo

"Model-based deep learning for music information research" to appear in IEEE Signal Processing Magazine with G. Richard, Y.-H. Yang, and M. Müller I wrote about differentiable scattering transforms and perceptual–neural–physical sound matching (PNP) hal.science/hal-04611461/

"Model-based deep learning for music information research"
to appear in IEEE Signal Processing Magazine
with G. Richard, Y.-H. Yang, and M. Müller

I wrote about differentiable scattering transforms and perceptual–neural–physical sound matching (PNP)
hal.science/hal-04611461/
Magdalena Fuentes (@mfu3ntes) 's Twitter Profile Photo

We’re happy to announce our recent JOSS paper! 🎉Soundata: Reproducible use of audio datasets ⚙️github.com/soundata/sound… 📄 joss.theoj.org/papers/10.2110…

Jean-Marie Lemercier (@jm_lemercier) 's Twitter Profile Photo

BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models We present a fully unsupervised method for blind speech dereverberation using a diffusion prior and a parametric subband filter Paper 📜 arxiv.org/abs/2405.04272 Audio/Code 🔊 uhh.de/sp-inf-buddy

BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models

We present a fully unsupervised method for blind speech dereverberation using a diffusion prior and a parametric subband filter

Paper 📜 arxiv.org/abs/2405.04272
Audio/Code 🔊 uhh.de/sp-inf-buddy
Guénolé Fiche (@guenolefiche) 's Twitter Profile Photo

How can learned quantized representations help to address human mesh recovery? In VQ-HPS, to be presented at #ECCV2024, we frame HMR as a classification task in a quantized latent space. (1/6)

How can learned quantized representations help to address human mesh recovery? 

In VQ-HPS, to be presented at #ECCV2024, we frame HMR as a classification task in a quantized latent space. (1/6)
Hend Elghazaly (@htelghazaly) 's Twitter Profile Photo

Thrilled to have contributed to the evaluation of speech enhancement methods in the CHiME-7 UDASE task, now published in Computer Speech & Language. 📚🔊 #CHiMEChallenge Read more: doi.org/10.1016/j.csl.…

Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile Photo

We're organizing a special issue at Computer Speech & Language about Multi-Speaker, Multi-Microphone, and Multi-Modal Distant Speech Recognition. Deadline: December 2, 2024 sciencedirect.com/journal/comput… CHiME Challenge

Gilles Louppe (@glouppe) 's Twitter Profile Photo

Great piece of work led by François Rozet in which we revisit the good old EM algorithm to learn diffusion models from corrupted data only. Bonus: This also includes a new posterior sampling scheme for diffusion models!

Vincent Lostanlen (@lostanlen) 's Twitter Profile Photo

During my postdoc at Cornell (2017–2020), i worked on machine listening of flight calls for bird migration monitoring as part of the NSF project BIRDVOX This IEEE TASLP article concludes the project hal.science/hal-04670882

During my postdoc at Cornell (2017–2020), i worked on machine listening of flight calls for bird migration monitoring as part of the NSF project BIRDVOX

This IEEE TASLP article concludes the project
hal.science/hal-04670882
Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile Photo

We are thrilled to announce the Interspeech 2025 URGENT Challenge, starting on 11/15! Join us in building universal speech enhancement models to tackle in-the-wild speech data using large-scale, multilingual data. Details: urgent-challenge.github.io/urgent2025/

We are thrilled to announce the Interspeech 2025 URGENT Challenge, starting on 11/15! 
Join us in building universal speech enhancement models to tackle in-the-wild speech data using large-scale, multilingual data. Details: urgent-challenge.github.io/urgent2025/
Ed Newton-Rex (@ednewtonrex) 's Twitter Profile Photo

In my keynote at ISMIR Conference yesterday I played video messages from musicians asking the assembled AI researchers not to train on their music without their consent. It’s testament to the respect the ISMIR community has for musicians that the reaction was overwhelmingly positive.

arXiv Sound (@arxivsound) 's Twitter Profile Photo

``AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder,'' Samir Sadok, Simon Leglaive, Laurent Girin, Ga\"el Richard, Xavier Alameda-Pineda, ift.tt/eDFJaWg

Alain Riou (@howariou) 's Twitter Profile Photo

PESTO 2.0 è rilasciato! 🥳🥳🥳 With Brazilian chef Bernardo Torres (and others), we revisit this traditional italian sauce, invented in Milan at ISMIR Conference 2023 🇮🇹 And you can taste it in REAL-TIME at home (~5 ms latency) ⏱️ 1/6

Paola Garcia (@leibnypaola) 's Twitter Profile Photo

CHiME Challenge ⭐⭐ We are happy to announce the release of the tasks for the 9th CHiME Speech Separation and Recognition Challenge (CHiME-9). ⚡⚡ Please visit the CHiME Challenge website for details chimechallenge.org ⚡⚡

Grant Sanderson (@3blue1brown) 's Twitter Profile Photo

New video on the details of diffusion models: youtu.be/iv-5mZ_9CPY Produced by Welch Labs, this is the first in a small series of 3b1b this summer. I enjoyed providing editorial feedback throughout the last several months, and couldn't be happier with the result.

Wen-Chin Huang (@unilightwf) 's Twitter Profile Photo

Enjoyed a great INTERSPEECH 2025 experience! (my first since 2019 at Austria😮‍💨) Kudos to the organizers! Please find our tutorial slides here: voicemos-challenge-2023.github.io/speech-synthes… Also if you work on MOS prediction make sure you check out SHEET! github.com/unilight/sheet

Enjoyed a great INTERSPEECH 2025 experience! (my first since 2019 at Austria😮‍💨) Kudos to the organizers!

Please find our tutorial slides here: voicemos-challenge-2023.github.io/speech-synthes…

Also if you work on MOS prediction make sure you check out SHEET! github.com/unilight/sheet
Thomas Hueber (@thomashueber) 's Twitter Profile Photo

🚨 Open PhD Position (fully funded) – Grenoble, France Join us at GIPSA-lab (CNRS 🌍, Université Grenoble Alpes collab. RobotLearn Research Team @ Inria Grenoble) to explore how Speech Language Models can learn like children: through physical and social interaction🧠🤖🎙️ Details 👉 tinyurl.com/bde988b3

Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile Photo

We are seeking reviewers for speech & language processing at ICASSP'26. Please consider nominating yourself or a colleague, and help spread the word! Reviewing is a great first step to contribute to the community :)