Simon Leglaive (@simonleglaive) Twitter Tweets • TwiCopy

Thomas Hueber

2 years ago

Open PhD position at GIPSA-lab "Automatic prediction on intonation from speech gestures, application to voice substitution". ML+real-time systems+experiments+speech science! tinyurl.com/mv44h7rx RT appreciated! #silentpitch CNRS Sciences informatiques @GrenobleINP Université Grenoble Alpes

Signal Processing, Uni Hamburg

@sp_uhh

2 years ago

PhD position "Diffusion-Based Deep Generative Models for Speech Signal Processing". Join us in pushing the state-of-the-art in this exciting field. Details and how to apply: inf.uni-hamburg.de/en/inst/ab/sp/…

Yoshiaki Bando

@yoshipon0520

2 years ago

Our paper entitled "Neural Blind Source Separation and Diarization for Distant Speech Recognition" is accepted to Interspeech 2024! Our neural FCASA is a method to jointly separate and diarize speech mixtures without supervision by isolated signals. arxiv.org/abs/2406.08396

Vincent Lostanlen

@lostanlen

2 years ago

"Model-based deep learning for music information research" to appear in IEEE Signal Processing Magazine with G. Richard, Y.-H. Yang, and M. Müller. I wrote about differentiable scattering transforms and perceptual–neural–physical sound matching (PNP): hal.science/hal-04611461/

Magdalena Fuentes

@mfu3ntes

a year ago

We’re happy to announce our recent JOSS paper! 🎉 Soundata: Reproducible use of audio datasets ⚙️ github.com/soundata/sound… 📄 joss.theoj.org/papers/10.2110…

Jean-Marie Lemercier

@jm_lemercier

a year ago

BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models. We present a fully unsupervised method for blind speech dereverberation using a diffusion prior and a parametric subband filter. Paper 📜 arxiv.org/abs/2405.04272 Audio/Code 🔊 uhh.de/sp-inf-buddy

Guénolé Fiche

@guenolefiche

a year ago

How can learned quantized representations help to address human mesh recovery? In VQ-HPS, to be presented at #ECCV2024, we frame HMR as a classification task in a quantized latent space. (1/6)

Hend Elghazaly

@htelghazaly

a year ago

Thrilled to have contributed to the evaluation of speech enhancement methods in the CHiME-7 UDASE task, now published in Computer Speech & Language. 📚🔊 #CHiMEChallenge Read more: doi.org/10.1016/j.csl.…

Shinji Watanabe

@shinjiw_at_cmu

a year ago

We're organizing a special issue at Computer Speech & Language about Multi-Speaker, Multi-Microphone, and Multi-Modal Distant Speech Recognition. Deadline: December 2, 2024 sciencedirect.com/journal/comput… CHiME Challenge

Gilles Louppe

@glouppe

a year ago

Great piece of work led by François Rozet in which we revisit the good old EM algorithm to learn diffusion models from corrupted data only. Bonus: This also includes a new posterior sampling scheme for diffusion models!

Vincent Lostanlen

@lostanlen

a year ago

During my postdoc at Cornell (2017–2020), I worked on machine listening of flight calls for bird migration monitoring as part of the NSF project BIRDVOX. This IEEE TASLP article concludes the project: hal.science/hal-04670882

Shinji Watanabe

@shinjiw_at_cmu

a year ago

We are thrilled to announce the Interspeech 2025 URGENT Challenge, starting on 11/15! Join us in building universal speech enhancement models to tackle in-the-wild speech data using large-scale, multilingual data. Details: urgent-challenge.github.io/urgent2025/

Ed Newton-Rex

@ednewtonrex

a year ago

In my keynote at ISMIR Conference yesterday I played video messages from musicians asking the assembled AI researchers not to train on their music without their consent. It’s testament to the respect the ISMIR community has for musicians that the reaction was overwhelmingly positive.

arXiv Sound

@arxivsound

a year ago

"AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder," Samir Sadok, Simon Leglaive, Laurent Girin, Gaël Richard, Xavier Alameda-Pineda, ift.tt/eDFJaWg

Alain Riou

@howariou

10 months ago

PESTO 2.0 is released! 🥳🥳🥳 With Brazilian chef Bernardo Torres (and others), we revisit this traditional Italian sauce, invented in Milan at ISMIR Conference 2023 🇮🇹 And you can taste it in REAL-TIME at home (~5 ms latency) ⏱️ 1/6

Paola Garcia

@leibnypaola

5 months ago

CHiME Challenge ⭐⭐ We are happy to announce the release of the tasks for the 9th CHiME Speech Separation and Recognition Challenge (CHiME-9). ⚡⚡ Please visit the CHiME Challenge website for details chimechallenge.org ⚡⚡

Grant Sanderson

@3blue1brown

5 months ago

New video on the details of diffusion models: youtu.be/iv-5mZ_9CPY Produced by Welch Labs, this is the first in a small series of 3b1b this summer. I enjoyed providing editorial feedback throughout the last several months, and couldn't be happier with the result.

Wen-Chin Huang

@unilightwf

4 months ago

Enjoyed a great INTERSPEECH 2025 experience! (my first since 2019 in Austria😮‍💨) Kudos to the organizers! Please find our tutorial slides here: voicemos-challenge-2023.github.io/speech-synthes… Also, if you work on MOS prediction, make sure you check out SHEET! github.com/unilight/sheet

Thomas Hueber

@thomashueber

4 months ago

🚨 Open PhD Position (fully funded) – Grenoble, France Join us at GIPSA-lab (CNRS 🌍, Université Grenoble Alpes collab. RobotLearn Research Team @ Inria Grenoble) to explore how Speech Language Models can learn like children: through physical and social interaction🧠🤖🎙️ Details 👉 tinyurl.com/bde988b3

Shinji Watanabe

@shinjiw_at_cmu

3 months ago

We are seeking reviewers for speech & language processing at ICASSP'26. Please consider nominating yourself or a colleague, and help spread the word! Reviewing is a great first step to contribute to the community :)
