Thomas Pellegrini
@topel290118
ID: 958098903206973441
29-01-2018 22:05:53
109 Tweet
185 Takipçi
261 Takip Edilen
🎹 pyannote + 🗒 notebook = pyannotebook pyannotebook is a custom Project Jupyter widget built on top of #pyannote.core and #wavesurferjs. It can be used to visualize and edit temporal audio labels, without leaving the notebook.
We have 3 open 2-year PostDoc positions to work on ASR, sentiment analysis and driver's preferences modeling, for an in-car voice assistant, Come to work in Toulouse! ANITI Toulouse More info: irit.fr/~Thomas.Pelleg… #postdoc #postdocposition
Retraite : une génération de chercheuses et chercheurs très pénalisée. Les années de postdoc sont la plupart du temps des années blanches... Pour signer la pétition : chng.it/CxfpzC7g via Change.org France
🛠️ "New" audio tagging model on Hugging Face: hf.co/topel/ConvNeXt… - 28 M params - Trained on AudioSet - Test set mAP = 0.471 - provides 768-d global and frame-level embeddings Paper: isca-speech.org/archive/pdfs/i… #dcase #interspeech2023
Help needed! We are looking for reviewers for DCASE related topics for #ICASSP2024 ! If you have received an invitation, please respond to it. If you are not a reviewer but would like to be, contact Annamaria Mesaros or Romain Serizel to see about becoming one.
In this paper we show that is possible to create synthetic 2-speakers conversations with TTS and LLMs and fine-tune successfully Whisper for multi-speaker ASR generalizing well to real-world scenarios: arxiv.org/abs/2408.09215 Examples of such synth data: popcornell.github.io/SynthConvASRDe…