Thomas Pellegrini (@topel290118) 's Twitter Profile
Thomas Pellegrini

@topel290118

ID: 958098903206973441

calendar_today29-01-2018 22:05:53

109 Tweet

185 Takipçi

261 Takip Edilen

DCASE Workshop (@dcase_workshop) 's Twitter Profile Photo

Let's get a drink at the Stanislas place in the Nancy Opera House ! 🥂 The place just being a world UNESCO heritage site 🙄

Let's get a drink at the Stanislas place in the Nancy Opera House ! 🥂

The place just being a world UNESCO heritage site 🙄
Hervé "pyannote" Bredin (@hbredin) 's Twitter Profile Photo

🎹 pyannote + 🗒 notebook = pyannotebook pyannotebook is a custom Project Jupyter widget built on top of #pyannote.core and #wavesurferjs. It can be used to visualize and edit temporal audio labels, without leaving the notebook.

🎹 pyannote + 🗒 notebook = pyannotebook

pyannotebook is a custom <a href="/ProjectJupyter/">Project Jupyter</a> widget built on top of #pyannote.core and #wavesurferjs.

It can be used to visualize and edit temporal audio labels, without leaving the notebook.
Thomas Pellegrini (@topel290118) 's Twitter Profile Photo

Etienne made a PyTorch package with datasets and dataloaders for Audio Captioning datasets (AudioCaps, Clotho, MACS for now): github.com/Labbeti/aac-da… pip install aac-datasets might be useful to AAC people

ANITI Toulouse (@aniti_toulouse) 's Twitter Profile Photo

📣Following its successful evaluation, we anticipate a continuation of #ANITI in a form that we provisionally call 𝘈𝘕𝘐𝘛𝘐 2.0. ⚡️This call for chair will allow researchers to initiate the construction of their projects and submit their proposal ➡️aniti.univ-toulouse.fr/recherche-ia/c…

📣Following its successful evaluation, we anticipate a continuation of #ANITI in a form that we provisionally call 𝘈𝘕𝘐𝘛𝘐 2.0.

⚡️This call for chair will allow researchers to initiate the construction of their projects and submit their proposal

➡️aniti.univ-toulouse.fr/recherche-ia/c…
Hervé "pyannote" Bredin (@hbredin) 's Twitter Profile Photo

🫂Enjoying #pyannote speaker diarization pipelines? 👥What if it could do speaker separation as well? 🗣️Share the word if you want to make it happen! 🥳We are hiring! Twice! 👇

Thomas Pellegrini (@topel290118) 's Twitter Profile Photo

My language-based audio retrieval system, used in DCASE 2022 Task 6b, is now open-sourced: github.com/topel/IRIT-aud… Its originality was the use of AudioSet tags textual embeddings to describe the audio content Paper: ut3-toulouseinp.hal.science/hal-03812737/d…

Thomas Pellegrini (@topel290118) 's Twitter Profile Photo

We have 3 open 2-year PostDoc positions to work on ASR, sentiment analysis and driver's preferences modeling, for an in-car voice assistant, Come to work in Toulouse! ANITI Toulouse More info: irit.fr/~Thomas.Pelleg… #postdoc #postdocposition

Thomas Pellegrini (@topel290118) 's Twitter Profile Photo

Retraite : une génération de chercheuses et chercheurs très pénalisée. Les années de postdoc sont la plupart du temps des années blanches... Pour signer la pétition : chng.it/CxfpzC7g via Change.org France

Hervé "pyannote" Bredin (@hbredin) 's Twitter Profile Photo

I am considering starting an open-source business around #pyannote open-source state-of-the-art speaker diarization toolkit. Please help me make the right decisions by either filling this form and/or retweeting: forms.gle/eKhn7H2zTa68sM… A few promising stats:

Thomas Pellegrini (@topel290118) 's Twitter Profile Photo

🛠️ "New" audio tagging model on Hugging Face: hf.co/topel/ConvNeXt… - 28 M params - Trained on AudioSet - Test set mAP = 0.471 - provides 768-d global and frame-level embeddings Paper: isca-speech.org/archive/pdfs/i… #dcase #interspeech2023

DCASE Workshop (@dcase_workshop) 's Twitter Profile Photo

Help needed! We are looking for reviewers for DCASE related topics for #ICASSP2024 ! If you have received an invitation, please respond to it. If you are not a reviewer but would like to be, contact Annamaria Mesaros or Romain Serizel to see about becoming one.

Jantina Tammes School: Digital Society, Tech & AI (@jtschool_ug) 's Twitter Profile Photo

📢20 November 2023, 15h30, we're organising a keynote lecture and panel discussion with prof. Casilli. He will show how historical global inequalities shape international #digital labour and #data supply chains. 🖱️ Moderator Michele Mole 👉tinyurl.com/2p9bxtzc #AI

📢20 November 2023, 15h30, we're organising a keynote lecture and panel discussion with prof. <a href="/AntonioCasilli/">Casilli</a>.  He will show how historical global inequalities shape international #digital labour and #data supply chains. 🖱️
Moderator <a href="/michele_mole/">Michele Mole</a>  
👉tinyurl.com/2p9bxtzc
#AI
Thomas Pellegrini (@topel290118) 's Twitter Profile Photo

Machine listening people, please consider participating to the audio captioning task of DCASE. A new baseline system is provided: CNext-trans, 28M params, 29.6% SPIDEr-FL score on Clotho-eval dcase.community/challenge2024/… github.com/Labbeti/dcase2… #DCASE #audiocaptioning

Machine listening people, please consider participating to the audio captioning task of DCASE. A new baseline system is provided: CNext-trans, 28M params, 29.6% SPIDEr-FL score on Clotho-eval

dcase.community/challenge2024/…

github.com/Labbeti/dcase2…

#DCASE #audiocaptioning
Hervé "pyannote" Bredin (@hbredin) 's Twitter Profile Photo

#pyannote on iOS 📱/ macOS 💻 anyone? Call for tender has just been published (20 days left) achatpublic.com/sdm/ent2/gen/f… Please share if you'd like to see this happen 🙏

NVIDIA AI Developer (@nvidiaaidev) 's Twitter Profile Photo

👀 Exciting advancements in Automated Audio Captioning.✨ The CMU-NVIDIA team has unveiled a groundbreaking approach that leverages multi-agent collaboration and GPU technology to enhance audio-to-text systems. 🔍 Key innovations include: ✅ Multi-encoder fusion: Combining

👀 Exciting advancements in Automated Audio Captioning.✨ The CMU-NVIDIA team has unveiled a groundbreaking approach that leverages multi-agent collaboration and GPU technology to enhance audio-to-text systems.  

🔍 Key innovations include:  

✅ Multi-encoder fusion: Combining
Samuele Cornell (@samuelecornell) 's Twitter Profile Photo

In this paper we show that is possible to create synthetic 2-speakers conversations with TTS and LLMs and fine-tune successfully Whisper for multi-speaker ASR generalizing well to real-world scenarios: arxiv.org/abs/2408.09215 Examples of such synth data: popcornell.github.io/SynthConvASRDe…