Thomas Pellegrini (@topel290118) Twitter Tweets • TwiCopy

DCASE Workshop

4 years ago

Let's get a drink at the Stanislas place in the Nancy Opera House ! 🥂 The place just being a world UNESCO heritage site 🙄

thumb_up_off_alt30

chat_bubble_outline0

repeat8

shareShare

🎹 pyannote + 🗒 notebook = pyannotebook pyannotebook is a custom Project Jupyter widget built on top of #pyannote.core and #wavesurferjs. It can be used to visualize and edit temporal audio labels, without leaving the notebook.

🎹 pyannote + 🗒 notebook = pyannotebook

pyannotebook is a custom <a href="/ProjectJupyter/">Project Jupyter</a> widget built on top of #pyannote.core and #wavesurferjs.

It can be used to visualize and edit temporal audio labels, without leaving the notebook.

thumb_up_off_alt81

chat_bubble_outline3

repeat13

shareShare

Thomas Pellegrini

@topel290118

3 years ago

Etienne made a PyTorch package with datasets and dataloaders for Audio Captioning datasets (AudioCaps, Clotho, MACS for now): github.com/Labbeti/aac-da… pip install aac-datasets might be useful to AAC people

thumb_up_off_alt27

chat_bubble_outline0

repeat5

shareShare

ANITI Toulouse

@aniti_toulouse

3 years ago

📣Following its successful evaluation, we anticipate a continuation of #ANITI in a form that we provisionally call 𝘈𝘕𝘐𝘛𝘐 2.0. ⚡️This call for chair will allow researchers to initiate the construction of their projects and submit their proposal ➡️aniti.univ-toulouse.fr/recherche-ia/c…

thumb_up_off_alt4

chat_bubble_outline2

repeat3

shareShare

Hervé "pyannote" Bredin

@hbredin

3 years ago

🫂Enjoying #pyannote speaker diarization pipelines? 👥What if it could do speaker separation as well? 🗣️Share the word if you want to make it happen! 🥳We are hiring! Twice! 👇

thumb_up_off_alt23

chat_bubble_outline1

repeat9

shareShare

Thomas Pellegrini

@topel290118

3 years ago

My language-based audio retrieval system, used in DCASE 2022 Task 6b, is now open-sourced: github.com/topel/IRIT-aud… Its originality was the use of AudioSet tags textual embeddings to describe the audio content Paper: ut3-toulouseinp.hal.science/hal-03812737/d…

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Thomas Pellegrini

@topel290118

3 years ago

We have 3 open 2-year PostDoc positions to work on ASR, sentiment analysis and driver's preferences modeling, for an in-car voice assistant, Come to work in Toulouse! ANITI Toulouse More info: irit.fr/~Thomas.Pelleg… #postdoc #postdocposition

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Thomas Pellegrini

@topel290118

3 years ago

Retraite : une génération de chercheuses et chercheurs très pénalisée. Les années de postdoc sont la plupart du temps des années blanches... Pour signer la pétition : chng.it/CxfpzC7g via Change.org France

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Hervé "pyannote" Bredin

@hbredin

3 years ago

Would you pay for #pyannote custom or premium models? #AskingForAFriend (I'd appreciate a few ReTweets 🙏)

thumb_up_off_alt15

chat_bubble_outline6

repeat27

shareShare

Hervé "pyannote" Bredin

@hbredin

3 years ago

I am considering starting an open-source business around #pyannote open-source state-of-the-art speaker diarization toolkit. Please help me make the right decisions by either filling this form and/or retweeting: forms.gle/eKhn7H2zTa68sM… A few promising stats:

thumb_up_off_alt38

chat_bubble_outline3

repeat16

shareShare

Thomas Pellegrini

@topel290118

3 years ago

🛠️ "New" audio tagging model on Hugging Face: hf.co/topel/ConvNeXt… - 28 M params - Trained on AudioSet - Test set mAP = 0.471 - provides 768-d global and frame-level embeddings Paper: isca-speech.org/archive/pdfs/i… #dcase #interspeech2023

thumb_up_off_alt37

chat_bubble_outline0

repeat8

shareShare

DCASE Workshop

@dcase_workshop

3 years ago

Help needed! We are looking for reviewers for DCASE related topics for #ICASSP2024 ! If you have received an invitation, please respond to it. If you are not a reviewer but would like to be, contact Annamaria Mesaros or Romain Serizel to see about becoming one.

thumb_up_off_alt13

chat_bubble_outline0

repeat9

shareShare

Jantina Tammes School: Digital Society, Tech & AI

@jtschool_ug

3 years ago

📢20 November 2023, 15h30, we're organising a keynote lecture and panel discussion with prof. Casilli. He will show how historical global inequalities shape international #digital labour and #data supply chains. 🖱️ Moderator Michele Mole 👉tinyurl.com/2p9bxtzc #AI

📢20 November 2023, 15h30, we're organising a keynote lecture and panel discussion with prof. <a href="/AntonioCasilli/">Casilli</a>. He will show how historical global inequalities shape international #digital labour and #data supply chains. 🖱️
Moderator <a href="/michele_mole/">Michele Mole</a>
👉tinyurl.com/2p9bxtzc
#AI

thumb_up_off_alt11

chat_bubble_outline0

repeat10

shareShare

Thomas Pellegrini

@topel290118

2 years ago

Not going to Seoul... Thanks to the reviewers who take care of my CO2 footprint! 🤷

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

DCASE Challenge

@dcase_challenge

2 years ago

📢 DCASE challenge 2024 short task descriptions are out! ⤵️ dcase.community/challenge2024/ Stay tuned for more details.

thumb_up_off_alt25

chat_bubble_outline0

repeat5

shareShare

Thomas Pellegrini

@topel290118

2 years ago

Machine listening people, please consider participating to the audio captioning task of DCASE. A new baseline system is provided: CNext-trans, 28M params, 29.6% SPIDEr-FL score on Clotho-eval dcase.community/challenge2024/… github.com/Labbeti/dcase2… #DCASE #audiocaptioning

thumb_up_off_alt10

chat_bubble_outline0

repeat4

shareShare

Hervé "pyannote" Bredin

@hbredin

2 years ago

#pyannote on iOS 📱/ macOS 💻 anyone? Call for tender has just been published (20 days left) achatpublic.com/sdm/ent2/gen/f… Please share if you'd like to see this happen 🙏

thumb_up_off_alt4

chat_bubble_outline1

repeat4

shareShare

NVIDIA AI Developer

@nvidiaaidev

2 years ago

👀 Exciting advancements in Automated Audio Captioning.✨ The CMU-NVIDIA team has unveiled a groundbreaking approach that leverages multi-agent collaboration and GPU technology to enhance audio-to-text systems. 🔍 Key innovations include: ✅ Multi-encoder fusion: Combining

thumb_up_off_alt101

chat_bubble_outline10

repeat22

shareShare

Samuele Cornell

@samuelecornell

2 years ago

In this paper we show that is possible to create synthetic 2-speakers conversations with TTS and LLMs and fine-tune successfully Whisper for multi-speaker ASR generalizing well to real-world scenarios: arxiv.org/abs/2408.09215 Examples of such synth data: popcornell.github.io/SynthConvASRDe…

thumb_up_off_alt86

chat_bubble_outline6

repeat14

shareShare