Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile
Shinji Watanabe

@shinjiw_at_cmu

I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.

ID: 1433346301626880000

linkhttps://sites.google.com/view/shinjiwatanabe calendar_today02-09-2021 08:29:46

439 Tweet

3,3K Followers

342 Following

まっすー (@ymas0315) 's Twitter Profile Photo

Our multi-speaker ASR paper is now available on CSL😊 We integrate SOTA speech separation (TF-GridNet), self-supervised learning (WavLM), and ASR (Conformer-based joint CTC/AED) models in an end-to-end manner. I really appreciate excellent collaborators! authors.elsevier.com/a/1l5uB39HpSta…

Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile Photo

🚀 New at ASRU’25: The Demo/System/Data Track is now revamped! Accepted papers will be published in the official proceedings & IEEE Xplore. Great chance for industry & applied researchers to share real-world ASR/SLU work 🗓️ Deadline: June 25 🔗 2025.ieeeasru.org/calls/call-for… IEEE ASRU

Pooneh Mousavi (@mousavipooneh) 's Twitter Profile Photo

🚀 We're excited to announce our latest work: "Discrete Audio Tokens: More Than a Survey!" It presents a comprehensive survey and benchmark of audio tokenizers across speech, music, and general audio. preprint: arxiv.org/pdf/2506.10274 website: poonehmousavi.github.io/dates-website/

Siddhant Arora (@sid_arora_18) 's Twitter Profile Photo

New #INTERSPEECH2025, we propose a Chain-of-Thought post-training method to build spoken dialogue systems—generating intelligent responses with good audio quality while preserving speaking styles with just 300h of public conversational data! (1/5) 📜: arxiv.org/abs/2506.00722

New #INTERSPEECH2025, we propose a Chain-of-Thought post-training method to build spoken dialogue systems—generating intelligent responses with good audio quality while preserving speaking styles with just 300h of public conversational data! (1/5)
📜: arxiv.org/abs/2506.00722
Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile Photo

Had a great visit to INESC-ID to kick off a new collaboration with Prof. Alberto Abad! Excited to accelerate our joint research!

Had a great visit to INESC-ID to kick off a new collaboration with Prof. Alberto Abad! Excited to accelerate our joint research!
Minje Kim (@minje_research) 's Twitter Profile Photo

Join the inaugural 2025 Low-Resource Audio Codec (LRAC) Challenge on efficient neural speech codecs for resource-constrained devices! Learn more and sign up today! 👉 lrac.short.gy/call 📅 Registration is open; Challenge runs Aug. 1 – Sep. 30, 2025. #LRAC2025 #ICASSP2026

Paola Garcia (@leibnypaola) 's Twitter Profile Photo

CHiME Challenge ⭐⭐ We are happy to announce the release of the tasks for the 9th CHiME Speech Separation and Recognition Challenge (CHiME-9). ⚡⚡ Please visit the CHiME Challenge website for details chimechallenge.org ⚡⚡

Heiga Zen (全 炳河) (@heiga_zen) 's Twitter Profile Photo

Research Engineer, Tokyo: job-boards.greenhouse.io/deepmind/jobs/… Research Scientist, Tokyo: job-boards.greenhouse.io/deepmind/jobs/… Research Scientist, Bangalore: job-boards.greenhouse.io/deepmind/jobs/…

Heiga Zen (全 炳河) (@heiga_zen) 's Twitter Profile Photo

Google DeepMind 東京チームでは、多言語・多文化・マルチモーダル AI の専門家を募集しています! Gemini などの開発に貢献し、世界中の数十億のユーザに利用されるプロダクトを一緒に作りませんか? Research Engineer job-boards.greenhouse.io/deepmind/jobs/… Research Scientist job-boards.greenhouse.io/deepmind/jobs/…

Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile Photo

やっとピッツバーグに帰ってきました。今回は四週間と長い間日本にいました。その間色々な方と議論ができて本当に楽しかったです!!! 最後に東北大とNHK技研を訪問しました。

やっとピッツバーグに帰ってきました。今回は四週間と長い間日本にいました。その間色々な方と議論ができて本当に楽しかったです!!! 最後に東北大とNHK技研を訪問しました。
Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile Photo

Hi all, we are seeking an ICASSP '26 reviewer in the speech and language processing area. Please consider becoming a reviewer and contributing to our community!

Shinji Watanabe (@shinjiw_at_cmu) 's Twitter Profile Photo

📢 Excited to announce our 2-day workshop on "Foundations of Speech and Audio Foundation Models" at TTI Chicago, happening September 4–5! 🔗 Info & registration: sites.google.com/view/speech-ai… 📝 Poster submissions welcome! Join us for talks, discussions, and community building!

Huck Yang 🇸🇬 ICLR 2025 (@huckiyang) 's Twitter Profile Photo

SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding LLMs, Monday 28, 11 to 12:30 pm aclanthology.org/2025.acl-long.… (with Z. Wan, Y Yu, et al. Shinji Watanabe, Language Technologies Institute | @CarnegieMellon 京都大学情報学研究科データ科学コース, Prof. Cheng, Prof. Chu, Prof. Kurohashi)

SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding LLMs, Monday 28, 11 to 12:30 pm
aclanthology.org/2025.acl-long.…
(with Z. Wan, Y Yu, et al. <a href="/shinjiw_at_cmu/">Shinji Watanabe</a>, <a href="/LTIatCMU/">Language Technologies Institute | @CarnegieMellon</a> <a href="/kyotouniv_data/">京都大学情報学研究科データ科学コース</a>, Prof. Cheng, Prof. Chu, Prof.  Kurohashi)
Chris Donahue (@chrisdonahuey) 's Twitter Profile Photo

Excited to share our beta release of Music Arena, a live evaluation platform for state-of-the-art AI music generation models! 🎧 Listen to the latest models and 🗳️ vote for your favorite ⚔️ music-arena.org ⭐️ github.com/gclef-cmu/musi… 📜 arxiv.org/abs/2507.20900

Excited to share our beta release of Music Arena, a live evaluation platform for state-of-the-art AI music generation models!

🎧 Listen to the latest models and 🗳️ vote for your favorite

⚔️ music-arena.org 
⭐️ github.com/gclef-cmu/musi…
📜 arxiv.org/abs/2507.20900