William Chen (@chenwanch1)'s Twitter Profile
William Chen

@chenwanch1

PhD Student @LTIatCMU @SCSatCMU | Masters @LTIatCMU | Formerly @TXInstruments | @UCF ‘21

ID: 1403044813562433539

Link: http://wanchichen.github.io | Joined: 10-06-2021 17:42:13

247 Tweets

697 Followers

384 Following

Salah Zaiem (@salah_zaiem)'s Twitter Profile Photo

We are looking for audio and speech generation people in Zurich, Paris, or London to join our team at Google DeepMind. We build cutting-edge speech, music, and audio (also audio-visual) generation capabilities. Reach out to Jason or me if interested. Retweets very appreciated!

Masao (@mmiagshatoy)'s Twitter Profile Photo

Happy to share our #ICLR2025 paper:
"Context-Aware Dynamic Pruning for Speech Foundation Models" 🎉

💡 We introduce context-aware inference-time pruning.
🎯 On Speech Translation (ST), it cuts inference time by 34% (relative) with no drop in BLEU.

📄 openreview.net/forum?id=u2QdC…
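
The thread doesn't include code, so here is a rough, hypothetical sketch of what context-aware inference-time pruning can look like: a small gate scores each encoder layer from a context embedding, and low-scoring layers are skipped at inference. The class, the gating design, and the threshold below are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn

class GatedEncoder(nn.Module):
    """Toy encoder whose layers can be skipped at inference based on a
    context embedding (hypothetical illustration, not the paper's model)."""

    def __init__(self, d_model=256, n_layers=12, n_heads=4, threshold=0.5):
        super().__init__()
        self.layers = nn.ModuleList(
            [nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
             for _ in range(n_layers)]
        )
        self.gate = nn.Linear(d_model, n_layers)  # one keep-score per layer
        self.threshold = threshold

    def forward(self, x, context):
        # context: (batch, d_model) summary of speaker/acoustic conditions
        keep = torch.sigmoid(self.gate(context)).mean(dim=0)  # (n_layers,)
        for score, layer in zip(keep, self.layers):
            if self.training or score >= self.threshold:
                x = layer(x)
            # else: the layer is pruned for this input, saving its compute
        return x

enc = GatedEncoder().eval()
x = torch.randn(2, 100, 256)   # (batch, frames, d_model)
ctx = x.mean(dim=1)            # toy context embedding
print(enc(x, ctx).shape)       # torch.Size([2, 100, 256])
```
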
Shinji Watanabe (@shinjiw_at_cmu)'s Twitter Profile Photo

📢 Introducing VERSA: our new open-source toolkit for speech & audio evaluation!
- 80+ metrics in one unified interface
- Flexible input support
- Distributed evaluation with Slurm
- ESPnet compatible
Check out the details: wavlab.org/activities/202… github.com/wavlab-speech/…
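
VERSA's actual API isn't shown in the tweet, and the sketch below does not claim to reproduce it. As a loose illustration of what "one unified interface" over many metrics means in practice, here is a hypothetical registry pattern; the metric names, signature, and config format are invented for the example.

```python
# Illustrative only: a registry mapping metric names to functions with one
# shared (prediction, reference, sample_rate) signature, so many metrics
# can be configured and run uniformly. Not VERSA's real API.
from typing import Callable, Dict
import numpy as np

METRICS: Dict[str, Callable[[np.ndarray, np.ndarray, int], float]] = {}

def register(name):
    def deco(fn):
        METRICS[name] = fn
        return fn
    return deco

@register("snr")
def snr(pred, ref, sr):
    noise = ref - pred
    return float(10 * np.log10(np.sum(ref**2) / (np.sum(noise**2) + 1e-12)))

@register("mae")
def mae(pred, ref, sr):
    return float(np.mean(np.abs(pred - ref)))

def evaluate(pred, ref, sr, config):
    # config is a list of metric names, mirroring a YAML-style setup.
    return {name: METRICS[name](pred, ref, sr) for name in config}

sr = 16000
ref = np.random.randn(sr)               # toy reference signal
pred = ref + 0.01 * np.random.randn(sr) # toy prediction
print(evaluate(pred, ref, sr, ["snr", "mae"]))
```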

Huck Yang 🇸🇬 ICLR 2025 (@huckiyang)'s Twitter Profile Photo

We are happy that 🦉 OWLS, 18B to 0.25B open ASR/AST limited-data scaling laws, has been accepted to ICML 2025, led by William Chen and Jinchuan Tian (田晋川) from Shinji Watanabe's WAVLab (@CarnegieMellon) and NVIDIA AI.
Models: huggingface.co/collections/es…
Paper: arxiv.org/pdf/2502.10373
Deepspeed ESPNet:

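To make the scaling-law idea concrete, the toy fit below uses a saturating power law, a common functional form in such studies; the form is only an assumption about what OWLS fits, and the (model size, WER) points are synthetic, invented for the demo. See the paper for the real measurements.

```python
import numpy as np
from scipy.optimize import curve_fit

def power_law(n, a, alpha, c):
    # Saturating power law: error falls as n^-alpha toward a floor c.
    return a * n ** (-alpha) + c

# Synthetic (parameter count in billions, WER) points, invented for the demo.
n_params = np.array([0.25, 0.5, 1.0, 2.0, 4.0, 9.0, 18.0])
wer = np.array([14.1, 12.3, 10.8, 9.7, 8.9, 8.2, 7.8])

(a, alpha, c), _ = curve_fit(power_law, n_params, wer, p0=[5.0, 0.5, 7.0])
print(f"WER(N) ≈ {a:.2f} * N^(-{alpha:.2f}) + {c:.2f}")
print(f"Extrapolated WER at 36B params: {power_law(36.0, a, alpha, c):.2f}")
```
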
Andrew Rouditchenko 🇺🇦 (@arouditchenko)'s Twitter Profile Photo

Do you really need audio to fine-tune your Audio LLM? 🤔 Answer below:
Introducing Omni-R1, a simple GRPO fine-tuning method for Qwen2.5-Omni on audio question answering. It sets new state-of-the-art accuracies on the MMAU benchmark for Audio LLMs.
arxiv.org/abs/2505.09439
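
For readers unfamiliar with GRPO, the group-relative advantage it is built on is easy to sketch. The snippet below is a generic illustration under assumed 0/1 correctness rewards, not Omni-R1's actual training code; the full method also involves a clipped policy-gradient loss and reward design omitted here.

```python
# Generic sketch of the group-relative advantage at the heart of GRPO
# (Group Relative Policy Optimization); not Omni-R1's training code.
import torch

def grpo_advantages(rewards: torch.Tensor) -> torch.Tensor:
    """rewards: (num_prompts, group_size) scores for sampled answers.
    Each answer's advantage is its reward normalized within its own group,
    so no learned value function (critic) is needed."""
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + 1e-6)

# Toy example: 2 audio questions, 4 sampled answers each, with assumed
# 0/1 correctness rewards (Omni-R1's actual reward design may differ).
rewards = torch.tensor([[1.0, 0.0, 0.0, 1.0],
                        [0.0, 0.0, 1.0, 0.0]])
print(grpo_advantages(rewards))
# Correct answers get positive advantage, incorrect ones negative; a
# PPO-style clipped policy-gradient loss then upweights their tokens.
```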

Shuichiro Shimizu / 清水 周一郎 (@cromz22)'s Twitter Profile Photo

Excited to share our survey paper accepted to #ACL2025NLP Findings: When Large Language Models Meet Speech: A Survey on Integration Approaches, by Zhengdong Yang, Shuichiro Shimizu, Yahan Yu, and Chenhui Chu. 1/5

William Chen (@chenwanch1)'s Twitter Profile Photo

7/7 papers accepted to #Interspeech2025 🎉 Lots of interesting work from my fantastic co-authors on long-form processing, multilingualism, and multi-modal foundation models. See y’all in Rotterdam 🇳🇱

William Chen (@chenwanch1)'s Twitter Profile Photo

I’ll be interning at Adobe Research in San Francisco this summer, working on audio generation. HMU if you’re in the area and want to chat about speech / audio AI!

jiatongshi (@jiatongshi)'s Twitter Profile Photo

🚀 Introducing Uni-VERSA: a unified model for multi-dimensional speech evaluation (naturalness, intelligibility, noise, prosody & more).
⚡ 109× faster than native VERSA metric computation
🤗 Pretrained models + Colab demo
🧰 VERSA integration coming!
🔗 huggingface.co/collections/es…
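
The tweet doesn't describe the architecture, but a speculative minimal sketch of a unified multi-metric evaluator (shared encoder, one regression head per metric; every name below is assumed, not the released model) shows why a single forward pass can beat running each metric's native implementation separately.

```python
# Speculative sketch, not Uni-VERSA's released architecture: one shared
# speech encoder feeding a regression head per evaluation dimension.
import torch
import torch.nn as nn

class MultiMetricEvaluator(nn.Module):
    def __init__(self, d_model=256,
                 metrics=("naturalness", "intelligibility", "noise", "prosody")):
        super().__init__()
        self.encoder = nn.GRU(80, d_model, batch_first=True)  # stand-in encoder
        self.heads = nn.ModuleDict({m: nn.Linear(d_model, 1) for m in metrics})

    def forward(self, feats):
        # feats: (batch, frames, 80), e.g. log-mel features
        _, h = self.encoder(feats)    # h: (1, batch, d_model)
        pooled = h[-1]                # (batch, d_model) utterance embedding
        return {m: head(pooled).squeeze(-1) for m, head in self.heads.items()}

scores = MultiMetricEvaluator()(torch.randn(2, 300, 80))
print({k: v.shape for k, v in scores.items()})
# One forward pass yields every metric, which is where a large speedup over
# invoking each metric's native pipeline would come from.
```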

Masao (@mmiagshatoy)'s Twitter Profile Photo

🚀 Happy to share our #INTERSPEECH2025 paper: using speaker & acoustic context, we dynamically adjust model paths, resulting in a 25.7% relative BLEU improvement in speech translation. We also analyze how context influences model behavior.
📜 Paper: arxiv.org/abs/2505.18860

jiatongshi (@jiatongshi)'s Twitter Profile Photo

🔊 New release: #ARECHO -> Autoregressive Evaluation via Chain-based Hypothesis Optimization.
• 87-metric coverage in one model 🧮
• Dynamic classifier chain 🤝
• Unified tokenization 🧩
• Confidence-aware decoding 🛡️
Built on #UniVERSA, heading to #VERSA. More ↓
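
As a hedged reading of those bullet points, a dynamic classifier chain with confidence-aware decoding might look roughly like the sketch below: metrics are predicted one at a time, each conditioned on the input plus the metrics already predicted, with the next metric chosen greedily by model confidence. Everything here is hypothetical: the module, the variance-based confidence, and the greedy ordering are assumptions, not ARECHO's code.

```python
# Hypothetical sketch of a dynamic, confidence-ordered classifier chain;
# not ARECHO's implementation.
import torch
import torch.nn as nn

class DynamicChain(nn.Module):
    def __init__(self, d_in=256, n_metrics=87):
        super().__init__()
        # Each step sees the input embedding plus one slot per metric,
        # filled in as predictions are made (zeros while unknown).
        self.step = nn.Linear(d_in + n_metrics, n_metrics * 2)  # mean, logvar
        self.n_metrics = n_metrics

    @torch.no_grad()
    def decode(self, x):
        known = torch.zeros(x.size(0), self.n_metrics)
        remaining = set(range(self.n_metrics))
        preds = {}
        while remaining:
            out = self.step(torch.cat([x, known], dim=-1))
            mean, logvar = out.chunk(2, dim=-1)
            # Confidence-aware ordering: commit to the still-unknown metric
            # the model is most certain about (lowest predicted variance).
            conf = (-logvar).mean(dim=0)
            idx = max(remaining, key=lambda i: conf[i].item())
            preds[idx] = mean[:, idx]
            known[:, idx] = mean[:, idx]   # condition later steps on it
            remaining.remove(idx)
        return preds

chain = DynamicChain()
preds = chain.decode(torch.randn(2, 256))  # (batch, input embedding)
print(len(preds))  # all 87 metrics, predicted in a confidence-driven order
```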