YenTung (@yentung11) Twitter Tweets • TwiCopy

YenTung

@yentung11

+ Follow

PhD Student in NTU. AI music & audio. I love audio effects, mixing, and mastering. Previously @SonyAI_global, @PositiveGrid, @AcadSinica

ID: 1365187990465613832

linkhttps://ytsrt66589.github.io/ calendar_today26-02-2021 06:32:53

55 Tweet

76 Takipçi

130 Takip Edilen

YenTung

@yentung11

9 months ago

Very cool way to leverage LLM. Upload your music and see the feedback! 🎸🎸

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

YenTung

@yentung11

6 months ago

So amazing.. 🥳🥳🥳

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

"signal processing meets neural nets" is probably my favourite genre of paper, two great examples: Making Convolutional Networks Shift-Invariant Again by Richard Zhang arxiv.org/abs/1904.11486 Alias-Free Generative Adversarial Networks by Karras et al. arxiv.org/abs/2106.12423

thumb_up_off_alt291

chat_bubble_outline9

repeat51

shareShare

arXiv Sound

@arxivsound

5 months ago

``Towards Generalizability to Tone and Content Variations in the Transcription of Amplifier Rendered Electric Guitar Audio,'' Yu-Hua Chen, Yuan-Chiao Cheng, Yen-Tung Yeh, Jui-Te Wu, Jyh-Shing Roger Jang, Yi-Hsuan Yang, ift.tt/6aOj1pr

thumb_up_off_alt10

chat_bubble_outline0

repeat3

shareShare

YenTung

@yentung11

5 months ago

Amazing work by Yu-Hua Chen. A very strong guitar transcription model. Go and check demo page to listen the result !

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

arXiv Sound

@arxivsound

4 months ago

``DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions,'' Chin-Yun Yu, Marco A. Mart\'inez-Ram\'irez, Junghyun Koo, Ben Hayes, Wei-Hsiang Liao, Gy\"orgy Fazekas, Yuki Mitsufuji, ift.tt/I6Zye2M

thumb_up_off_alt9

chat_bubble_outline0

repeat3

shareShare

Yong-Hyun Park

@hagsaeng_bag

4 months ago

When does sampling fail in discrete diffusion models — and how can we fix it? In our work, to appear at #ICLR2025, we show how to improve generation quality from DDMs without additional inference cost, simply by using a better sampling schedule! openreview.net/forum?id=pD6Ti…

thumb_up_off_alt71

chat_bubble_outline1

repeat15

shareShare

YCY

@yoyolicoris

4 months ago

I'm pleased to share my internship project Sony AI, DiffVox, a differentiable effects chain for vocals with hundreds of presets. arXiv: arxiv.org/abs/2504.14735 code: github.com/SonyResearch/d… 🤗: huggingface.co/spaces/yoyolic…

I'm pleased to share my internship project <a href="/SonyAI_global/">Sony AI</a>, DiffVox, a differentiable effects chain for vocals with hundreds of presets.

arXiv: arxiv.org/abs/2504.14735
code: github.com/SonyResearch/d…
🤗: huggingface.co/spaces/yoyolic…

thumb_up_off_alt70

chat_bubble_outline3

repeat11

shareShare

arXiv Sound

@arxivsound

2 months ago

``ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors,'' Junghyun Koo, Marco A. Martinez-Ramirez, Wei-Hsiang Liao, Giorgio Fabbro, Michele Mancusi, Yuki Mitsufuji, ift.tt/ENPUVqh

thumb_up_off_alt18

chat_bubble_outline0

repeat8

shareShare

Alain Riou

@howariou

2 months ago

❌ We don’t need no negative samples ❌ We don’t need no large batches ❌ No modality gap in the classroom Very happy to introduce SLAP, our latest brick in the wall of multimodal SSL 🎶🧠 Joint work with King @Juj_guinot, accepted at #ISMIR2025! 🇰🇷 1/7

thumb_up_off_alt191

chat_bubble_outline7

repeat35

shareShare

Fun-Dwo Tsai

@fundwotsai2001

2 months ago

ICML2025: “MuseControlLite” for multifunctional music generation! Better than ControlNet for time-varying conditions while using 6.75 times fewer trainable parameters. Also supports audio infilling! arxiv.org/abs/2506.18729 github.com/fundwotsai2001… musecontrollite.github.io/web/

thumb_up_off_alt26

chat_bubble_outline2

repeat5

shareShare

Junghyun (Tony) Koo

@junghyun_koo

2 months ago

"Large-Scale Training Data Attribution for Music Generative Models via Unlearning" We explore how machine unlearning can be used for Training Data Attribution (TDA) in large-scale text-to-music diffusion models. 📜 Paper: arxiv.org/abs/2506.18312 Sony AI

thumb_up_off_alt42

chat_bubble_outline2

repeat17

shareShare

Hao-Wen (Herman) Dong 董皓文

@hermanhwdong

2 months ago

🔥Happy to announce that the AI for Music Workshop is coming to #NeurIPS2025! We have an amazing lineup of speakers! We call for papers & demos (due on August 22)! See you in San Diego!🏖️ Chris Donahue Ilaria Manco Akira MAEZAWA Anna Huang McAuley Lab UCSD Zachary Novack NeurIPS Conference

thumb_up_off_alt117

chat_bubble_outline2

repeat31

shareShare

MClem

@mclemcrew

2 months ago

Check out the paper here: arxiv.org/abs/2507.06329 And the website here: mclemcrew.github.io/mixassist-webs… S/O to Ana Marasović for all the leadership, support, and expertise they put into this work! We are so happy with how it turned out and hope it helps the community of music producers!

thumb_up_off_alt10

chat_bubble_outline0

repeat3

shareShare

arXiv Sound

@arxivsound

2 months ago

Qihui Yang, Taylor Berg-Kirkpatrick, Julian McAuley, Zachary Novack, "WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling," arxiv.org/abs/2507.10534

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

neutone

@neutone_ai

a month ago

We have contributed a late breaking demo paper to ISMIR! By exploiting the compressed information dense latents of Neural Audio Codecs, we can perform zero-shot ‘timbre transfer’ using as little as a single source audio sample for reference. 🔥

thumb_up_off_alt38

chat_bubble_outline2

repeat8

shareShare

YenTung

YenTung

YenTung

Sander Dieleman

arXiv Sound

YenTung

arXiv Sound

Yong-Hyun Park

YCY

arXiv Sound

Alain Riou

Fun-Dwo Tsai

Junghyun (Tony) Koo

Hao-Wen (Herman) Dong 董皓文

MClem

arXiv Sound

neutone