Yin-Jyun Luo (@jun_luolo) Twitter Tweets • TwiCopy

Yin-Jyun Luo

@jun_luolo

+ Follow

Generative Audio and Disentangled Representations

PhD researcher @c4dm with @SpotifyResearch | Intern @StabilityAI | Prev. interns @SonyAI_global @AIST_JP

ID: 926799790691696640

linkhttps://yjlolo.github.io/ calendar_today04-11-2017 13:14:24

345 Tweet

385 Followers

496 Following

Yin-Jyun Luo

@jun_luolo

2 years ago

arxiv.org/pdf/2106.05241… "We point out that this run is cherry-picked from over 100 runs with different random initialisation." I really dig they discussed the issue of stability in Section E. It's been a lonely battle against algo. stability in unsupervised disentangling models.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

YCY

@yoyolicoris

2 years ago

Hi ISMIR Conference people! I'm going to present this in the afternoon. Feel free to stop by and come to our poster or ask questions in the slack channel p6-01-yu. See you there!

thumb_up_off_alt12

chat_bubble_outline0

repeat2

shareShare

Keitaro Tanaka

@kakanat1105

2 years ago

Thank you for visiting our poster in the LBD session 🎶 #ISMIR2023

thumb_up_off_alt27

chat_bubble_outline0

repeat1

shareShare

Yin-Jyun Luo

@jun_luolo

2 years ago

2/2 first authored papers accepted to ICASSP! Both on improving disentangled sequential autoencoders applied to musical instrument sounds and speech. Completely unsupervised. Looking forward to 🇰🇷🍖🍶 and of course meeting VAE folks!

thumb_up_off_alt38

chat_bubble_outline1

repeat2

shareShare

Yin-Jyun Luo

@jun_luolo

a year ago

Sad how we audio people are unexposed to the CV folks. We have attempted the challenges listed in this ICML24 openreview.net/pdf?id=AocOA4h… - difficulty in tuning latent dimension - requirement of domain-dependent data augmentation - overheads by auxiliary losses such as mutual info

thumb_up_off_alt13

chat_bubble_outline0

repeat1

shareShare

Yin-Jyun Luo

@jun_luolo

a year ago

Audio codecs are turning into VQ-based voice conversion models with an extra focus on compression. Disentanglement seems to be the sauce.

thumb_up_off_alt12

chat_bubble_outline1

repeat1

shareShare

arXiv Sound

@arxivsound

a year ago

``Self-Supervised Multi-View Learning for Disentangled Music Audio Representations,'' Julia Wilkins, Sivan Ding, Magdalena Fuentes, Juan Pablo Bello, ift.tt/NFIb2yd

thumb_up_off_alt12

chat_bubble_outline1

repeat3

shareShare

Yin-Jyun Luo

@jun_luolo

a year ago

There goes the one proof to my ISMIR presence. It was very nice to catch up w/ the Taiwanese Gang, and it's my honour to be confronted by "why are you still doing disentanglement?" That's right, I will also be presenting DisMix openreview.net/pdf?id=Zhc6ZFm… at the NeurIPS Workshop!

thumb_up_off_alt16

chat_bubble_outline0

repeat0

shareShare

Keitaro Tanaka

@kakanat1105

a year ago

主著論文がAPSIPA Trans.にアクセプトされました🙌 Our paper has been accepted for publication in APSIPA Transactions!!🚀 A big thanks to the co-authors (professors!), reviewers, and everyone who supported this work. Special mention to Yin-Jyun Luo, たいし, and Yoshiaki Bando🙏

thumb_up_off_alt36

chat_bubble_outline0

repeat6

shareShare

Yin-Jyun Luo

@jun_luolo

9 months ago

Pumped to see a comeback of GMVAE among a sea of VQ! openreview.net/forum?id=cuFzE… Speaking of, Wei-Ning's arxiv.org/abs/1810.07217 on TTS has a substantial impact to my research on style transfer via (unsupervised) disentanglement. But it seems overshadowed by his own work HuBERT😅

thumb_up_off_alt22

chat_bubble_outline2

repeat0

shareShare

Kwang Moo Yi

@kwangmoo_yi

4 months ago

Preprint of today: Vavilala et al., "Generative Blocks World: Moving Things Around in Pictures" -- arxiv.org/abs/2506.20703 I have a soft spot for reviving old ideas in modern methods -- block world via primitives now with Diffusion models for generating/editing images.

thumb_up_off_alt76

chat_bubble_outline1

repeat16

shareShare