Jungo Kasai 笠井淳吾
@jungokasai
Co-founder & CTO @kotoba_tech (Hiring Now): "Building Speech Foundation Models." | PhD from @nlpnoah at @UW | IBM PhD Fellow | 孫正義育英財団生 | @Yale Undergraduate
ID: 786071512646377473
https://jungokasai.github.io/ 12-10-2016 05:10:28
521 Tweet
1,1K Followers
409 Following
Really honor to give a talk on-site at my second home! Here's the slide for my pitch about speech foundation models (collaboration with Kotoba Technologies). In the end, it's mostly about RVQ and audio tokens, but trust me it's fun to study speech as NLPer! asahiushio.com/files/2024_07_…
NEW ARENA: Text to Speech Arena for Japanese by Kotoba Technologies🔥 🔉Sound on Outside of English, TTS evaluation is quite scarce. The Arena, allows one to test open source models against the closed source giants. In the leaderboard you can compare open models like Bark, MOE-VITS,
Thank you Kotoba Technologies and special thanks to Jungo Kasai 笠井淳吾 and Noriyuki Kojima! Wonderful and rewarding experience in Tokyo for the summer, surrounded by such a passionate team of talented engineers. Always excited about Kotoba's next release and look forward to keeping in touch!
Kotoba Kotoba Technologies is at full speed! Two months of 100 H100 GPUs took us to: Expressive, controllable Japanese speech generation (voice cloning/diverse styles) End-to-end speech translation with low latency Offline, standalone speech translation This is just the beginning!