Rex Cheng (@hkchengrex) 's Twitter Profile
Rex Cheng

@hkchengrex

Ph.D. student at @IllinoisCDS. Computer vision and machine learning. ヽ(*゚д゚)ノ

ID: 3253365192

Link: https://hkchengrex.com/

Joined: 23-06-2015 07:34:49

25 Tweets

171 Followers

133 Following

AK (@_akhaliq) 's Twitter Profile Photo

Tracking Anything with Decoupled Video Segmentation paper page: huggingface.co/papers/2309.03… Training data for video segmentation are expensive to annotate. This impedes extensions of end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary settings. To

Gradio (@gradio) 's Twitter Profile Photo

MMAudio just made Sora videos highly engaging and more watchable by adding relevant audio to them😎 Gradio app is up on Hugging Face Spaces: huggingface.co/spaces/hkcheng… Huge kudos to Rex Cheng and MMAudio team 🙌

Blaine Brown (@blizaine) 's Twitter Profile Photo

I love old trains & Google's Veo2 is incredible! Adding audio to the clips with this open-source generative Video-to-Audio model is complete magic! 🪄 🧵👇🔊

ハカセ アイ(Ai-Hakase)🐾最新トレンドAIのためのX 🐾 (@ai_hakase_) 's Twitter Profile Photo

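[MMAudio is now available on Mac!] ✎. FYIG: Amazingly, MMAudio, one of the best AI models of the year, reportedly now runs on Mac! Give it a video clip and it automatically generates music and sound that match the video... it really is like magic!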

Atsushi Tabata (@atsushieeeee) 's Twitter Profile Photo

🌳Just tested MMAudio with a summer forest video! The environmental sound reproduction exceeded my expectations ✨ The rustling leaves and birdsong create an amazing atmosphere. I'm truly impressed with how well it turned out! (1/4)

main (@main_horse) 's Twitter Profile Photo

This example from their paper (pub.sakana.ai/static/paper.p…), which is claimed to have 150x speedup, is actually 3x slower if you bench it...

Rex Cheng (@hkchengrex) 's Twitter Profile Photo

Really cool to see Corridor using MMAudio to make such an entertaining and informative video! They did a great job breaking down how MMAudio works and how it was trained. youtu.be/SLz3NWLyHxg Love to see our work existing outside of arXiv. Sony AI @IllinoisCDS

mi141 (@mi141) 's Twitter Profile Photo

🎉Excited to announce the 1st Workshop on Generative AI for Audio-Visual Content Creation (Gen4AVC) at #ICCV2025! Topics: Vision-to-audio, audio-to-vision, joint audio-visual generation & more. Let’s shape the future of immersive content!🚀 gen4avc.github.io

Takashi Shibuya (@yahshibu) 's Twitter Profile Photo

Excited to announce our Gen4AVC workshop #ICCV2025 ! Join us at #ICCV2025 for talks from amazing speakers & to share your work on audio-visual generation! We call for 4-page extended abstracts. 🗓️Deadline: July 1st, 2025 (23:59, AoE) 🌐More info: gen4avc.github.io
