Rafael Valle
@rafaelvalleart
Research Manager and Scientist at NVIDIA.
UC Berkeley alumnus.
Love, music, set and setting!
ID: 615836062
http://rafaelvalle.github.io 23-06-2012 05:26:22
159 Tweets
1.1K Followers
180 Following
Do you work on audio synthesis and need state-of-the-art vocoders? BigVGAN v2 is out! BigVGAN v2 is state-of-the-art in quality, faster, and has commercially friendly checkpoints at 44, 24 and 22 kHz! By the way, it again tops the vocoding leaderboard! paperswithcode.com/sota/speech-sy…
We are presenting Audio Flamingo at the ICML Conference at 11:30 am Tuesday, Hall C 4-9 #2803. Come chat with us about the latest developments in audio understanding and synthesis! In preparation for ICML, we made this demo to highlight Audio Flamingo's capabilities. youtube.com/watch?v=ucttuS…
Audio Flamingo 2 beats GPT-4o, Gemini 2.0 & Phi-4M on 20+ benchmarks, but its real superpower? Emergent abilities like knowing that a drum track made of mechanical sounds is unusual: research.nvidia.com/labs/adlr/AF2/ Checkpoints for Synthetic Data Generation? Yes! github.com/NVIDIA/audio-f…