Julian Salazar (@julianslzr)'s Twitter Profile
Julian Salazar

@julianslzr

Senior Research Scientist at @GoogleDeepMind; audio LLMs, TTS & dialogue for Gemini, Project Astra, etc. 🤖 🗣️ prev. @AmazonScience @HarvardMath 🇨🇦 🇵🇭

ID: 1605763477

Link: https://julianslzr.com · Joined: 19-07-2013 11:39:01

376 Tweets

1.1K Followers

479 Following

Taco Cohen (@tacocohen)

Imagine inventing attention and getting the runner-up test of time award 😂 🥈 Congrats to all first and second place winners!

Ankur Bapna (@ankurbpn)

Happy to see the first feature powered by Gemini native audio outputs ship out to public - especially since it's MASSIVELY multilingual. Lots more coming soon 😉

Yuma Koizumi (@yuma_koizumi)

New multilingual speech restoration paper out Miipher-2 🚀! The RTF on a TPU is 0.0078: 1 million hours of data can be cleaned in 3 days using just 100 TPUs! Paper: arxiv.org/abs/2505.04457 Demo: google.github.io/df-conformer/m…

Google Gemini App (@geminiapp)

🎧 Audio Overviews are now available in 45 languages. Learn on-the-go by turning your documents, slides, Deep Research reports and more into podcast-style conversations.

Deedy (@deedydas)

Google silently launched a way to describe a speaker's voice "style" in plain English. I've never seen anything like it. It seamlessly does different accents, dialects, tone, and languages. Check out Chinese vs Jamaican and Tamil/English vs Punjabi/English in this clip below!