Julian Salazar (@julianslzr) Twitter Tweets • TwiCopy

Happy to see the first feature powered by Gemini native audio outputs ship out to public - especially since it's MASSIVELY multilingual. Lots more coming soon 😉

thumb_up_off_alt345

chat_bubble_outline17

repeat29

shareShare

New multilingual speech restoration paper out Miipher-2 🚀! The RTF on a TPU is 0.0078: 1 million hours of data can be cleaned in 3 days using just 100 TPUs! Paper: arxiv.org/abs/2505.04457 Demo: google.github.io/df-conformer/m…

thumb_up_off_alt80

chat_bubble_outline3

repeat29

shareShare

Julian Salazar

@julianslzr

6 months ago

iykyk

thumb_up_off_alt8

chat_bubble_outline1

repeat0

shareShare

Google Gemini App

@geminiapp

6 months ago

🎧 Audio Overviews are now available in 45 languages. Learn on-the-go by turning your documents, slides, Deep Research reports and more into podcast-style conversations.

thumb_up_off_alt1,1K

chat_bubble_outline31

repeat155

shareShare

Deedy

@deedydas

6 months ago

Google silently launched a way to describe a speaker's voice "style" in plain English. I've never seen anything like it. It seamlessly does different accents, dialects, tone, and languages. Check out Chinese vs Jamaican and Tamil/English vs Punjabi/English in this clip below!

thumb_up_off_alt3,3K

chat_bubble_outline130

repeat310

shareShare

Julian Salazar

@julianslzr

6 months ago

!!!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Julian Salazar

good girl

Taco Cohen

Julian Salazar

Ankur Bapna

Yuma Koizumi

Julian Salazar

Google Gemini App

Deedy

Julian Salazar