Sreyan Ghosh (@sreyang) Twitter Tweets • TwiCopy

GAMMA UMD

5 months ago

🚀 Audio General Intelligence (AGI) is no longer a dream — it’s here. Introducing Audio Flamingo 3 — open-source, multimodal, and groundbreaking. It listens. It understands. It reasons across sound and language. 💥 Code, weights, datasets, paper — all open. 📄Paper:

Likes: 5 · Replies: 0 · Reposts: 2

Bony Bean

@bonybean

5 months ago

NVIDIA Just Released Audio Flamingo 3: An Open-Source Model Advancing Audio General Intelligence: ift.tt/hzl1OsK

Likes: 1 · Replies: 0 · Reposts: 1

Marktechpost AI Research News ⚡

@marktechpost

5 months ago

NVIDIA Releases Audio Flamingo 3: An Open-Source Model Advancing Audio General Intelligence NVIDIA’s Audio Flamingo 3 (AF3) is a fully open-source large audio-language model that significantly advances the field of Audio General Intelligence. Unlike earlier systems focused on

Likes: 27 · Replies: 1 · Reposts: 11

Niels Rogge

@nielsrogge

5 months ago

Open-source audio scene is quite on 🔥 lately!
- kyutai STT, TTS modules and Unmute fully open-sourced
- NVIDIA drops 3 models: Parakeet (beats Whisper), Audio Flamingo 3 and Canary-Qwen-2.5B (new SOTA on Hugging Face leaderboard)
- Mistral AI released 3B and 24B Voxtral

Likes: 594 · Replies: 12 · Reposts: 85

Sakib

@zsakib_

5 months ago

zsxkib/audio-flamingo-3 from NVIDIA a chain-of-thought audio language model (that's small+fast) on Replicate you can upload an mp3 and ask: > what instruments do you hear?🙉 > transcribe any speech you hear🗣️ > please describe the audio in detail🎨 > answer the question💬

Likes: 17 · Replies: 1 · Reposts: 3

Sakib

@zsakib_

5 months ago

nvidia's audio-flamingo-3 in action (sound on 🔇→🎶) > upload an audio clip saying "what are the names of some famous actors who started their careers on broadway"🎭 > prompt "answer" TIL Tom Hanks was on broadway

Likes: 16 · Replies: 0 · Reposts: 3

NVIDIA AI Developer

@nvidiaaidev

5 months ago

🎶 Meet Audio-Flamingo 3 – a fully open LALM trained on sound, speech, and music datasets. 🎶 Handles 10-min audio, long-form text, and voice conversations. Perfect for audio QA, dialog, and reasoning. On Hugging Face ➡️ huggingface.co/nvidia/audio-f… From #NVIDIAResearch.

Likes: 226 · Replies: 6 · Reposts: 64

naveen manwani

@naveenmanwani17

5 months ago

🚨Paper Alert 🚨 ➡️Paper Title: Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models 🌟Few pointers from the paper 🎯Authors of this paper presented “Audio Flamingo 3 (AF3)”, a fully open state-of-the-art (SOTA) large audio-language model

Likes: 1 · Replies: 0 · Reposts: 1

ハカセアイ(Ai-Hakase)🐾最新トレンドＡＩのためのＸ 🐾

@ai_hakase_

5 months ago

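[A revolution in AI that listens! Audio understanding advances with NVIDIA Audio Flamingo 3 (AF3)!] NVIDIA has announced a groundbreaking technology, "Audio Flamingo 3 (AF3)"! ✨ It is a state-of-the-art open-source AI model that understands speech, sound, and music in an integrated way 👂💡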

Likes: 7 · Replies: 1 · Reposts: 1

Banghua Zhu

@banghuaz

5 months ago

That's exactly why I'm excited about the unique position of the post-training team at NVIDIA. We’re not just releasing open-weight models — we fully open source the data, code, and technical details. Small team, moving fast. The competition is fierce, and Chinese open model

Likes: 383 · Replies: 7 · Reposts: 27

Bryan Catanzaro

@ctnzr

4 months ago

We've been making a lot of progress on LLM pretraining in NVFP4: developer.nvidia.com/blog/nvfp4-tra…

Likes: 155 · Replies: 2 · Reposts: 23

Rabeeh Karimi

@karimirabeeh

4 months ago

We just released Nemotron-CC-Math 🚀 Equations on the web aren't just LaTeX: they're in MathML, <pre> tags, inline, even images. Code shows up in just as many ways. Most parsers drop it. Nemotron-CC-Math (133B tokens) reprocesses CommonCrawl math pages to capture math equations and code reliably

Likes: 145 · Replies: 3 · Reposts: 20

Bryan Catanzaro

@ctnzr

4 months ago

As part of Nemotron, we're releasing a new Math dataset, made by rendering webpages using Lynx and then using an LLM to rewrite the result into LaTeX. Our models got much better at math when we started using this dataset. We hope it's helpful to the community. 💚

Likes: 226 · Replies: 10 · Reposts: 34

Arushi Goel

@goelarushi27

3 months ago

Audio Flamingo 3 made it to a spotlight at #NeurIPS2025. See you in San Diego! #Nvidia #AGI

Likes: 17 · Replies: 0 · Reposts: 2

HanRong YE

@leoyerrrr

2 months ago

Off to ICCV! Also, we have an omni-modal LLM reveal coming next Monday… straight from Hawaiiiii 🌴

Likes: 34 · Replies: 1 · Reposts: 4

HanRong YE

@leoyerrrr

2 months ago

OmniVinci is now #1 paper on Huggingface!!! 🤗 Building omni-modal LLMs is MORE than just mixing tokens 😉 At @NVIDIA, we explored deeper possibilities in building truly omni-modal systems — leading to OmniVinci-9B, which introduces three key innovations: - OmniAlignNet – a

Likes: 143 · Replies: 11 · Reposts: 27

kyutai

@kyutai_labs

2 months ago

1/2 We’re releasing an in-depth tutorial on neural audio codecs, the secret sauce that makes it possible for audio LLMs to not sound like a horror movie:

Likes: 424 · Replies: 13 · Reposts: 54

Shay Boloor

@stocksavvyshay

2 months ago

$NVDA JUST MADE HISTORY AS THE FIRST $5T COMPANY

Likes: 4.4K · Replies: 179 · Reposts: 702
