Huck Yang 🇸🇬 ICLR 2025 (@huckiyang) Twitter Tweets • TwiCopy

Sreyan Ghosh

5 months ago

We at NVIDIA and GAMMA UMD are excited to release Audio Flamingo 3, the most powerful, open, and capable large audio-language model to date!
Paper: arxiv.org/abs/2507.08128
Open-source model, code, and data: research.nvidia.com/labs/adlr/AF3/
Try it out here: huggingface.co/spaces/nvidia/…

Likes: 19 · Replies: 1 · Reposts: 8

William Chen

@chenwanch1

5 months ago

I’ll be presenting this Thursday 4:30pm at the West hall, poster 418. Drop by to learn more about our latest experience in burning compute!

Likes: 8 · Replies: 0 · Reposts: 5

Piotr Żelasko

@piotrzelasko

5 months ago

Canary-Qwen-2.5B is our latest, and the first of its kind, ASR model from NVIDIA NeMo team.
🏆 1st place on Open ASR Leaderboard with WER 5.63%
🔥 RTFx=418 on A100 GPU - remarkably fast for its size
💰 CC-BY-4.0 license, commercial-friendly
🌎 English-only

Likes: 170 · Replies: 5 · Reposts: 29
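
For readers unfamiliar with the two metrics quoted in the Canary-Qwen-2.5B post above, here is a minimal sketch of how they are commonly defined: WER as word-level edit distance divided by the number of reference words, and RTFx as seconds of audio processed per second of compute. The function names `word_error_rate` and `rtfx` are hypothetical, and the sketch assumes plain whitespace tokenization on a single utterance. It is not the leaderboard's actual scoring pipeline, which may normalize text and aggregate over whole test sets.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the processed reference prefix and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]  # deleting all i reference words to reach an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = cur[j - 1] + 1   # extra word in hypothesis
            cur.append(min(substitution, deletion, insertion))
        prev = cur
    return prev[-1] / len(ref)


def rtfx(audio_seconds: float, processing_seconds: float) -> float:
    """Inverse real-time factor: seconds of audio handled per second of compute."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


if __name__ == "__main__":
    # One substitution (sat -> sit) and one deletion (the) over six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
    # One hour of audio at RTFx = 418 would take 3600 / 418 seconds of compute.
    print(3600 / 418)
```

Under this definition, an RTFx of 418 means one hour of audio is transcribed in roughly 8.6 seconds of compute.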

Shinji Watanabe

@shinjiw_at_cmu

5 months ago

Hi all, we are seeking an ICASSP '26 reviewer in the speech and language processing area. Please consider becoming a reviewer and contributing to our community!

Likes: 16 · Replies: 0 · Reposts: 8

Huck Yang 🇸🇬 ICLR 2025

@huckiyang

5 months ago

NeKo (ネコ) aims to be your pet model to work with ASR/AST/OCR NVIDIA AI Developer Yenting Lin et al. - back in 2020, the conformer-transducer was dominant; people were not very interested in working on ASR-LM (i.e., the internal LM of ASR and contextual biasing were popular) appreciate Andreas

Likes: 26 · Replies: 1 · Reposts: 3

Xiangming Gu @ ICLR 2025

@gu_xiangming

5 months ago

I noticed that OpenAI added learnable bias to attention logits before softmax. After softmax, they deleted the bias. This is similar to what I have done in my ICLR2025 paper: openreview.net/forum?id=78Nn4…. I used learnable key bias and set corresponding value bias zero. In this way,

Likes: 1.1K · Replies: 22 · Reposts: 166
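
The mechanism described in Xiangming Gu's post above can be illustrated with a small sketch. It assumes the learnable bias is a single scalar logit (called `sink_logit` here, a hypothetical name) appended to one query's attention logits before softmax, with the matching probability discarded afterwards. The post does not give the exact parametrization used in either work, so this is only an illustration of the idea, not either implementation. The second function shows why this matches adding an extra key slot whose value vector is zero: the slot absorbs probability mass but contributes nothing to the output.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def attend_with_dropped_bias(scores, values, sink_logit):
    """Append a bias logit before softmax, then drop its probability.

    scores: attention logits of one query against each key
    values: one value vector (list of floats) per key
    sink_logit: hypothetical learnable scalar (an assumption of this sketch)
    """
    probs = softmax(scores + [sink_logit])
    kept = probs[:-1]  # the bias slot is deleted after softmax
    dim = len(values[0])
    output = [sum(p * v[d] for p, v in zip(kept, values)) for d in range(dim)]
    return kept, output


def attend_with_zero_value_slot(scores, values, sink_logit):
    """Same computation, written as an extra key whose value vector is zero."""
    probs = softmax(scores + [sink_logit])
    dim = len(values[0])
    padded_values = values + [[0.0] * dim]
    return [sum(p * v[d] for p, v in zip(probs, padded_values)) for d in range(dim)]


if __name__ == "__main__":
    scores = [1.0, 2.0, 0.5]
    values = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    kept, out_a = attend_with_dropped_bias(scores, values, sink_logit=0.0)
    out_b = attend_with_zero_value_slot(scores, values, sink_logit=0.0)
    print(sum(kept))     # less than 1: some mass went to the dropped slot
    print(out_a, out_b)  # the two formulations agree
```

In both versions the weights over the real tokens sum to less than one, so a query is not forced to spend its full attention budget on actual tokens.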

Siddhant Arora

@sid_arora_18

4 months ago

3. OpusLM: Unified Speech-Language Models (led by Jinchuan Tian (田晋川)) Open family of speech language models scaled up to 7B. Poster — Wed, 8:30–10:30 | Foyer 3.2 arxiv.org/abs/2506.17611 Grateful to all collaborators who made this possible!

Likes: 5 · Replies: 1 · Reposts: 4

Huck Yang 🇸🇬 ICLR 2025

@huckiyang

4 months ago

Nicely done: the “beyond end2end 🗣️ ASR: integrating long context acoustics & linguistics” tutorial w/ Shinji Watanabe, Taejin Park (NVIDIA), and Kyu Han (Oracle) at INTERSPEECH 2025. Happy to cover the semantics parts during my hotel coding vibe 🤣 | 📖 slides: docs.google.com/presentation/d…

Likes: 45 · Replies: 0 · Reposts: 6

The Economist

@theeconomist

4 months ago

More than ever, semiconductors hold the key to the 21st century. Yet Donald Trump’s approach to chipmaking is self-defeating. To remain the world’s foremost technological power, America needs its friends econ.st/4mOcyYZ

Likes: 709 · Replies: 43 · Reposts: 203

Shinji Watanabe

@shinjiw_at_cmu

4 months ago

Our work on OWSM v4 received the Best Student Paper Award at #Interspeech2025! 🏆🎉 Huge congratulations to the team! 🚀👏 I’m especially happy to see our open science efforts for speech foundation models recognized by the community. 🙌 🔗 isca-archive.org/interspeech_20…

Likes: 102 · Replies: 5 · Reposts: 21

NVIDIA Robotics

@nvidiarobotics

4 months ago

The NVIDIA Jetson Thor is here. 🎉 This powerful new robotics computer is designed to power the next generation of general and #HumanoidRobots in manufacturing, logistics, construction, healthcare, and beyond. It’s a massive leap forward for physical AI. Early adopters

Likes: 1.1K · Replies: 66 · Reposts: 293

Ryo Hachiuma

@rhachiuma

4 months ago

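A new preprint is out. By reframing video segmentation, one of the most fundamental tasks in video perception, as an autoregressive model, we obtained a model with a range of advantages, including a unified architecture and test-time scaling.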

Likes: 14 · Replies: 0 · Reposts: 4

KeisukeImoto

@keisukeimoto

4 months ago

"Audio-Centric AI: Towards Real-World Multimodal Reasoning and Application Use Cases" has been accepted as an AAAI 2026 workshop🎉 We're looking forward to seeing your contributions, with submissions due by 24 October 2025. sites.google.com/view/audio-aaa…

Likes: 13 · Replies: 0 · Reposts: 3

Yuntian Deng

@yuntiandeng

3 months ago

Ah, mystery since March 2023 finally solved 🦄👇 x.com/thinkymachines…

Likes: 30 · Replies: 1 · Reposts: 4

NVIDIA Newsroom

@nvidianewsroom

3 months ago

Together, NVIDIA and OpenAI are expanding the frontier of AI — transforming nearly every industry and unlocking use cases once unimaginable. “There’s no partner but NVIDIA that can do this at this kind of scale, at this kind of speed,” said OpenAI CEO Sam Altman.

Likes: 1.1K · Replies: 135 · Reposts: 319

Ali Hatamizadeh

@ahatamiz1

3 months ago

Are you ready for web-scale pre-training with RL? 🚀 🔥 New paper: RLP: Reinforcement Learning Pre‑training. We flip the usual recipe for reasoning LLMs: instead of saving RL for post‑training, we bring exploration into pretraining. Core idea: treat chain‑of‑thought as an

Likes: 595 · Replies: 17 · Reposts: 88

Phillip Isola

@phillip_isola

2 months ago

Over the past year, my lab has been working on fleshing out theory/applications of the Platonic Representation Hypothesis. Today I want to share two new works on this topic:
Eliciting higher alignment: arxiv.org/abs/2510.02425
Unpaired rep learning: arxiv.org/abs/2510.08492
1/9

Likes: 629 · Replies: 8 · Reposts: 108

Omar Khattab

@lateinteraction

2 months ago

btw Alex is a second-month PhD student; he did this work in 4 weeks i have my suspicions that Alex has secret recursive Alexes that do his work for him, but i haven't been able to confirm that haha really fun post on recursive LMs with interesting trace examples, check it out!

Likes: 1.1K · Replies: 26 · Reposts: 64