Hongxu (Danny) Yin (@yin_hongxu)'s Twitter Profile
Hongxu (Danny) Yin

@yin_hongxu

Staff Research Scientist, NVIDIA Research | Ph.D., Princeton University | Forbes Top 60 Elite Chinese in North America.

ID: 1207922192916402176

Link: https://hongxu-yin.github.io · Joined: 20-12-2019 07:14:36

96 Tweets

692 Followers

149 Following

Pavlo Molchanov (@pavlomolchanov)'s Twitter Profile Photo

🚀 Exciting news! We’ve just released a new LLM:
Llama-3.1-Nemotron-51B = Llama-70B-Instruct + Block Distillation + NAS + Logit Distillation.
Powered by a single H100 GPU with nearly the same accuracy! ⚡ This gives a 2.2x inference speed-up, with MT Bench 8.99 ➡️ 8.94.
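Roughly, logit distillation trains the smaller student model to match the teacher's softened output distribution (block distillation applies the same matching at the level of individual transformer blocks). A minimal generic sketch, not the actual Nemotron training code; the temperature T and shapes here are illustrative:

```python
import torch
import torch.nn.functional as F

def logit_distillation_loss(student_logits, teacher_logits, T=2.0):
    # KL(teacher || student) on temperature-softened distributions, the
    # standard logit-distillation objective; the actual Nemotron recipe
    # (temperature, weighting, data) is not given in the announcement.
    s = F.log_softmax(student_logits / T, dim=-1)
    t = F.softmax(teacher_logits / T, dim=-1)
    return F.kl_div(s, t, reduction="batchmean") * (T * T)

# Usage with logits of shape (batch, vocab):
loss = logit_distillation_loss(torch.randn(4, 32000), torch.randn(4, 32000))
```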
AK (@_akhaliq)'s Twitter Profile Photo

MaskLLM

Learnable Semi-Structured Sparsity for Large Language Models

discuss: huggingface.co/papers/2409.17…

Large Language Models (LLMs) are distinguished by their massive parameter counts, which typically result in significant redundancy. This work introduces MaskLLM, a learnable…
Pavlo Molchanov (@pavlomolchanov)'s Twitter Profile Photo

🚀 NeurIPS Conference Spotlight! 🥳 Imagine fine-tuning an LLM with just a sparsity mask! In our latest work, we freeze the LLM and use 2:4 structured sparsity to learn binary masks for each linear layer. Thanks to NVIDIA Ampere’s 2:4 sparsity, we can achieve up to 2x compute…
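Concretely, 2:4 semi-structured sparsity means exactly two of every four consecutive weights survive. The sketch below picks survivors by magnitude purely for illustration; the point of MaskLLM is that the mask is learned end-to-end instead, but the resulting structure is the same. Function and variable names are illustrative, not from the released code:

```python
import torch

def apply_2to4_sparsity(weight: torch.Tensor) -> torch.Tensor:
    # Keep the 2 largest-magnitude weights in every group of 4 along the
    # input dimension; MaskLLM instead *learns* which 2 to keep, via a
    # differentiable (Gumbel-softmax) choice over candidate masks.
    out_f, in_f = weight.shape
    assert in_f % 4 == 0, "2:4 sparsity groups the input dim in fours"
    groups = weight.reshape(out_f, in_f // 4, 4)
    keep = groups.abs().topk(k=2, dim=-1).indices
    mask = torch.zeros_like(groups)
    mask.scatter_(-1, keep, 1.0)
    return (groups * mask).reshape(out_f, in_f)

w_sparse = apply_2to4_sparsity(torch.randn(8, 16))
assert (w_sparse.reshape(8, -1, 4) != 0).sum(-1).max() <= 2  # 2 of every 4 survive
```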

Hongxu (Danny) Yin (@yin_hongxu)'s Twitter Profile Photo

Tomorrow we will be hosting the first Efficient DL for Foundation Models workshop at #ECCV2024! It takes place in Brown 3, 2pm-6pm Milan time, with keynotes and posters. Co-organized by NVIDIA, Microsoft Research, MIT, and UCSD. Come and join us!

Song Han (@songhan_mit)'s Twitter Profile Photo

Explore VILA-U: multi-modal tokens in, multi-modal tokens out. A single autoregressive next-token prediction model for both image/video generation and understanding. VILA-U is open-sourced: github.com/mit-han-lab/vi…
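The "multi-modal tokens in, multi-modal tokens out" framing amounts to a single decoder trained with next-token prediction over a joint vocabulary of text tokens and discrete visual tokens. A toy sketch of that idea only; sizes, layer choices, and names are illustrative and not the VILA-U architecture:

```python
import torch
import torch.nn as nn

# One shared vocabulary: text tokens followed by discrete visual tokens
# (e.g. from a vector-quantized image tokenizer). Sizes are illustrative.
TEXT_VOCAB, VISUAL_VOCAB = 32000, 8192
VOCAB = TEXT_VOCAB + VISUAL_VOCAB  # visual ids live at offset TEXT_VOCAB

class UnifiedARModel(nn.Module):
    def __init__(self, d=512, n_layers=4):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, d)
        layer = nn.TransformerEncoderLayer(d, nhead=8, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d, VOCAB)  # one head covers both modalities

    def forward(self, ids):  # ids: (batch, seq) mixing text and visual tokens
        causal = nn.Transformer.generate_square_subsequent_mask(ids.shape[1])
        h = self.blocks(self.embed(ids), mask=causal)
        return self.head(h)  # next-token logits over the joint vocabulary

logits = UnifiedARModel()(torch.randint(0, VOCAB, (2, 16)))  # (2, 16, VOCAB)
```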

Hongxu (Danny) Yin (@yin_hongxu)'s Twitter Profile Photo

Very proud to announce VILA-HD, an ultra-cheap method for VLMs to crack high-resolution tasks! 20x cheaper than current tiling-based methods. Surpasses GPT-4o, Gemini-1.5, and Qwen2 on high-resolution benchmarks. Scales to 8Kx8K resolution. #CVPR2025. Check below for the repository.

Pavlo Molchanov (@pavlomolchanov)'s Twitter Profile Photo

🔥 Vision encoder upgrade: RADIOv2.5 = DFN_CLIP + DINOv2 + SAM + SigLIP + ToMe + multi-res training + teacher loss balancing + smart augmentations, CVPR2025.

Current foundation models have too many limitations: i) tailored for a single task, ii) not flexible on resolution (like…
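The "=" above is agglomerative multi-teacher distillation: one student encoder matched against several frozen teachers, with the per-teacher losses balanced. A toy sketch under that reading; the stand-in encoder, teacher dimensions, and learned softmax balancing are illustrative, and the real recipe additionally involves ToMe, multi-resolution training, and augmentations:

```python
import torch
import torch.nn as nn

class MultiTeacherStudent(nn.Module):
    def __init__(self, d_student=768, teacher_dims=(1024, 1536, 256, 1152)):
        super().__init__()
        # Stand-in student encoder; a real one would be a ViT backbone.
        self.backbone = nn.Linear(3 * 224 * 224, d_student)
        # One lightweight adaptor head per teacher feature space.
        self.adaptors = nn.ModuleList(nn.Linear(d_student, d) for d in teacher_dims)
        self.log_w = nn.Parameter(torch.zeros(len(teacher_dims)))  # loss balancing

def distill_loss(model, images, teacher_feats):
    h = model.backbone(images.flatten(1))
    per_teacher = torch.stack([
        nn.functional.mse_loss(adapt(h), feats)
        for adapt, feats in zip(model.adaptors, teacher_feats)
    ])
    weights = torch.softmax(model.log_w, dim=0)  # balance the teacher losses
    return (weights * per_teacher).sum()
```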
Hongxu (Danny) Yin (@yin_hongxu)'s Twitter Profile Photo

#RSS2025 NaVILA marks a successful attempt at using VILA to drive real-world robotic dogs and humanoids! Fully deployable. Money-saving. Fast inference. Check out our project page: navila-bot.github.io Many more amazing things to come!

Pavlo Molchanov (@pavlomolchanov)'s Twitter Profile Photo

New efficient Hybrid LLMs from @NVIDIA: Nemotron-H! Introducing a family of models combining Mamba-2, Self-Attention & FFNs for 8B, 47B and 56B sizes.

• The 47B model is 3x faster and 1.5x smaller, yet on par with Qwen-72B and Llama-70B
• The hybrid 8B model is 1.8x faster than comparable transformers
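The hybrid layout can be pictured as a pattern string over block types: mostly linear-time Mamba-2 mixers, with a few self-attention blocks and FFNs interleaved, which is where the speed and KV-cache savings come from. A toy sketch of the layout idea only; the pattern, width, and the GRU used as a recurrent stand-in for Mamba-2 are all illustrative, not the published architecture:

```python
import torch.nn as nn

# Layout idea only: "M" = Mamba-2 state-space mixer (stubbed below),
# "A" = self-attention, "F" = FFN. The real Nemotron-H pattern, depth,
# and width differ; this just shows attention appearing sparsely.
PATTERN = "MFMFMFAFMFMFAFMF"

def make_block(kind: str, d: int = 1024) -> nn.Module:
    if kind == "A":
        return nn.MultiheadAttention(d, num_heads=8, batch_first=True)
    if kind == "F":
        return nn.Sequential(nn.Linear(d, 4 * d), nn.GELU(), nn.Linear(4 * d, d))
    # Recurrent stand-in for Mamba-2: like an SSM, it runs in linear time
    # over sequence length and needs no KV cache, unlike self-attention.
    return nn.GRU(d, d, batch_first=True)

layers = nn.ModuleList(make_block(k) for k in PATTERN)
```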
AK (@_akhaliq)'s Twitter Profile Photo

Nvidia just dropped CLIMB on Hugging Face

CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
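Reading the title literally, the loop is: embed and cluster the corpus, then iteratively propose mixture weights over the clusters, score each mixture with a small proxy model, and recenter the search on the best one. A schematic sketch under that reading; train_proxy_and_score is a hypothetical stand-in for "train a small proxy on this mixture and evaluate it", and all hyperparameters are illustrative:

```python
import numpy as np
from sklearn.cluster import KMeans

def climb_sketch(doc_embeddings, train_proxy_and_score, k=16, rounds=3, pool=32):
    # 1) Cluster the corpus in embedding space; each cluster becomes a
    #    mixture component for pre-training data.
    clusters = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(doc_embeddings)
    rng = np.random.default_rng(0)
    best_w, best_score = None, -np.inf
    for _ in range(rounds):
        # 2) Sample candidate mixtures, concentrating around the current best.
        alpha = np.ones(k) if best_w is None else 1.0 + 50.0 * best_w
        for w in rng.dirichlet(alpha, size=pool):
            # 3) Hypothetical callback: train a small proxy model on this
            #    mixture and evaluate it (the expensive step in practice).
            score = train_proxy_and_score(w, clusters)
            if score > best_score:
                best_w, best_score = w, score
    return best_w  # mixture weights over clusters for the full training run
```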
Hongxu (Danny) Yin (@yin_hongxu)'s Twitter Profile Photo

Sadly I will not be attending ICLR in Singapore. We have been researching VILA across the entire system, model, and application stack. Cost-saving. Agile. Capable, yet deployable.

Talk to our colleagues this week at ICLR, and the upcoming CVPR, RSS, MLSys!