Ligeng Zhu (@ligengzhu) Twitter Tweets • TwiCopy

Ligeng Zhu

@ligengzhu

6 months ago

huge congrats! Zihao rocks!

Likes: 3 · Replies: 0 · Retweets: 0

🚀 Fast-dLLM: 27.6× Faster Diffusion LLMs with KV Cache & Parallel Decoding
💥 Key Features 🌟
- Block-Wise KV Cache: Reuses 90%+ attention activations via bidirectional caching (prefix/suffix), enabling 8.1×–27.6× throughput gains with <2% accuracy loss 🔄
-

Likes: 174 · Replies: 8 · Retweets: 34

Ligeng Zhu

@ligengzhu

6 months ago

welcome haocheng and let's build something exciting!

Likes: 8 · Replies: 1 · Retweets: 0

Ligeng Zhu

@ligengzhu

6 months ago

AReaL to train RLs as easy as Boba!

Likes: 3 · Replies: 0 · Retweets: 0

Infini-AI-Lab

@infiniailab

5 months ago

🔥 We introduce Multiverse, a new generative modeling framework for adaptive and lossless parallel generation. 🚀 Multiverse is the first open-source non-AR model to achieve AIME24 and AIME25 scores of 54% and 46% 🌐 Website: multiverse4fm.github.io 🧵 1/n

Likes: 207 · Replies: 2 · Retweets: 76

Zhijian Liu

@zhijianliu_

5 months ago

Accelerating your LLM fine-tuning with SparseLoRA! 🚀

Likes: 52 · Replies: 0 · Retweets: 5

Muyang Li

@lmxyy1999

5 months ago

🚀 #Nunchaku now supports FLUX.1-Kontext-dev! Edit images with just one sentence — style transfer, face swap, and more — now 2–3× faster and using 1/4 VRAM. ✅ Works with ComfyUI & Diffusers 🔗 Demo: svdquant.mit.edu/kontext/ 📂 Code: github.com/mit-han-lab/nu… 🤗 4-bit #SVDQuant

Likes: 26 · Replies: 0 · Retweets: 3

Ligeng Zhu

@ligengzhu

5 months ago

Cheers🍻!

Likes: 2 · Replies: 0 · Retweets: 0

Ligeng Zhu

@ligengzhu

5 months ago

#Grok4 #SUPERGROK grok4-heavy is achieving 100% on AIME'25. tool call / pass@k / multi-runs?

Likes: 12 · Replies: 1 · Retweets: 0

Ligeng Zhu

@ligengzhu

4 months ago

VideoGen with sparsity!

Likes: 4 · Replies: 0 · Retweets: 0

Ligeng Zhu

@ligengzhu

4 months ago

looooooooong RL for video🎉

Likes: 4 · Replies: 0 · Retweets: 0

LMSYS Org

@lmsysorg

4 months ago

🚀Summer Fest Day 4: Turbocharging Vision-Language Models with SGLang + NVILA 4.4× throughput, 2.2× faster response time! We've integrated NVILA into SGLang, enabling high-performance, scalable serving of vision-language models. This unlocks a 4.4× TPS boost and significantly

Likes: 27 · Replies: 1 · Retweets: 15

Ligeng Zhu

@ligengzhu

4 months ago

Empowered by SGLang, NVILA serving now has 4.4x throughput and 2.2x faster response 🚀🚀🚀 Awesome work by Zijian Zhang w/ a lot of help from the SGLang team!

Likes: 17 · Replies: 0 · Retweets: 3

Bolei Zhou

@zhoubolei

4 months ago

NeurIPS Conference This is great! But will you also consider setting up an official satellite location in China, given the fact that so many great NeurIPS papers come from China and so many Chinese researchers couldn't attend the conference due to the US/Canada Visa issue?

Likes: 94 · Replies: 1 · Retweets: 6

LMSYS Org

@lmsysorg

4 months ago

🚀 Summer Fest Day 5: Multiple Token Prediction in SGLang by @Eigen_AI_ and SGLang Team 1.6× throughput, same quality — open-source & production-ready! We’ve integrated MTP into SGLang, unlocking up to 60% higher output throughput for models like DeepSeek V3, with zero quality

Likes: 33 · Replies: 3 · Retweets: 8

Yi Wu

@jxwuyi

4 months ago

Tired of intricate system code for RL training? 🤯 We release AReaL-lite – A lightweight AReaL version for AI researchers! 🚀 #opensource ✨ Algorithm-first design & APIs 🎉 ✨ 80% less code w. 90% of AReaL's full efficiency 🎉 ✨ Customizable agentic RL 🎉 🔗 github.com/inclusionAI/AR…

Likes: 63 · Replies: 3 · Retweets: 22

Eigen AI

@eigen_ai_labs

4 months ago

🚀Founded by four dedicated MIT graduates, Eigen AI is the world's first company focusing on AEI – Artificial Efficient Intelligence, making AI accessible for all. Today OpenAI dropped GPT-OSS. We teamed up with our partners SGLang LMSYS Org and @NVIDIA to deliver open-source

Likes: 67 · Replies: 4 · Retweets: 21

Ryan Hanrui Wang

@hanrui_w

4 months ago

Announcing Eigen AI Eigen AI, the world’s first company dedicated to AEI — Artificial Efficient Intelligence. 🚀 The future of AI is already here; it’s simply not evenly distributed. Our mission is to close that gap by driving radical efficiency so that every person and

Likes: 52 · Replies: 1 · Retweets: 13
