Lianmin Zheng (@lm_zheng) 's Twitter Profile
Lianmin Zheng

@lm_zheng

Member of technical staff @xAI | Prev: Ph.D. @UCBerkeley, Co-founder @lmsysorg

ID: 952696452173594625

Website: http://lmzheng.net/ · Joined: 15-01-2018 00:18:29

343 Tweets

11.11K Followers

563 Following

verl project (@verl_project) 's Twitter Profile Photo

We will present latest updates of verl at #ICLR2025:
- recent RL recipes (DAPO, etc)
- RL with tool calling & multi-turn
- full sglang integration (with <a href="/lmsysorg/">LMSYS Org</a> )
- large scale optimizations, and many more

Come join us!
LMSYS Org (@lmsysorg) 's Twitter Profile Photo

Excited to co-host the "Frontiers of Generative AI" afterparty with Abaka AI at #ICLR2025! 🚀
Come connect with fellow researchers and enthusiasts from LMSYS Org and beyond.
🗓️ When: April 26th (Sat), 18:30 - 21:30
🗺️ Where: 10-min from Singapore EXPO (Register for exact details)
LMSYS Org (@lmsysorg) 's Twitter Profile Photo

The SkyPilot team's benchmark of SGLang and vLLM shows that SGLang consistently outperforms, with a 30% performance advantage. We will continue to optimize. Cheers!

Ebby Amir (@ebbyamir) 's Twitter Profile Photo

Introducing Grok Vision, multilingual audio, and realtime search in Voice Mode. Available now. Grok habla español Grok parle français Grok Türkçe konuşuyor グロクは日本語を話す ग्रोक हिंदी बोलता है

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Around this time 2 years ago, the community helped us launch the very first Arena leaderboard!

Today we’re publishing a blog to celebrate everything we’ve built together on LMArena! 🥳👏

Highlights:
☑️ 3M+ community votes
🤖 400+ models ranked across text, vision,
Junyang Lin (@justinlin610) 's Twitter Profile Photo

Thanks to LMSYS Org for the support with SGLang! They have been dedicated to helping us deploy large MoE models and improve inference efficiency. 🌊

Biao He (@hebiao064) 's Twitter Profile Photo

Thrilled to share my latest blog post (co-authored with Qingquan (QQ) Song) on our open-source work integrating the FlashAttention backend end-to-end in SGLang, now enabled by default in the latest version! 😎 Dive into the details here: hebiao064.github.io/fa3-attn-backe… #LLM #SGLang

Hao AI Lab (@haoailab) 's Twitter Profile Photo

🚨 New Challenger: Grok joins the Game Arena Benchmark! We evaluated Grok3-mini-beta (thinking) on four games: 🧩 2048 | 🧱 Sokoban | 🍬 Candy Crush | 🎮 Phoenix Wright. With fast progress, it’s already comparable to top models like OpenAI’s O1, previous O3-mini, and

LMSYS Org (@lmsysorg) 's Twitter Profile Photo

🚀 Breaking: SGLang provides the first open-source implementation to serve <a href="/deepseek_ai/">DeepSeek</a> V3/R1 models with large-scale expert parallelism and prefill-decode disaggregation on 96 GPUs.
It nearly matches the throughput reported by the official DeepSeek blog, achieving 52.3K input
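The prefill-decode disaggregation mentioned above can be sketched at a high level. This is a toy routing model, not SGLang's actual implementation; the pool sizes, the `GPUPool` class, and the `route` helper are all hypothetical, chosen only to illustrate why the two phases are served by disjoint GPU pools:

```python
# Toy sketch of prefill-decode disaggregation (NOT the real SGLang code):
# prompt processing (prefill) and token generation (decode) run on disjoint
# GPU pools, so each phase can batch and scale independently.
from dataclasses import dataclass, field

@dataclass
class GPUPool:
    name: str
    gpu_ids: list
    queue: list = field(default_factory=list)

    def submit(self, request_id):
        self.queue.append(request_id)

def route(request_id, phase, prefill_pool, decode_pool):
    """Hand each request phase to the matching pool."""
    pool = prefill_pool if phase == "prefill" else decode_pool
    pool.submit(request_id)
    return pool.name

# Hypothetical split of 96 GPUs between the two pools.
prefill = GPUPool("prefill", gpu_ids=list(range(0, 48)))
decode = GPUPool("decode", gpu_ids=list(range(48, 96)))

assert route("req-1", "prefill", prefill, decode) == "prefill"
assert route("req-1", "decode", prefill, decode) == "decode"
```

The design point this illustrates: prefill is compute-bound and decode is latency-bound, so separating them avoids one phase's batching strategy interfering with the other's.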
zhyncs (@zhyncs42) 's Twitter Profile Photo

MLSys 2025 is coming up! Want to meet the developers behind FlashInfer, XGrammar, and SGLang LMSYS Org in person? Join us for the Happy Hour on May 12—we’d love to see you there! lu.ma/dl99yjoe

SkyPilot (@skypilot_org) 's Twitter Profile Photo

E2E guide to self-host multi-node Llama 4 with one command, using SkyPilot, SGLang (LMSYS Org), and `llm` (Simon Willison). ▸ Spin up SGLang on 8x H100 (single and multi-node) ▸ Perf numbers included: token/s, TTFT, ITL ▸ Production-readiness: auth, HTTPS ▸ Tooling: easy
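A minimal sketch of the first step (spinning up SGLang on a single 8x H100 node), assuming `sglang.launch_server` with `--model-path` and `--tp` flags and the `meta-llama/Llama-4-Scout-17B-16E-Instruct` repo name; verify exact flag names and the model identifier against your installed SGLang version and the linked guide:

```shell
# Hypothetical launch command; check flag names against your SGLang version.
# --tp 8 shards the model across the node's 8 GPUs (tensor parallelism).
python -m sglang.launch_server \
  --model-path meta-llama/Llama-4-Scout-17B-16E-Instruct \
  --tp 8 \
  --host 0.0.0.0 --port 30000
```

For the multi-node case, the guide above layers SkyPilot on top so the same command is provisioned and launched across machines with one CLI invocation.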

Yixin Dong (@yi_xin_dong) 's Twitter Profile Photo

We are hosting a happy hour with LMSYS Org at #mlsys2025! Join us for engaging talks on SGLang, the structured generation library XGrammar, and the high-performance kernel library FlashInfer. Enjoy great food, lively discussions, and connect with the community! Click to join 👉

zhyncs (@zhyncs42) 's Twitter Profile Photo

I’ll be joining my Baseten colleague Philip Kiely at the AI Engineer World’s Fair in San Francisco, June 3–5, to introduce LLM serving with SGLang (LMSYS Org). We’d love for you to stop by and exchange ideas in person! 🤗

Zihao Ye (@ye_combinator) 's Twitter Profile Photo

We’re thrilled that FlashInfer won a Best Paper Award at MLSys 2025! 🎉 This wouldn’t have been possible without the community — huge thanks to LMSYS Org’s SGLang for deep co-design (which is critical for inference kernel evolution) and stress-testing over the years, and to

Novita AI (@novita_labs) 's Twitter Profile Photo

Thrilled to partner with <a href="/lmsysorg/">LMSYS Org</a>! 🚀

Our high-performance GPU cloud is powering their game-changing inference engine, revolutionizing LLM deployment with breakthrough RL framework and multi-LLM serving capabilities.

Together, we're bringing lightning-fast AI inference to
Benjamin F Spector (@bfspector) 's Twitter Profile Photo

(1/5) We’ve never enjoyed watching people chop Llamas into tiny pieces.

So, we’re excited to be releasing our Low-Latency-Llama Megakernel! We run the whole forward pass in a single kernel.

Megakernels are faster & more humane. Here’s how to treat your Llamas ethically:

(Joint
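The latency argument behind megakernels can be shown with a toy cost model. This is illustrative only, not the actual Low-Latency-Llama code; the overhead and compute constants are made-up numbers, and the point is just that fusing N per-op kernel launches into one removes (N - 1) fixed launch overheads:

```python
# Toy cost model of kernel fusion (illustrative, not real GPU timings):
# every separate kernel launch pays a fixed overhead, so a megakernel
# that runs the whole forward pass in one launch pays it exactly once.

LAUNCH_OVERHEAD_US = 5.0   # assumed fixed cost per kernel launch
OP_COMPUTE_US = 2.0        # assumed compute time per op

def unfused_latency(num_ops):
    # One launch per op: overhead is paid num_ops times.
    return num_ops * (LAUNCH_OVERHEAD_US + OP_COMPUTE_US)

def megakernel_latency(num_ops):
    # Single launch for the whole forward pass: overhead paid once.
    return LAUNCH_OVERHEAD_US + num_ops * OP_COMPUTE_US

# A forward pass with many small ops benefits most from fusion.
assert megakernel_latency(100) < unfused_latency(100)
```

In practice the win is larger than launch overhead alone, since fusion also lets intermediate activations stay on-chip instead of round-tripping through global memory.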
zhyncs (@zhyncs42) 's Twitter Profile Photo

We sincerely appreciate NVIDIA’s generous sponsorship. It is truly regrettable that the SGLang team was unable to attend in person and witness the physical hardware firsthand. Our special thanks go to InnoMatrix for their invaluable assistance in managing everything seamlessly.