LMSYS Org (@lmsysorg)'s Twitter Profile
LMSYS Org

@lmsysorg

Large Model Systems Organization: We developed SGLang (sglang.ai), Chatbot Arena, and Vicuna! Please join our Slack channel at slack.sglang.ai

ID: 1822588444046249984

Link: https://lmsys.org/ · Joined: 11-08-2024 10:58:54

309 Tweets

5.5K Followers

138 Following

Kimi.ai (@kimi_moonshot)'s Twitter Profile Photo

🚀 Hello, Kimi K2! Open-Source Agentic Model!
🔹 1T total / 32B active MoE model
🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models
🔹 Strong in coding and agentic tasks
🐤 Multimodal & thought-mode not supported for now

With Kimi K2, advanced agentic intelligence

zhyncs (@zhyncs42)'s Twitter Profile Photo

Huge thanks to the MoonCake team for bringing day 0 support for Kimi K2 in SGLang and KTransformers, including the integration of the new reasoning parser!
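
A minimal sketch of what day-0 serving looks like with SGLang's offline engine, assuming the sgl.Engine API and its kwarg names (tp_size, trust_remote_code) match your SGLang version; the parser wiring mentioned above is configured through server flags and is omitted here.

import sglang as sgl

# Offline engine; kwargs mirror the sglang.launch_server flags. tp_size=8 is
# an illustrative single-node choice, not a tested setup for this model.
llm = sgl.Engine(
    model_path="moonshotai/Kimi-K2-Instruct",  # public HF checkpoint
    tp_size=8,
    trust_remote_code=True,
)
out = llm.generate(
    "Explain MoE routing in one sentence.",
    {"temperature": 0.6, "max_new_tokens": 64},
)
print(out["text"])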

LMSYS Org (@lmsysorg)'s Twitter Profile Photo

SGLang is currently the only open-source LLM serving engine validated with PD disaggregation + large-scale EP on an H200 cluster with over 100 GPUs. Huge thanks to the MoonCake team at Kimi.ai for helping verify it even before release!

LMSYS Org (@lmsysorg)'s Twitter Profile Photo

Kimi K2 (Kimi.ai) on SGLang:
As a trillion-parameter model, it struggles with long context on just 1–2 H200/B200 nodes. Use SGLang’s PD disaggregation plus large-scale EP, validated on 100+ H200s by the MoonCake team.
The new SOTA is here. Start using it!
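
From the client side, a PD-disaggregated deployment still looks like a single OpenAI-compatible endpoint. A hedged sketch follows: the launch flags in the comments (--disaggregation-mode, and a router fronting the two pools) are assumptions to verify against the SGLang PD documentation for your version.

# Server side (one command per node; flag names are assumptions):
#   prefill pool: python -m sglang.launch_server --model-path moonshotai/Kimi-K2-Instruct \
#                     --trust-remote-code --disaggregation-mode prefill ...
#   decode pool:  python -m sglang.launch_server --model-path moonshotai/Kimi-K2-Instruct \
#                     --trust-remote-code --disaggregation-mode decode ...
# A router fronts both pools and exposes one OpenAI-compatible URL.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:30000/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="moonshotai/Kimi-K2-Instruct",
    messages=[{"role": "user", "content": "Summarize PD disaggregation in two sentences."}],
    max_tokens=128,
)
print(resp.choices[0].message.content)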

zhyncs (@zhyncs42)'s Twitter Profile Photo

Kimi.ai K2 shares the same architecture as DeepSeek R1, with the routed expert count increased from 256 to 384, so all SGLang optimizations work out of the box. PD disaggregation plus large-scale EP was validated on 100+ H200s by the MoonCake team before release. Always trust SGLang! 🚀
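
One hedged way to check the shared-architecture claim: both checkpoints ship DeepSeek-V3-style configs, so the routed-expert count should be the main visible difference. The repo IDs are the public Hugging Face names; the n_routed_experts field name is an assumption about the config schema.

from transformers import AutoConfig

# Compare the MoE expert counts straight from the published configs.
for repo in ("deepseek-ai/DeepSeek-R1", "moonshotai/Kimi-K2-Instruct"):
    cfg = AutoConfig.from_pretrained(repo, trust_remote_code=True)
    print(repo, "routed experts:", getattr(cfg, "n_routed_experts", "n/a"))
# Expected per the tweet: 256 for DeepSeek R1 vs 384 for Kimi K2.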

LMSYS Org (@lmsysorg)'s Twitter Profile Photo

🚀 Summer Fest Day 3: Cost-Effective MoE Inference on CPU from the Intel PyTorch team

Deploying 671B DeepSeek R1 with zero GPUs? SGLang now supports high-performance CPU-only inference on Intel Xeon 6, enabling billion-scale MoE models like DeepSeek to run on commodity CPU servers.
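
A minimal sketch of GPU-free serving, assuming the CPU backend is selected with a device="cpu" engine argument; the exact flag and the kernels available depend on how your SGLang build was compiled for Xeon, so treat this as an outline rather than a recipe. The full 671B model also needs a very large-memory host.

import sglang as sgl

# device="cpu" is assumed to select the Intel Xeon backend; verify the
# flag name and build requirements against the SGLang CPU docs.
llm = sgl.Engine(
    model_path="deepseek-ai/DeepSeek-R1",  # 671B MoE, public HF checkpoint
    device="cpu",
    trust_remote_code=True,
)
print(llm.generate("Hello from a GPU-free server.", {"max_new_tokens": 32})["text"])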