Chujie Zheng (@chujiezheng) Twitter Tweets • TwiCopy

Chujie Zheng

@chujiezheng

3 months ago

🔥🔥🔥

thumb_up_off_alt21

chat_bubble_outline1

repeat1

shareShare

Chujie Zheng

@chujiezheng

3 months ago

Welcome Wenting!

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to

thumb_up_off_alt6,6K

chat_bubble_outline205

repeat1,1K

shareShare

Qwen

@alibaba_qwen

3 months ago

Curtains up 🎭 Meet Qwen3-Next — smarter, cuter, and ready to take the stage. 🚀

thumb_up_off_alt1,1K

chat_bubble_outline80

repeat128

shareShare

Chujie Zheng

@chujiezheng

3 months ago

r u ready

thumb_up_off_alt163

chat_bubble_outline7

repeat4

shareShare

Qwen

@alibaba_qwen

3 months ago

🚀 Introducing Qwen3-Next-80B-A3B — the FUTURE of efficient LLMs is here! 🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B.(esp. @ 32K+ context!) 🔹Hybrid Architecture: Gated DeltaNet + Gated Attention → best of speed &

thumb_up_off_alt3,3K

chat_bubble_outline134

repeat544

shareShare

Yuchen Jin

@yuchenj_uw

3 months ago

Qwen3-Next (thinking & non-thinking) are now live in BF16 at Hyperbolic! Qwen3-Next is a huge efficiency leap: - 80B MoE with just 3B active params - 10x cheaper to train vs Qwen3-32B - 10x inference throughput for >32K tokens Proud to be a launch partner with Qwen -

thumb_up_off_alt461

chat_bubble_outline18

repeat52

shareShare

Junyang Lin

@justinlin610

3 months ago

Qwen3-Next, or to say, a preview of our next generation (3.5?) is out! This time we try to be bold, but actually we have been doing experiments on hybrid models and linear attention for about a year. We believe that our solution shoud be at least a stable and solid solution to

thumb_up_off_alt1,1K

chat_bubble_outline53

repeat121

shareShare

LMSYS Org

@lmsysorg

3 months ago

Qwen3-Next is out! SGLang has supported it on day 0 with speculative decoding. Try it out 👇

thumb_up_off_alt78

chat_bubble_outline0

repeat15

shareShare

vLLM

@vllm_project

3 months ago

Welcome Qwen3-Next! You can run it efficiently on vLLM with accelerated kernels and native memory management for hybrid models. blog.vllm.ai/2025/09/11/qwe…

thumb_up_off_alt307

chat_bubble_outline10

repeat42

shareShare

Chujie Zheng

@chujiezheng

3 months ago

Meet Qwen3-Next-80B-A3B, our next-generation model architecture featuring excellent performance and exceptional training & inference efficiency Now let’s do greater SCALING 🚀🚀

thumb_up_off_alt17

chat_bubble_outline2

repeat0

shareShare

Chujie Zheng

@chujiezheng

3 months ago

you guys always hero 🫡

thumb_up_off_alt12

chat_bubble_outline0

repeat1

shareShare

Qwen

@alibaba_qwen

3 months ago

🆙Qwen Code v0.0.10 & v0.0.11 bring new features and dev-friendly improvements: ✨New UX & Productivity · Subagents for smarter task decomposition · Todo Write tool for task tracking · “Welcome Back” project summary on reopen! · Customizable cache Strategy ⚡Performance & Dev

thumb_up_off_alt818

chat_bubble_outline31

repeat103

shareShare

Artificial Analysis

@artificialanlys

3 months ago

Alibaba has released Qwen3 Next 80B: an open weights hybrid reasoning model that achieves DeepSeek V3.1-level intelligence with only 3B active parameters Key takeaways: 💡 Novel architecture: First model to introduce Qwen's ‘Qwen3-Next’ foundation models, with several

thumb_up_off_alt732

chat_bubble_outline33

repeat108

shareShare

LM Studio

@lmstudio

3 months ago

LM Studio now supports Qwen3-Next with MLX on Mac! 🧵

thumb_up_off_alt550

chat_bubble_outline28

repeat69

shareShare

Alibaba Tongyi_Lab

@labtongyi96898

3 months ago

1/7 We're launching Tongyi DeepResearch, the first fully open-source Web Agent to achieve performance on par with OpenAI's Deep Research with only 30B (Activated 3B) parameters! Tongyi DeepResearch agent demonstrates state-of-the-art results, scoring 32.9 on Humanity's Last Exam,

thumb_up_off_alt3,3K

chat_bubble_outline99

repeat430

shareShare

Qwen

@alibaba_qwen

3 months ago

Struggling with the 3-minute limit on Qwen3-ASR-Flash? No more! Introducing the Qwen3-ASR-Toolkit 🚀 A free, open-source CLI to transcribe HOURS-long audio/video files at high speed. Unleash the full power of the Qwen3-ASR-Flash API! 💥 🧠 Smart VAD splitting (no awkward

thumb_up_off_alt710

chat_bubble_outline28

repeat90

shareShare