Chujie Zheng (@chujiezheng) 's Twitter Profile
Chujie Zheng

@chujiezheng

Researcher @Alibaba_Qwen | Opinions are my own

ID: 964900352871907330

linkhttps://chujiezheng.github.io/ calendar_today17-02-2018 16:32:25

443 Tweet

2,2K Followers

281 Following

Thinking Machines (@thinkymachines) 's Twitter Profile Photo

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”

We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to
Qwen (@alibaba_qwen) 's Twitter Profile Photo

🚀 Introducing Qwen3-Next-80B-A3B — the FUTURE of efficient LLMs is here! 🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B.(esp. @ 32K+ context!) 🔹Hybrid Architecture: Gated DeltaNet + Gated Attention → best of speed &

🚀 Introducing Qwen3-Next-80B-A3B — the FUTURE of efficient LLMs is here!

🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B.(esp. @ 32K+ context!)
🔹Hybrid Architecture: Gated DeltaNet + Gated Attention → best of speed &
Yuchen Jin (@yuchenj_uw) 's Twitter Profile Photo

Qwen3-Next (thinking & non-thinking) are now live in BF16 at Hyperbolic! Qwen3-Next is a huge efficiency leap: - 80B MoE with just 3B active params - 10x cheaper to train vs Qwen3-32B - 10x inference throughput for >32K tokens Proud to be a launch partner with Qwen -

Junyang Lin (@justinlin610) 's Twitter Profile Photo

Qwen3-Next, or to say, a preview of our next generation (3.5?) is out! This time we try to be bold, but actually we have been doing experiments on hybrid models and linear attention for about a year. We believe that our solution shoud be at least a stable and solid solution to

vLLM (@vllm_project) 's Twitter Profile Photo

Welcome Qwen3-Next! You can run it efficiently on vLLM with accelerated kernels and native memory management for hybrid models. blog.vllm.ai/2025/09/11/qwe…

Welcome Qwen3-Next! You can run it efficiently on vLLM with accelerated kernels and native memory management for hybrid models. 

blog.vllm.ai/2025/09/11/qwe…
Chujie Zheng (@chujiezheng) 's Twitter Profile Photo

Meet Qwen3-Next-80B-A3B, our next-generation model architecture featuring excellent performance and exceptional training & inference efficiency Now let’s do greater SCALING 🚀🚀

Qwen (@alibaba_qwen) 's Twitter Profile Photo

🆙Qwen Code v0.0.10 & v0.0.11 bring new features and dev-friendly improvements: ✨New UX & Productivity · Subagents for smarter task decomposition · Todo Write tool for task tracking · “Welcome Back” project summary on reopen! · Customizable cache Strategy ⚡Performance & Dev

🆙Qwen Code v0.0.10 & v0.0.11 bring new features and dev-friendly improvements:

✨New UX & Productivity

· Subagents for smarter task decomposition
· Todo Write tool for task tracking
· “Welcome Back” project summary on reopen!
· Customizable cache Strategy

⚡Performance & Dev
Artificial Analysis (@artificialanlys) 's Twitter Profile Photo

Alibaba has released Qwen3 Next 80B: an open weights hybrid reasoning model that achieves DeepSeek V3.1-level intelligence with only 3B active parameters Key takeaways: 💡 Novel architecture: First model to introduce Qwen's ‘Qwen3-Next’ foundation models, with several

Alibaba has released Qwen3 Next 80B: an open weights hybrid reasoning model that achieves DeepSeek V3.1-level intelligence with only 3B active parameters

Key takeaways:
💡 Novel architecture: First model to introduce <a href="/Alibaba_Qwen/">Qwen</a>'s  ‘Qwen3-Next’ foundation models, with several
Alibaba Tongyi_Lab (@labtongyi96898) 's Twitter Profile Photo

1/7 We're launching Tongyi DeepResearch, the first fully open-source Web Agent to achieve performance on par with OpenAI's Deep Research with only 30B (Activated 3B) parameters! Tongyi DeepResearch agent demonstrates state-of-the-art results, scoring 32.9 on Humanity's Last Exam,

1/7 We're launching Tongyi DeepResearch, the first fully open-source Web Agent to achieve performance on par with OpenAI's Deep Research with only 30B (Activated 3B) parameters! Tongyi DeepResearch agent demonstrates state-of-the-art results, scoring 32.9 on Humanity's Last Exam,
Qwen (@alibaba_qwen) 's Twitter Profile Photo

Struggling with the 3-minute limit on Qwen3-ASR-Flash? No more! Introducing the Qwen3-ASR-Toolkit 🚀 A free, open-source CLI to transcribe HOURS-long audio/video files at high speed. Unleash the full power of the Qwen3-ASR-Flash API! 💥 🧠 Smart VAD splitting (no awkward