Binyuan Hui (@huybery) Twitter Tweets • TwiCopy

Binyuan Hui

@huybery

🥝 Building Qwen @Alibaba_Qwen. Focus on CodeLLM (Pre-training and Post-training) / Reasoning / Agent. Ideas my own.

ID: 1131679907346505728

Link: http://huybery.github.io · Joined: 23-05-2019 21:54:50

951 Tweets

28.28K Followers

595 Following

Awni Hannun

@awnihannun

4 months ago

Qwen3-Coder-Flash runs quite fast on an M4 Max with mlx-lm. Running the 4-bit here, generated 4,467 tokens at >107 tokens/sec:

Likes 1.1K · Replies 50 · Retweets 136

OpenRouter

@openrouterai

4 months ago

Qwen Code being used with all kinds of models, including Horizon and Gemini.

Likes 389 · Replies 10 · Retweets 33

Binyuan Hui

@huybery

4 months ago

Imagine with Qwen-Image!

Likes 277 · Replies 13 · Retweets 13

Deep Learning For Code @ ICLR'25

@dl4code

4 months ago

📢 Call for Papers - Coding Agent Workshop 🤖💻 Are you doing research on AI and coding? The 4th DL4C workshop: Deep Learning for Code in the Agentic Era is going to be at NeurIPS 2025 and the call for paper is out! 🔍 We're seeking cutting-edge papers on topics including, but

Likes 21 · Replies 0 · Retweets 5

AK

@_akhaliq

4 months ago

Qwen-Image Qwen, a 20B MMDiT model for text-to-image generation is now available in anycoder using Replicate for generating images for your apps. You can now generate images directly inside anycoder for your apps when vibe coding.

Likes 161 · Replies 6 · Retweets 27

Binyuan Hui

@huybery

4 months ago

A hypothesis: gpt-oss is trained entirely on synthetic data, from pre-training to post-training. The approach enhances safety and helps smaller models achieve better performance.

Likes 1.1K · Replies 54 · Retweets 67

Binyuan Hui

@huybery

4 months ago

So far, the most interesting aspect of gpt-oss for me is the harmony response format. Could it replace chatml? wdyt?

Likes 563 · Replies 22 · Retweets 16

Binyuan Hui

@huybery

4 months ago

something small-but-good today

Likes 998 · Replies 69 · Retweets 29

Binyuan Hui

@huybery

4 months ago

Yes!! We're still updating the dense model!

Likes 769 · Replies 29 · Retweets 40

Binyuan Hui

@huybery

4 months ago

We'll continuously enhance the qwen code (cli tool) based on your feedback and even release improved qwen-coder (model)! Our goal is to match Claude Code's performance while remaining fully open-source!

Likes 1.1K · Replies 103 · Retweets 107

Fan Zhou✈️ICLR2025

@fazhou_998

4 months ago

1. npx @qwen-code/[email protected]
2. get 2000 free calls/day via Qwen Chat

quick math: let's suppose avg agentic interaction ≈ 32k context
2000 × 32k ≈ 64 million tokens/day
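The quick math in this tweet can be checked with a minimal sketch. It assumes "32k" means 32,000 tokens per interaction and that each of the 2000 daily calls uses the full average context; the function and variable names are illustrative only and do not come from any Qwen tool.

```python
def daily_token_budget(calls_per_day: int, tokens_per_call: int) -> int:
    """Rough upper bound on tokens processed per day (hypothetical helper)."""
    return calls_per_day * tokens_per_call


calls_per_day = 2000          # free calls/day quoted in the tweet
avg_context_tokens = 32_000   # assumption: "32k" read as 32,000 tokens

budget = daily_token_budget(calls_per_day, avg_context_tokens)
print(f"{budget:,} tokens/day")  # prints "64,000,000 tokens/day"
```

If "32k" is read as 32,768 instead, the same product comes to about 65.5 million, so the tweet's "≈ 64 million" holds either way.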

Likes 104 · Replies 3 · Retweets 10

skibidi_dibidi_

@skibidi_dibidi_

4 months ago

Binyuan Hui Created using Qwen coder cli

Likes 143 · Replies 12 · Retweets 9

Yiheng Xu✈️ICLR2025

@yihengxu_

4 months ago

Excited to see Qwen3-Coder 480B as the default model for AK’s anycoder — thanks! Gave it a one-shot prompt to build an interactive Win95 desktop, and it just works!

Likes 106 · Replies 9 · Retweets 16

Tianbao Xie

@tianbaox

4 months ago

🚀 OSWorld gets a major upgrade! OSWorld-Verified: 15 months community feedback → 300+ fixes (ambiguity, graders…), 50x faster eval through AWS parallelization More apple-to-apple comparison for reliable CUA evaluation ✨ 👇xlang.ai/blog/osworld-v…

Likes 134 · Replies 7 · Retweets 29

carlos

@_carlosejimenez

4 months ago

Recent open model scores on SWE-bench Bash Only:
🥇 Qwen3-Coder 480B/A35B Instruct - 55.40%
🥈 Kimi-K2-Instruct - 43.80%
🥉 gpt-oss-120b - 26.00%
See the full leaderboard below! 👇

Likes 216 · Replies 7 · Retweets 25

AK

@_akhaliq

4 months ago

New AI model drops? make an app for that in a few clicks vibe coding with Qwen3-Coder-480B-A35B-Instruct Qwen and OpenAI gpt-oss-20b example

Likes 81 · Replies 4 · Retweets 13

MuleRun

@mulerun_ai

4 months ago

🚀 Beta Test Dropping today: Mule Run — world’s first AI Agent marketplace. Think eBay but for AI — one entry, tons of Agents waiting. They game for you, code for you, even make you money… and mule keep running more. Beta is invite-only. Join in Discord:

Likes 726 · Replies 51 · Retweets 274