Chujie Zheng (@chujiezheng)'s Twitter Profile
Chujie Zheng

@chujiezheng

Researcher @Alibaba_Qwen | Opinions are my own

ID: 964900352871907330

Link: https://chujiezheng.github.io/

Joined: 17-02-2018 16:32:25

443 Tweets

2.2K Followers

281 Following


Proud to introduce Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant RL algorithm that powers the large-scale RL training of the latest Qwen3 models (Instruct, Coder, Thinking) 🚀 📄 huggingface.co/papers/2507.18…
