Yizhe Zhang @ ICLR 2025 🇸🇬 (@yizhezhangnlp) 's Twitter Profile
Yizhe Zhang @ ICLR 2025 🇸🇬

@yizhezhangnlp

Research Scientist at Apple MLR | ex-researcher @ Microsoft Research, Meta AI | PhD @ Duke University

ID: 545985637

linkhttps://dreasysnail.github.io calendar_today05-04-2012 13:44:54

152 Tweet

1,1K Takipçi

486 Takip Edilen

Sansa Gong (@sansa19739319) 's Twitter Profile Photo

Our paper has been accepted to ICLR 2026 #ICLR2025 We hope this first 7B diffusion language model inspires the community to further explore the potential of diffusion models. Let’s push the boundaries together! 🚀 Code: github.com/HKUNLP/DiffuLL…

Our paper has been accepted to <a href="/iclr_conf/">ICLR 2026</a> #ICLR2025 

We hope this first 7B diffusion language model inspires the community to further explore the potential of diffusion models. Let’s push the boundaries together! 🚀

Code: github.com/HKUNLP/DiffuLL…
Xingyao Wang (@xingyaow_) 's Twitter Profile Photo

It's fascinating to experience firsthand how quickly we can reproduce some patterns from R1-Zero (RL starts directly from the base model!) with such minimal resources 👀

Jiayi Pan (@jiayi_pirate) 's Twitter Profile Photo

Introducing SWE-Gym: An Open Environment for Training Software Engineering Agents & Verifiers Using SWE-Gym, our agents + verifiers reach new open SOTA - 32%/26% on SWE-Bench Verified/Lite, showing strong scaling with more train / test compute arxiv.org/abs/2412.21139 [🧵]

Introducing SWE-Gym: An Open Environment for Training Software Engineering Agents &amp; Verifiers

Using SWE-Gym, our agents + verifiers reach new open SOTA - 32%/26% on SWE-Bench Verified/Lite,
showing strong scaling with more train / test compute

arxiv.org/abs/2412.21139  [🧵]
Yizhe Zhang @ ICLR 2025 🇸🇬 (@yizhezhangnlp) 's Twitter Profile Photo

Text Diffusion and AR LLM might not be substitutes for each other but rather complementary, enhancing each other. Mercury has taken a solid step forward.

Yizhe Zhang @ ICLR 2025 🇸🇬 (@yizhezhangnlp) 's Twitter Profile Photo

Check out our recent work towards enhancing controllability and strategic planning by integrating “macro actions” to improve emotionally intelligent conversations in chatbots! linkedin.com/feed/update/ur…

Yi Wu (@jxwuyi) 's Twitter Profile Photo

🎉 Milestone Release! AReaL-boba, our latest #RL system! github.com/inclusionAI/AR… #AI • data/code/model ALL🔥 #OPENSOURCE • Full #SGLang & 1.5x faster on 7B RL • SOTA 7B math reasoning: 61.9 AIME24 & 48.3 AIME25 • 200-sample 32B tuning match QwQ on AIME24 Qwen 1/3 👇

🎉 Milestone Release! AReaL-boba, our latest #RL system! github.com/inclusionAI/AR… #AI
• data/code/model ALL🔥 #OPENSOURCE
• Full #SGLang &amp; 1.5x faster on 7B RL
• SOTA 7B math reasoning: 61.9 AIME24 &amp; 48.3 AIME25
• 200-sample 32B tuning match QwQ on AIME24 <a href="/Alibaba_Qwen/">Qwen</a> 
1/3 👇
Jiatao Gu (@thoma_gu) 's Twitter Profile Photo

Due to some conflicts of time, our poster is rescheduled to Saturday afternoon 3-5:30 PM. Hall 3 + Hall 2B #625 Please stop by if you are interested! Thanks,

Due to some conflicts of time, our poster is rescheduled to Saturday afternoon 3-5:30 PM.
Hall 3 + Hall 2B #625

Please stop by if you are interested!
Thanks,
Shuangfei Zhai (@zhaisf) 's Twitter Profile Photo

Proud to report that TarFlow is accepted to #ICML2025 as a Spotlight 🎉 I’m really looking forward to new ideas and applications enabled by powerful Normalizing Flow models 🚀