Yang Chen (@ychennlp)'s Twitter Profile
Yang Chen

@ychennlp

accelerating @NVIDIA, phd @gtcomputing 🧊locked in

ID: 1043344251780833280

Link: http://edchengg.github.io
Joined: 22-09-2018 03:40:28

84 Tweets

844 Followers

491 Following

📢We conduct a systematic study to demystify the synergy between SFT and RL for reasoning models.

The result? We trained a 7B model - AceReason-Nemotron-1.1, significantly improved from version 1.0 on math and coding benchmarks.

✅AIME2025 (math): 53.6% -> 64.8%
✅LiveCodeBench