
Yang Chen
@ychennlp
accelerating @NVIDIA, phd @gtcomputing 🧊locked in
ID: 1043344251780833280
http://edchengg.github.io 22-09-2018 03:40:28
84 Tweets
844 Followers
491 Following







Etash Guha Ryan Marten I tried to reproduce the performance of DS-R1-distilled-7B and AceReason-7B on your split (06/24-01/25), and they came out at 41.9 and 54.6 respectively, which is clearly higher than your reported numbers. Is anything wrong here?


With a stronger SFT backbone, AceReason-Nemotron-1.1-7B significantly outperforms its predecessor and sets a record-high performance among Qwen2.5-7B-based reasoning models. 📄Report: arxiv.org/pdf/2506.13284 🤗Model: huggingface.co/nvidia/AceReas… 📚SFT Data: huggingface.co/datasets/nvidi…




