
Yang Chen
@ychennlp
accelerating @NVIDIA, phd @gtcomputing 🧊locked in
ID: 1043344251780833280
http://edchengg.github.io 22-09-2018 03:40:28
84 Tweets
844 Followers
491 Following







Etash Guha Ryan Marten I tried to reproduce the performance of DS-R1-distilled-7B and AceReason-7B on your split (06/24-01/25), and they come out to 41.9 and 54.6 respectively, which is obviously higher than your reported numbers. Is anything wrong here?







