
Beidi Chen
@beidichen
Asst. Prof @CarnegieMellon, Visiting Researcher @Meta, Postdoc @Stanford, Ph.D. @RiceUniversity, Large-Scale ML, a fan of Dota2.
ID: 424387623
https://www.andrew.cmu.edu/user/beidic/ 29-11-2011 18:22:36
461 Tweets
14.14K Followers
375 Following

Happy Throughput Thursday! We’re excited to release Tokasaurus: an LLM inference engine designed from the ground up for high-throughput workloads with large and small models. (Joint work with Ayush Chakravarthy, Ryan Ehrlich, Sabri Eyuboglu, Bradley Brown, Joseph Shetaye,

Our view on test-time scaling has been to train models to discover algos that enable them to solve harder problems. Amrith Setlur & Matthew Yang's new work e3 shows how RL done with this view produces best <2B LLM on math that extrapolates beyond training budget. 🧵⬇️

Hello MiniMax (official): exciting model, but a questionable claim that its reasoning scales better than DeepSeek and Qwen. Nice try on reasoning longer to be SOTA, but using FLOPs to quantify the cost of test-time scaling doesn’t work for hybrid models 🫣 chen zhuoming has


Xinyu Yang will be presenting this amazing work at ASAP seminar tomorrow! Do not miss his talk
