Devvrit
@devvrit_khatri
GradStudent@UTCompSci. Large Scale ML - Scalability and Efficiency
ID: 1211241136859172864
https://www.devvrit.com/ 29-12-2019 11:02:56
104 Tweet
211 Takipçi
183 Takip Edilen
This is a great blog explaining the progress in scaling RL and our work. Pretty clear, intuitive, and captures the key takeaways (and limitations :)). Thanks, Nathan Lambert!
🌟 Introducing General On-Policy Logit Distillation 🌟 Inspired by the latest from Thinking Machines, we extend on-policy distillation to enable ANY teacher to be distilled into ANY student, even if their tokenizers differ! We've added this to TRL so you can now take any pair of
Alexia Jolicoeur-Martineau arxiv.org/abs/2510.01123 Another way of doing recursive computations (Parallel-Distill-Refine)
🚨 New in ML Workshop at NeurIPS Conference We're so excited to invite you to the New In ML Workshop (NewInML @ NeurIPS 2025), taking place on Tuesday, December 2nd, 2025, at the San Diego Convention Center! Great opportunity, specifically for people who are new in machine learning! Details🧵