Richard Pang
@yzpang_
yzpang.me; @AIatMeta Llama research, prev: NYU, Meta FAIR, @uchicago, @googleai; research: llm, text gen, alignment, reasoning, human-lm collab, etc.
ID: 919679161903534081
http://yzpang.me 15-10-2017 21:39:33
96 Tweet
477 Takipçi
390 Takip Edilen
🎉 Excited to share that my internship work, ScPO, on self-training LLMs to improve reasoning without human labels, has been accepted to #ICML2025! Many thanks to my awesome collaborators at AI at Meta and @uncnlp🌞Looking forward to presenting ScPO in Vancouver 🇨🇦
Excited to share Prompt Curriculum Learning (PCL) from AI at Meta - we improve performance-efficiency tradeoffs for reasoning RL by predicting prompt difficulty with a value model updated on-policy, and selecting intermediate-difficulty prompts that yield high effective ratios.