Scott Niekum
@scottniekum
Associate professor at UMass Amherst CICS. Alignment, safety, reinforcement learning, imitation learning, and robotics.
ID: 1091815542334337024
https://people.cs.umass.edu/~sniekum/ 02-02-2019 21:48:05
572 Tweets
3.3K Followers
369 Following
This project started with us annoyed at papers evaluating CoT "reasoning" with only GSM8k & MATH. We didn't expect to find such strong evidence that these are the only type of problem where CoT helps! Credit to Juan Diego Rodríguez (he/him) & Kyle Mahowald for driving the rigorous meta-analysis!
Our cross-university(s) collaborative work on "Scaling laws for Reward Model Overoptimization in Direct Alignment Algorithms" is accepted at NeurIPS Conference!
For those interested, the keynotes of the RL_Conference 2024 are now available online: youtube.com/@RL-conference… Unfortunately, Doina Precup's talk was not recorded, but we have: Andy Barto, Emma Brunskill, Finale Doshi-Velez, Sergey Levine, David Silver, and Peter Stone.
Huge congrats to Prasann Singhal for being one of the 8 CRA Outstanding Undergraduate Researcher Award winners! It has been an absolute privilege to work with Prasann during his time at UT. (And he's applying for PhD programs this year...hint hint...) Prasann's work... 🧵
RLZero will be presented at NeurIPS Conference 2025. Learn more about the work in the thread below: