Pushpendre Rastogi
@pushpendre89
Senior research eng at Google Deepmind
ID: 588806879
http://pushpendre.github.io 24-05-2012 03:51:15
253 Tweet
240 Takipçi
553 Takip Edilen
Deciding between extending Art, SkyRL, and OpenRLHF for multi-objective RL . Needed process supervision + LoRA so Verl, trl, nemorl etc. are out. Anyone shipped with these? Which has the least footguns, good acceptance of PRs? Kyle Corbitt Sumanth Hegde Ankit Maloo #skyrl #openrlhf #roll