YIFENG LIU (@yifengliu_ai) 's Twitter Profile
YIFENG LIU

@yifengliu_ai

ID: 1778258634889383937

linkhttp://lauyikfung.github.io calendar_today11-04-2024 03:08:00

26 Tweet

82 Followers

20 Following

YIFENG LIU (@yifengliu_ai) 's Twitter Profile Photo

1/6 We introduce RPG, a principled framework for deriving and analyzing KL-regularized policy gradient methods, unifying GRPO/k3-estimator and REINFORCE++ under this framework and discovering better RL objectives than GRPO: Paper: arxiv.org/abs/2505.17508 Code:

1/6 We introduce RPG, a principled framework for deriving and analyzing KL-regularized policy gradient methods, unifying GRPO/k3-estimator and REINFORCE++ under this framework and discovering better RL objectives than GRPO:
Paper: arxiv.org/abs/2505.17508
Code: