Rui Yang
@ruiyang70669025
PhD student @ HKUST
ID: 1597825781937291265
https://yangrui2015.github.io 30-11-2022 05:32:31
54 Tweets
145 Followers
243 Following
Wanna train a SOTA reward model? 🌟New Blog Alert: "Reward Modeling for RLHF" (with Wei Xiong & Rui Yang) is live this weekend! 🌐✨ We delve into the insights behind achieving groundbreaking performance on RewardBench (by Nathan Lambert). efficient-unicorn-451.notion.site/Reward-Modelin…