Yu Gu @ICLR 2025 (@yugu_nlp) 's Twitter Profile
Yu Gu @ICLR 2025

@yugu_nlp

Agents/AI researcher, not LLM researcher.
Ph.D. from @osunlp. ex-Research Intern @MSFTResearch.

ID: 1259941035087724547

linkhttp://entslscheia.github.io calendar_today11-05-2020 20:18:52

471 Tweet

1,1K Followers

650 Following

Yu Gu @ICLR 2025 (@yugu_nlp) 's Twitter Profile Photo

question to RL people: why the reward in RL has to be numerical? is it a design by nature? or is it mainly an expedient design to simplify the model? eager to learn about your opinions