@yugu_nlp: question to RL people: why the reward in RL has to be numerical? is it a design by nature? or is it mainly an expedient design to simplify the model? eager to learn about your opinions

Yu Gu @ICLR 2025

@yugu_nlp


Agents/AI researcher, not LLM researcher.
Ph.D. from @osunlp. ex-Research Intern @MSFTResearch.

ID: 1259941035087724547

Website: http://entslscheia.github.io
Joined: 11-05-2020 20:18:52

471 Tweets

1.1K Followers

650 Following

4 months ago

Question to RL people: why does the reward in RL have to be numerical? Is it a design by nature, or is it mainly an expedient design to simplify the model? Eager to learn your opinions.

16 likes · 4 replies · 0 reposts
