Junyu Zhang
@jyzhang1208
MSCS @IllinoisCS, Undergrad @HuazhongUST.
ID: 1684232293177774080
https://jyzhang1208.github.io 26-07-2023 16:01:28
6 Tweet
29 Followers
78 Following
1/ How can we model the future rewards (returns) for RL agents? While prior methods round the returns into discrete bins or predict a finite number of quantiles, we use flexible models to predict the fine-grained structure of the full return distribution: pd-perry.github.io/value-flows.