Sudhir Pratap Yadav
@sudhirpyadav
I speak what I feel is true -- Open to changing my mind and discussing anything. My views keep changing with information and the maturity of my mind.
ID: 2179986666
07-11-2013 12:17:51
4.4K Tweets
43 Followers
494 Following
Visit our aamas2022 paper "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" with Lukas Schäfer, Filippos Christianos, and Josiah Hanna. Talk & Q&As on 11 May 18:00-19:00 (1A5-3) and 13 May 9:00-10:00 (3C1-2); all BST. ➡️ Paper: arxiv.org/abs/2107.08966
New project! Flow Policy Gradients for Robot Control. tl;dr: a simple online RL recipe for training and fine-tuning flow policies for robots. Co-led w/ Hongsuk Benjamin Choi: hongsukchoi.github.io/fpo-control