Greg Farquhar
@greg_far
ID: 925419726045577217
31-10-2017 17:50:30
23 Tweets
401 Followers
92 Following
I am very excited to share our ICML paper “Deep Variational Reinforcement Learning (DVRL) for POMDPs”: our agent learns a model of the environment and acts based on its belief state in this model. w/ @zinmalu, Tuan Anh Le, Frank Wood, Shimon Whiteson arxiv.org/abs/1806.02426
I had the pleasure of co-supervising outstanding MSc students jointly with Jakob Foerster and Greg Farquhar at Oxford Comp Sci this year. Together, we compiled our advice for embarking on short-term machine learning research projects: rockt.github.io/2018/08/29/msc…
Progressively growing the action space creates a great curriculum for learning agents -- check out our paper: arxiv.org/abs/1906.12266 + code: github.com/TorchCraft/Tor…. Great working with Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve
Excited to share "DiCE: The Infinitely Differentiable Monte Carlo Estimator": arxiv.org/abs/1802.05098 Try this one weird objective for correct any-order gradient estimators in all your stochastic graphs ;) With fantastic Oxford/CMU team: Greg Farquhar, Maruan Al-Shedivat, Tim Rocktäschel, Shimon Whiteson
The camera-ready of our #ICLR2018 paper “TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning” is now online: arxiv.org/abs/1710.11417. Code is available at github.com/oxwhirl/treeqn/ Tim Rocktäschel, Maximilian Igl, Shimon Whiteson, WhiRL