Greg Farquhar (@greg_far) 's Twitter Profile
Greg Farquhar

@greg_far

ID: 925419726045577217

calendar_today31-10-2017 17:50:30

23 Tweet

401 Takipçi

92 Takip Edilen

Maximilian Igl (@maxiigl) 's Twitter Profile Photo

I am very excited to share our ICML paper “Deep Variational Reinforcement Learning (DVRL) for POMDPs”: Our agent learns a model of the environment and acts based on its belief state in this model. w/ @zinmalu Tuan Anh Le Frank Wood Shimon Whiteson arxiv.org/abs/1806.02426

I am very excited to share our ICML paper “Deep Variational Reinforcement Learning (DVRL) for POMDPs”: Our agent learns a model of the environment and acts based on its belief state in this model. w/ @zinmalu <a href="/tuananhle7/">Tuan Anh Le</a> <a href="/frankdonaldwood/">Frank Wood</a> <a href="/shimon8282/">Shimon Whiteson</a> arxiv.org/abs/1806.02426
Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

I had the pleasure to co-supervise outstanding MSc students jointly with Jakob Foerster (Jakob Foerster) and Greg Farquhar (Greg Farquhar) at Oxford Comp Sci this year. Together, we compiled our advice for embarking on short-term machine learning research projects: rockt.github.io/2018/08/29/msc…

Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

How can RL agents exploit the compositional, relational and hierarchical structure of the world? A growing number of authors propose learning from natural language. We are excited to share our IJCAIconf survey of this emerging field! arxiv.org/abs/1906.03926 TL;DR:🤖+📖=📈🎯🏆🥳

How can RL agents exploit the compositional, relational and hierarchical structure of the world? A growing number of authors propose learning from natural language. We are excited to share our <a href="/IJCAIconf/">IJCAIconf</a> survey of this emerging field! arxiv.org/abs/1906.03926 
TL;DR:🤖+📖=📈🎯🏆🥳
Greg Farquhar (@greg_far) 's Twitter Profile Photo

Progressively growing the action space creates a great curriculum for learning agents -- check out our paper: arxiv.org/abs/1906.12266 + code: github.com/TorchCraft/Tor…. Great working with Laura Gustafson Zeming Lin Shimon Whiteson Nicolas Usunier Gabriel Synnaeve

Progressively growing the action space creates a great curriculum for learning agents -- check out our paper: arxiv.org/abs/1906.12266 + code: github.com/TorchCraft/Tor…. Great working with Laura Gustafson <a href="/ebetica/">Zeming Lin</a> <a href="/shimon8282/">Shimon Whiteson</a> Nicolas Usunier <a href="/syhw/">Gabriel Synnaeve</a>
Greg Farquhar (@greg_far) 's Twitter Profile Photo

AI accelerates by 10x in the hour it takes to repost from r/machinelearning to r/singularityisnear... just how near is it at that rate?? 😱

AI accelerates by 10x in the hour it takes to repost from r/machinelearning to r/singularityisnear... just how near is it at that rate?? 😱
Greg Farquhar (@greg_far) 's Twitter Profile Photo

I particularly enjoyed visualising & analysing the learned mixing functions that combine per-agent utilities into joint values!

I particularly enjoyed visualising &amp; analysing the learned mixing functions that combine per-agent utilities into joint values!
Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

I am proud to announce the release of the NetHack Learning Environment (NLE)! NetHack is an extremely difficult procedurally-generated grid-world dungeon-crawl game that strikes a great balance between complexity and speed for single-agent reinforcement learning research. 1/

Greg Farquhar (@greg_far) 's Twitter Profile Photo

Permanent damage to generalisation from early updates in non-stationary training -- really enjoyed looking into this intriguing problem and trying to solve it for deep RL agents!

Jakob Foerster (@j_foerst) 's Twitter Profile Photo

Excited to share "DiCE: The Infinitely Differentiable Monte Carlo Estimator": arxiv.org/abs/1802.05098 Try this one weird objective for correct any-order gradient estimators in all your stochastic graphs ;) With fantastic Oxford/CMU team: Greg Farquhar Maruan Al-Shedivat Tim Rocktäschel Shimon Whiteson

Excited to share "DiCE: The Infinitely Differentiable Monte Carlo Estimator": arxiv.org/abs/1802.05098 Try this one weird objective for correct any-order gradient estimators in all your stochastic graphs ;) With fantastic Oxford/CMU team: <a href="/greg_far/">Greg Farquhar</a> <a href="/alshedivat/">Maruan Al-Shedivat</a> <a href="/_rockt/">Tim Rocktäschel</a> <a href="/shimon8282/">Shimon Whiteson</a>
Greg Farquhar (@greg_far) 's Twitter Profile Photo

The camera-ready of our #ICLR2018 paper “TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning” is now online arxiv.org/abs/1710.11417. Code is available at github.com/oxwhirl/treeqn/ Tim Rocktäschel Maximilian Igl Shimon Whiteson WhiRL

The camera-ready of our #ICLR2018 paper “TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning” is now online arxiv.org/abs/1710.11417. Code is available at github.com/oxwhirl/treeqn/ <a href="/_rockt/">Tim Rocktäschel</a> <a href="/MaxiIgl/">Maximilian Igl</a> <a href="/shimon8282/">Shimon Whiteson</a> <a href="/whi_rl/">WhiRL</a>
Shimon Whiteson (@shimon8282) 's Twitter Profile Photo

Our latest paper: how to learn complex joint value functions for teams of agents whose greedy policies can be computed and executed in a decentralised fashion. The trick is a new monotonic value function factorisation. With results on StartCraft 2! arxiv.org/abs/1803.11485