Vektor Dewanto (@tttorrr) 's Twitter Profile
Vektor Dewanto

@tttorrr

study control and reinforcement-learning to make robots progressively more autonomous; love teaching. "Likes" is for bookmarks.

ID: 940441783

linkhttp://tttor.github.io/ calendar_today11-11-2012 02:57:42

345 Tweet

130 Followers

1,1K Following

Massimo (@rainmaker1973) 's Twitter Profile Photo

This ultra-small generator is highly portable and works even with shallow, slow moving water. Already tested with powering street lights, it promises to allow people in the world’s remote regions to generate their own electricity buff.ly/2U6yeEc

Danfei Xu (@danfei_xu) 's Twitter Profile Photo

dspace.mit.edu/bitstream/hand… How to do Research At the MIT AI Lab (1988). Almost all advices are still valid more than three decades later. Highly recommended.

dspace.mit.edu/bitstream/hand… How to do Research
At the MIT AI Lab (1988). 
Almost all advices are still valid more than three decades later. Highly recommended.
Gergely Neu (@neu_rips) 's Twitter Profile Photo

excited to announce a new series of virtual seminars on ~~~REINFORCEMENT LEARNING THEORY~~~ we've set this up with Ciara Pike-Burke and Csaba Szepesvari to keep track of all the advances of this fast-paced field. hope others will also find it useful! sites.google.com/view/rltheorys…

excited to announce a new series of virtual seminars on
~~~REINFORCEMENT LEARNING THEORY~~~

we've set this up with <a href="/CiaraPikeBurke/">Ciara Pike-Burke</a> and <a href="/CsabaSzepesvari/">Csaba Szepesvari</a> to keep track of all the advances of this fast-paced field. hope others will also find it useful!
sites.google.com/view/rltheorys…
Yoni Nazarathy (@ynazarathy) 's Twitter Profile Photo

Battling COVID-19 | Think outside the box | Think beyond contact tracing. A method like Safe Blues can help with quick detection of the second wave. Safe Blues paper: safeblues.org

Battling COVID-19 | Think outside the box | Think beyond contact tracing.

A method like Safe Blues can help with quick detection of the second wave.

Safe Blues paper: safeblues.org
Richard Socher (@richardsocher) 's Twitter Profile Photo

Excited to introduce the AI Economist: Extends ideas from Reinforcement Learning for tackling inequality through learned tax policy design. The framework optimizes productivity and equality. Blog: blog.einstein.ai/the-ai-economi… Paper: arxiv.org/abs/2004.13332 Q&A: salesforce.com/company/news-p…

Yue Wu (@frankyuewu1) 's Twitter Profile Photo

We just released our new paper: A Finite Time Analysis of Two Time-Scale Actor Critic Methods (arxiv.org/abs/2005.01350). We for the first time show that two time-scale actor-critic methods can find an eps-stationary point within eps^{-2.5} sample complexity.

arXiv Daily (@arxiv_daily) 's Twitter Profile Photo

Average-reward model-free reinforcement learning: a systematic review and literature mapping deepai.org/publication/av… by Vektor Dewanto et al. including Marcus Gallagher #ReinforcementLearning #ArtificialIntelligence

arXiv Daily (@arxiv_daily) 's Twitter Profile Photo

Examining average and discounted reward optimality criteria in reinforcement learning deepai.org/publication/ex… by Vektor Dewanto and Marcus Gallagher #ReinforcementLearning #ComputerScience

arXiv Daily (@arxiv_daily) 's Twitter Profile Photo

Approximate discounting-free policy evaluation from transient and recurrent states deepai.org/publication/ap… by Vektor Dewanto and Marcus Gallagher #Statistics #Estimator