Tuomas Haarnoja
@haarnoja
Research Scientist @DeepMind, previously PhD @berkeley_ai and Student Researcher @GoogleAI
ID: 899136914531393536
20-08-2017 05:12:00
38 Tweet
3,3K Followers
73 Following
My thesis "Acquiring Diverse Robot Skills via Maximum Entropy Deep Reinforcement Learning" has been accepted! Below is a 17 second summary. Thanks to my committee Sergey Levine, Pieter Abbeel, Mark Mueller and all the collaborators! www2.eecs.berkeley.edu/Pubs/TechRpts/…
Great WIRED article about our research on real-world RL for robotic walking, with some nice quotes from Tuomas Haarnoja: wired.com/story/the-clev… (research w/ Tuomas Haarnoja, Aurick Zhou, George Tucker, Sehoon Ha, and Jie Tan)
Videos of our #Infer2Control workshop #neurips2018 are now available, featuring great talks by Ryan Adams Ryan Adams, Anne Collins Collins Lab, Leslie Kaelbling, Sergey Levine Sergey Levine, Shie Mannor, Emo Todorov, and many others! youtube.com/playlist?list=…
Big update to Stable-Baselines: v2.4.0 is out! Highlights: - New RL baseline: Soft Actor-Crtic (Tuomas Haarnoja et al.) with automatic entropy temperature - Customize policy at model creation - RL Baselines Zoo updated (now 80+ trained agents) Full changelog: github.com/hill-a/stable-…
Excited to share I have joined DeepMindAI in London as a Research Scientist!
L2 distance is a bad proxy for how long it takes to reach a goal. Can we learn dynamical distances? DDL can shape rewards and help learn from preferences. Below: robot learning w/ DDL from images w/o reward, only prefs. sites.google.com/view/skills-vi… w K Hartikainen, X Geng, Tuomas Haarnoja
New research from Google DeepMind brings together soccer and robotics. Using reinforcement learning, robots display agile and reactive movements similar to a soccer player, no shin guards needed:) science.org/doi/10.1126/sc…