Junhyuk Oh (@junh_oh) Twitter Tweets • TwiCopy

Miles Brundage

8 years ago

"Self-Imitation Learning," Oh and Guo et al.: arxiv.org/abs/1806.05635 Imitating past good experiences in the replay buffer leads to big improvements over A2C, PPO, inc. good Montezuma performance in fewer frames than prior approaches

thumb_up_off_alt57

chat_bubble_outline2

repeat19

shareShare

Vincent François-Lavet

@vinfl

7 years ago

Excited to share a quite extensive introduction to deep reinforcement learning! With ... Riashat Islam Marc G. Bellemare and Joelle Pineau, we hope it will be useful to the community. Print version available at #NeurIPS! arxiv.org/abs/1811.12560

thumb_up_off_alt253

chat_bubble_outline3

repeat87

shareShare

Demis Hassabis

@demishassabis

7 years ago

Delighted to welcome reinforcement learning pioneer Satinder Singh to DeepMindAI. He’ll bring some incredible experience to the team and I'm really looking forward to working with him!

thumb_up_off_alt538

chat_bubble_outline4

repeat57

shareShare

Google DeepMind

@googledeepmind

7 years ago

Join us and Blizzard Entertainment this Thursday at 6:00pm GMT for an exciting #StarCraft demonstration, hosted by Dan Stemkoski and Kevin van der Kooi 🇺🇦! Livestream on YouTube: youtube.com/c/deepmind Read more about #StarCraft2 as an environment for AI research: deepmind.com/blog/deepmind-…

Join us and <a href="/Blizzard_Ent/">Blizzard Entertainment</a> this Thursday at 6:00pm GMT for an exciting #StarCraft demonstration, hosted by <a href="/Artosis/">Dan Stemkoski</a> and <a href="/RotterdaM08/">Kevin van der Kooi 🇺🇦</a>!

Livestream on YouTube: youtube.com/c/deepmind

Read more about #StarCraft2 as an environment for AI research: deepmind.com/blog/deepmind-…

thumb_up_off_alt2,2K

chat_bubble_outline62

repeat901

shareShare

Pablo Samuel Castro

@pcastr

7 years ago

really happy to announce the next version of our #RL framework: Dopamine 2.0! beyond atari: now we support general discrete-domain gym environments. we've been using this internally for our research and it allows us to test out new ideas very quickly. try it out!

thumb_up_off_alt128

chat_bubble_outline4

repeat29

shareShare

Berkeley AI Research

@berkeley_ai

7 years ago

CfP for the @ICLR2019 workshop on structure and priors in reinforcement learning (SPiRL), deadline 3/7! spirl.info/2019/call/

thumb_up_off_alt30

chat_bubble_outline1

repeat4

shareShare

Google DeepMind

@googledeepmind

5 years ago

In a major scientific breakthrough, the latest version of #AlphaFold has been recognised as a solution to one of biology's grand challenges - the “protein folding problem”. It was validated today at #CASP14, the biennial Critical Assessment of protein Structure Prediction (1/3)

thumb_up_off_alt9,9K

chat_bubble_outline123

repeat2,2K

shareShare

Marc G. Bellemare

@marcgbellemare

5 years ago

Our most recent work is out in Nature! We're reporting on (reinforcement) learning to navigate Loon stratospheric balloons and minimizing the sim2real gap. Results from a 39-day Pacific Ocean experiment show RL keeps its strong lead in real conditions. nature.com/articles/s4158…

thumb_up_off_alt740

chat_bubble_outline22

repeat103

shareShare

Junhyuk Oh

@junh_oh

5 years ago

I will be briefly talking about how I used JAX to implement my recent #NeurIPS2020 work on Discovering RL Algorithms (arxiv.org/pdf/2007.08794…). Stop by our livestream if you are interested. :)

thumb_up_off_alt16

chat_bubble_outline0

repeat3

shareShare

Luisa Zintgraf

@luisa_zintgraf

2 months ago

Excited to share our new paper, "DataRater: Meta-Learned Dataset Curation"! We explore a fundamental question: How can we *automatically* learn which data is most valuable for training foundation models? Paper: arxiv.org/pdf/2505.17895 to appear NeurIPS Conference Thread 👇

thumb_up_off_alt326

chat_bubble_outline10

repeat50

shareShare

Junhyuk Oh

@junh_oh

11 years ago

@albert_swart Thank you for mentioning my code. :) They will merge Jeff Donahue's code soon. See the following PR: github.com/BVLC/caffe/pul…

thumb_up_off_alt1

chat_bubble_outline0

repeat1

shareShare

Ruben Villegas

@rubenevillegas

9 years ago

Check out the code (with trained models) for our ICLR 2017 paper on video prediction. github.com/rubenvillegas/…

thumb_up_off_alt9

chat_bubble_outline0

repeat5

shareShare

Pieter Abbeel

@pabbeel

8 years ago

NIPS Deep RL Symposium deadline Fri Nov 3. We welcome your #ICLR Deep RL submissions! (+other recent work) sites.google.com/view/deeprl-sy…

thumb_up_off_alt45

chat_bubble_outline0

repeat15

shareShare

Pieter Abbeel

@pabbeel

8 years ago

NIPS Deep RL Symposium Schedule now available: sites.google.com/view/deeprl-sy… includes over 70 contributed papers/posters, and invited talks by David Silver, Joelle Pineau, Ruslan Russ Salakhutdinov , Ben Van Roy, Michael Bowling. Thursday 12/7 Ronny MacIntoshes Plus Jr 🐀

thumb_up_off_alt167

chat_bubble_outline2

repeat54

shareShare