Junhyuk Oh (@junh_oh)'s Twitter Profile
Junhyuk Oh

@junh_oh

Research Scientist at DeepMind

ID: 3075414729

Website: https://junhyuk.com/

Joined: 06-03-2015 18:30:17

13 Tweets

562 Followers

70 Following

Miles Brundage (@miles_brundage)

"Self-Imitation Learning," Oh and Guo et al.: arxiv.org/abs/1806.05635 Imitating past good experiences in the replay buffer leads to big improvements over A2C, PPO, inc. good Montezuma performance in fewer frames than prior approaches

Vincent François-Lavet (@vinfl)

Excited to share a quite extensive introduction to deep reinforcement learning! With ... Riashat Islam Marc G. Bellemare and Joelle Pineau, we hope it will be useful to the community. Print version available at #NeurIPS! arxiv.org/abs/1811.12560

Demis Hassabis (@demishassabis)

Delighted to welcome reinforcement learning pioneer Satinder Singh to DeepMindAI. He’ll bring some incredible experience to the team and I'm really looking forward to working with him!

Google DeepMind (@googledeepmind)

Join us and Blizzard Entertainment this Thursday at 6:00pm GMT for an exciting #StarCraft demonstration, hosted by Dan Stemkoski and Kevin van der Kooi 🇺🇦! Livestream on YouTube: youtube.com/c/deepmind Read more about #StarCraft2 as an environment for AI research: deepmind.com/blog/deepmind-…

Pablo Samuel Castro (@pcastr)

really happy to announce the next version of our #RL framework: Dopamine 2.0! beyond atari: now we support general discrete-domain gym environments. we've been using this internally for our research and it allows us to test out new ideas very quickly. try it out!

Google DeepMind (@googledeepmind)

In a major scientific breakthrough, the latest version of #AlphaFold has been recognised as a solution to one of biology's grand challenges - the “protein folding problem”. It was validated today at #CASP14, the biennial Critical Assessment of protein Structure Prediction (1/3)

Marc G. Bellemare (@marcgbellemare)

Our most recent work is out in Nature! We're reporting on (reinforcement) learning to navigate Loon stratospheric balloons and minimizing the sim2real gap. Results from a 39-day Pacific Ocean experiment show RL keeps its strong lead in real conditions. nature.com/articles/s4158…

Junhyuk Oh (@junh_oh)

I will be briefly talking about how I used JAX to implement my recent #NeurIPS2020 work on Discovering RL Algorithms (arxiv.org/pdf/2007.08794…). Stop by our livestream if you are interested. :)

Luisa Zintgraf (@luisa_zintgraf)

Excited to share our new paper, "DataRater: Meta-Learned Dataset Curation"! We explore a fundamental question: How can we *automatically* learn which data is most valuable for training foundation models? Paper: arxiv.org/pdf/2505.17895 to appear NeurIPS Conference Thread 👇

Junhyuk Oh (@junh_oh)

@albert_swart Thank you for mentioning my code. :) They will merge Jeff Donahue's code soon. See the following PR: github.com/BVLC/caffe/pul…

Pieter Abbeel (@pabbeel)

NIPS Deep RL Symposium deadline Fri Nov 3. We welcome your #ICLR Deep RL submissions! (+other recent work) sites.google.com/view/deeprl-sy…

Pieter Abbeel (@pabbeel)

NIPS Deep RL Symposium schedule now available: sites.google.com/view/deeprl-sy… It includes over 70 contributed papers/posters, and invited talks by David Silver, Joelle Pineau, Ruslan Russ Salakhutdinov, Ben Van Roy, Michael Bowling. Thursday 12/7 Ronny MacIntoshes Plus Jr 🐀