Harri Edwards (@harriledwards) Twitter Tweets • TwiCopy

Antreas Antoniou

7 years ago

Interested in stabilizing the training of your MAML? Do you want to substantially improve the generalization of MAML whilst cutting down hyperparameter tuning by learning most of the hyperparams automatically? Look no further. Paper: arxiv.org/abs/1810.09502 Code: github.com/AntreasAntonio…

10 likes · 1 reply · 3 retweets

Antreas Antoniou

@antreasantonio

7 years ago

A very p(r)uny blog post on Pruning Neural Networks. bayeswatch.com/2018/10/26/pru…

8 likes · 0 replies · 2 retweets

OpenAI

@openai

7 years ago

Random Network Distillation: A prediction-based method that achieves state-of-the-art performance on Montezuma’s Revenge: blog.openai.com/reinforcement-…

506 likes · 8 replies · 154 retweets

Arthur Juliani

@awjuliani

7 years ago

Nice article by James Vincent on OpenAI's new exploration + RL results. theverge.com/2018/11/1/1805…

23 likes · 0 replies · 6 retweets

Miles Brundage

@miles_brundage

7 years ago

"Dilated DenseNets for Relational Reasoning," Antoniou et al.: arxiv.org/abs/1811.00410

8 likes · 0 replies · 3 retweets

OpenAI

@openai

7 years ago

We’ve built an energy-based model that can quickly recognize, generate, and transfer simple concepts after only 5 training demos: blog.openai.com/learning-conce…

1.1K likes · 16 replies · 331 retweets

Joshua Achiam

@jachiam0

7 years ago

We did a thing!!

32 likes · 1 reply · 7 retweets

Antreas Antoniou

@antreasantonio

7 years ago

My latest blog post on meta-learning in general and "How to train your MAML" in particular is now out. bayeswatch.com/2018/11/30/HTY… The post thoroughly explains MAML and some of its problems, and proposes some solutions. In addition, it visualizes the learned per-step, per-layer learning rates.

41 likes · 1 reply · 8 retweets

OpenAI

@openai

7 years ago

We’re releasing CoinRun, an environment generator that provides a metric for an agent’s ability to generalize across new environments - blog.openai.com/quantifying-ge…

970 likes · 16 replies · 320 retweets

Arthur Juliani

@awjuliani

7 years ago

Excited to share the release of Obstacle Tower! Inspired by Montezuma's Revenge, we've built it to act as a benchmark for hard problems in DeepRL: requiring vision, control, planning, and (importantly) generalization in order for agents to perform well. github.com/Unity-Technolo…

488 likes · 14 replies · 141 retweets

James Owers-Bardsley

@jamesowers

7 years ago

My group enjoy talking about prunes: arxiv.org/abs/1810.04622

1 like · 0 replies · 1 retweet

Deepak Pathak

@pathak2206

5 years ago

RL agents get specific to tasks they are trained on. What if we remove the task itself during training? Turns out, a self-supervised planning agent can both explore efficiently & achieve SOTA on test tasks w/ zero or few samples in DMControl from images! ramanans1.github.io/plan2explore

661 likes · 9 replies · 166 retweets

Stanislas Polu

@spolu

5 years ago

Posted my first paper on arXiv💥🙌 GPT-f is a Transformer-based automated theorem prover. We show that Transformer + Search is suitable for formal reasoning and continuous self-improvement 🦾 arxiv.org/abs/2009.03393

873 likes · 17 replies · 187 retweets

Stanislas Polu

@spolu

4 years ago

📔 New MiniF2F paper! arxiv.org/abs/2109.00110 Introduces MiniF2F, a benchmark of Olympiad-level problem statements formalized in Lean/Metamath/Isabelle. GPT-f applied to MiniF2F/Metamath: ~2% 🥶 GPT-f applied to MiniF2F/Lean: ~29% 🔥 Code: github.com/openai/miniF2F 👇

40 likes · 2 replies · 9 retweets

Stanislas Polu

@spolu

4 years ago

When I started this project 2 years ago I couldn't have dreamt of us getting that far. But this is also only the beginning💥 Some thoughts on what we achieved so far 🧵

733 likes · 15 replies · 96 retweets

Harri Edwards

@harriledwards

4 years ago

I'm in awe of the great work my teammates have done.

21 likes · 0 replies · 1 retweet

Kevin Hartnett

@kshartnett

4 years ago

In September 2020 Quanta Magazine wrote about researchers trying to build an AI system that can achieve a gold-medal score at the IMO. New work by Stanislas Polu and co. at OpenAI takes another step in that direction. openai.com/blog/formal-ma…

17 likes · 1 reply · 4 retweets

Yannic Kilcher 🇸🇨

@ykilcher

4 years ago

AI proves formal math theorems 🧮 This paper uses language models for theorem proving, and expert iteration to automatically build a curriculum towards ever harder statements. The final system solves two(!) IMO problems. Watch the paper review here: youtu.be/lvYVuOmUVs8