The NetHack Learning Environment (@nethack_le) Twitter Tweets • TwiCopy

The NetHack Learning Environment

@nethack_le

Official handle for the NetHack Learning Environment (arxiv.org/abs/2006.13760)

ID: 1384437325606891521

Link: https://github.com/heiner/nle

Joined: 20-04-2021 09:23:45

95 Tweets

912 Followers

26 Following

Great to see the next keynote by Pierluca D'Oro at the AutoRL workshop also using The NetHack Learning Environment. Motif (arxiv.org/abs/2310.00166) uses LLMs to intrinsically motivate agents, leading to better score in NetHack than training on the environment's extrinsic reward 🤯

Likes: 17 · Replies: 1 · Retweets: 5

The NetHack Learning Environment

@nethack_le

a year ago

What's stopping you from working like this?

Likes: 14 · Replies: 1 · Retweets: 2

badcop

@badcop_

a year ago

does anyone know how to prevent this from happening?? help

Likes: 4.4K · Replies: 182 · Retweets: 247

The NetHack Learning Environment

@nethack_le

a year ago

Most video games kill your potential. In NetHack, it gets killed by a random monster first.

Likes: 4 · Replies: 1 · Retweets: 1

Ethan Mollick

@emollick

a year ago

Steve Strickland ARC will be solved before Nethack

Likes: 46 · Replies: 1 · Retweets: 4

Tim Rocktäschel

@_rockt

a year ago

💯‼️ That's why Davide Paglieri created balrogai.com

Likes: 18 · Replies: 1 · Retweets: 2

Tim Rocktäschel

@_rockt

a year ago

Yearly reminder

Likes: 51 · Replies: 0 · Retweets: 9

The NetHack Learning Environment

@nethack_le

a year ago

cc Tim Rocktäschel

Likes: 3 · Replies: 0 · Retweets: 0

Tim Rocktäschel

@_rockt

a year ago

In the meantime, The NetHack Learning Environment and balrogai.com...

Likes: 11 · Replies: 1 · Retweets: 2

Tim Rocktäschel

@_rockt

a year ago

💯 For me this is NetHack (see The NetHack Learning Environment and balrogai.com). I am still holding my breath.

Likes: 20 · Replies: 3 · Retweets: 1

Martin Klissarov

@martinklissarov

10 months ago

Can AI agents adapt zero-shot to complex multi-step language instructions in open-ended environments? We present MaestroMotif, a method for AI-assisted skill design that produces highly capable and steerable hierarchical agents. To the best of our knowledge, it is the first…

Likes: 203 · Replies: 6 · Retweets: 53

Mikayel Samvelyan

@_samvelyan

10 months ago

⚔️ MiniHack Updates! ⚔️ 1️⃣ MiniHack 1.0.0 is here! Following popular demand, it now supports the new Gymnasium API and is built on NLE 1.1.0. Huge thanks to @Stephen_Oman (maintainer of The NetHack Learning Environment) for his outstanding contribution! 🙌

Likes: 66 · Replies: 3 · Retweets: 13

Eric Hambro

@erichammy

9 months ago

All part of the Pokémon to The NetHack Learning Environment pipeline… all we need is a cute soundtrack heiner Tim Rocktäschel

Likes: 8 · Replies: 2 · Retweets: 1

Stephen Oman

@stephen_oman

7 months ago

Happy to announce the latest release of The NetHack Learning Environment (version 1.2.0). You can now use the seed function to make the dungeon layout reproducible across training episodes. In-level interaction and combat are still randomly determined and don't affect the layouts of lower levels.
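
The idea in the announcement above can be illustrated with a toy sketch that keeps level generation and combat on separate random streams. This is not NLE's API: `generate_layout`, `run_episode`, and all their parameters are hypothetical names invented for illustration, and the "dungeon" is just a small grid of characters. It assumes only that the layout stream is seeded and the combat stream is not.

```python
import random


def generate_layout(layout_rng, width=8, height=4):
    """Toy 'dungeon level': a grid of floor ('.') and wall ('#') tiles."""
    return [
        "".join(layout_rng.choice(".#") for _ in range(width))
        for _ in range(height)
    ]


def run_episode(layout_seed, levels=3, fights_per_level=0):
    """Hypothetical episode: seeded layouts, unseeded combat rolls."""
    layout_rng = random.Random(layout_seed)  # fixed seed: same levels every episode
    combat_rng = random.Random()             # unseeded: differs between episodes
    layouts, rolls = [], []
    for _ in range(levels):
        layouts.append(generate_layout(layout_rng))
        # Combat draws come from their own stream, so however many happen
        # on this level, the next level's layout is unaffected.
        rolls.extend(combat_rng.randint(1, 20) for _ in range(fights_per_level))
    return layouts, rolls


layouts_a, rolls_a = run_episode(layout_seed=42, fights_per_level=2)
layouts_b, rolls_b = run_episode(layout_seed=42, fights_per_level=7)
print(layouts_a == layouts_b)  # layouts match although the combat histories differ
```

Because the two generators never share state, the number of combat draws on one level cannot shift the layout of the next. That is the property the tweet describes for lower-level layouts.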

Likes: 26 · Replies: 1 · Retweets: 3

Mikael Henaff

@henaffmikael

6 months ago

A couple bits of news:

1. Happy to share my first (human) NetHack ascension; next step is RL agents :)

2. I wrote a post discussing some The NetHack Learning Environment challenges & how they map to open problems in RL & agentic AI. Still the best RL benchmark imo.

mikaelhenaff.substack.com/p/first-nethac…

Likes: 49 · Replies: 3 · Retweets: 10

Tim Rocktäschel

@_rockt

5 months ago

Happy "The NetHack Learning Environment is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).