The NetHack Learning Environment (@nethack_le) 's Twitter Profile
The NetHack Learning Environment

@nethack_le

Official handle for the NetHack Learning Environment (arxiv.org/abs/2006.13760)

ID: 1384437325606891521

linkhttps://github.com/heiner/nle calendar_today20-04-2021 09:23:45

95 Tweet

912 Followers

26 Following

Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

Great to see the next keynote by Pierluca D'Oro at the AutoRL workshop also using The NetHack Learning Environment. Motif (arxiv.org/abs/2310.00166) uses LLMs to intrinsically motivate agents, leading to better score in NetHack than training on the environment's extrinsic reward 🤯

Great to see the next keynote by <a href="/proceduralia/">Pierluca D'Oro</a> at the AutoRL workshop also using <a href="/NetHack_LE/">The NetHack Learning Environment</a>. Motif (arxiv.org/abs/2310.00166) uses LLMs to intrinsically motivate agents, leading to better score in NetHack than training on the environment's extrinsic reward 🤯
Martin Klissarov (@martinklissarov) 's Twitter Profile Photo

Can AI agents adapt zero-shot, to complex multi-step language instructions in open-ended environments? We present MaestroMotif, a method for AI-assisted skill design that produces highly capable and steerable hierarchical agents. To the best of our knowledge, it is the first

Mikayel Samvelyan (@_samvelyan) 's Twitter Profile Photo

⚔️ MiniHack Updates! ⚔️ 1️⃣ MiniHack 1.0.0 is here! Following popular demand, it now supports the new Gymnasium API and is built on NLE 1.1.0. Huge thanks to @Stephen_Oman (maintainer of The NetHack Learning Environment ) for his outstanding contribution! 🙌

Stephen Oman (@stephen_oman) 's Twitter Profile Photo

Happy to announce the latest release of The NetHack Learning Environment (version 1.2.0). You can now use the seed function to make the dungeon layout reproducible across training episodes. The in-level interaction and combat is still randomly determined and doesn't impact lower level layouts.

Happy to announce the latest release of <a href="/NetHack_LE/">The NetHack Learning Environment</a> (version 1.2.0). You can now use the seed function to make the dungeon layout reproducible across training episodes. The in-level interaction and combat is still randomly determined and doesn't impact lower level layouts.
Mikael Henaff (@henaffmikael) 's Twitter Profile Photo

A couple bits of news: 1. Happy to share my first (human) NetHack ascension-next step is RL agents :) 2. I wrote a post discussing some The NetHack Learning Environment challenges & how they map to open problems in RL & agentic AI. Still the best RL benchmark imo. mikaelhenaff.substack.com/p/first-nethac…

A couple bits of news:

1. Happy to share my first (human) NetHack ascension-next step is RL agents :) 

2. I wrote a post discussing some <a href="/NetHack_LE/">The NetHack Learning Environment</a>  challenges &amp; how they map to open problems in RL &amp; agentic AI. Still the best RL benchmark imo.  

mikaelhenaff.substack.com/p/first-nethac…
Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

Happy "The NetHack Learning Environment is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).

Happy "<a href="/NetHack_LE/">The NetHack Learning Environment</a> is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).