akjahsgfs
@usgsvok
sha1
ID: 1177988800523444224
28-09-2019 16:49:56
770 Tweets
6 Followers
307 Following
My Reinforcement Learning (RL) & Agents 3 hour workshop is out! I talk about:
1. RL fundamentals & hacks
2. "Luck is all you need"
3. Building smart agents with RL
4. Closed vs Open-source
5. Dynamic 1bit GGUFs & RL in Unsloth AI
6. The Future of Training
youtube.com/watch?v=OkEGJ5…
rankwho ᗜˬᗜ I think this paper explains it quite well. arxiv.org/abs/2103.12726