
Jonny Cook
@jonnycoook
DPhil Student in AI @FLAIR_Ox
Prev. RS Intern @cohere, @DeepMind Scholar
ID: 1452746225074200582
25-10-2021 21:17:53
48 Tweets
305 Followers
514 Following


How can we bypass the need for online hyper-parameter tuning in offline RL? Foerster Lab for AI Research is introducing two fully offline algorithms: SOReL, for accurate offline regret approximation, and TOReL, for offline hyper-parameter tuning! arxiv.org/html/2505.2244…





Theory of Mind (ToM) is crucial for next-gen LLM agents, yet current benchmarks suffer from multiple shortcomings. Enter 💽 Decrypto, an interactive benchmark for multi-agent reasoning and ToM in LLMs! Work done with Timon Willi & Jakob Foerster at AI at Meta & Foerster Lab for AI Research 🧵👇

Antiviral therapy design is myopic 🦠🙈: it is optimised only for the current strain. That's why you need a different flu vaccine every year! Our #ICML2025 paper ADIOS proposes "shaper therapies" that steer viral evolution in our favour & remain effective. Work done at Foerster Lab for AI Research 🧵👇

