Jean Tarbouriech
@jean_tarbou
researcher @googledeepmind | gemini diffusion | phd in rl @inria @metaai | x14 @polytechnique | football, techno dj, scuba diving
ID: 560488803
22-04-2012 16:26:57
41 Tweets
507 Followers
220 Following
Two RL papers accepted at ICLR 2026: * Learning a subspace of policies for online adaptation in RL - with Jean-Baptiste Gaya and Laure Soulier - tinyurl.com/5eh6z5uk * Direct then Diffuse: Incremental Unsupervised Skill Discovery ... - with @pa_kamienny, Jean Tarbouriech, Alessandro Lazaric
Our work on learning exploration strategies without extrinsic rewards was accepted at ICLR 2026! We learn a tree-structured policy that composes skills to reach increasingly far states. Work with Jean Tarbouriech, Alessandro Lazaric, and Ludovic Denoyer. Camera-ready paper coming soon.
Towards a better understanding of unsupervised goal-conditioned RL in our #AISTATS2022 paper! Join us at poster session 4 on Wed, Mar 30 (10:30-12 UTC) AISTATS Conference Adaptive Multi-Goal Exploration arxiv.org/pdf/2111.12045… w/ Omar D., Pierre Ménard, Matteo Pirotta, Michal Valko, Alessandro Lazaric
I’m super excited to share our work on AdA: An Adaptive Agent capable of hypothesis-driven exploration which solves challenging unseen tasks with just a handful of experience, at a similar timescale to humans. sites.google.com/corp/view/adap… See the thread for more details 👇 [1/N]
Really excited about this work! We consider the notion of Bayesian optimality in RL. Computing the exact posterior is computationally intractable, so we derive a new variational approximation that maintains the same low regret. Great work led by Jean Tarbouriech!