Jean Tarbouriech (@jean_tarbou) Twitter Tweets • TwiCopy

Jean Tarbouriech

@jean_tarbou


researcher @googledeepmind | gemini diffusion | phd in rl @inria @metaai | x14 @polytechnique | football, techno dj, scuba diving

ID: 560488803

22-04-2012 16:26:57

41 Tweets

507 Followers

220 Following

Ludovic Denoyer

@ludovicdenoyer

4 years ago

Two RL papers accepted at ICLR 2022:
* Learning a subspace of policies for online adaptation in RL - with Jean-Baptiste Gaya and Laure Soulier - tinyurl.com/5eh6z5uk
* Direct then Diffuse: Incremental Unsupervised Skill Discovery ... - with @pa_kamienny, Jean Tarbouriech, Alessandro Lazaric

42 likes, 2 replies, 7 retweets

Pierre-Alex

@pierrealexai

4 years ago

Our work on learning exploration strategies without extrinsic rewards was accepted at ICLR 2022! We learn a tree-structured policy that composes skills to reach increasingly far states. Work with Jean Tarbouriech, Alessandro Lazaric, Ludovic Denoyer. Camera-ready paper coming soon.

20 likes, 1 reply, 4 retweets

Jean Tarbouriech

@jean_tarbou

4 years ago

Towards a better understanding of unsupervised goal-conditioned RL in our #AISTATS2022 paper! Join us at poster session 4 on Wed, Mar 30 (10:30-12 UTC) at the AISTATS Conference. Adaptive Multi-Goal Exploration: arxiv.org/pdf/2111.12045… w/ Omar D., Pierre Ménard, Matteo Pirotta, Michal Valko, Alessandro Lazaric

33 likes, 2 replies, 4 retweets

Feryal

@feryalmp

3 years ago

I’m super excited to share our work on AdA: An Adaptive Agent capable of hypothesis-driven exploration which solves challenging unseen tasks with just a handful of experience, at a similar timescale to humans. sites.google.com/corp/view/adap… See the thread for more details 👇 [1/N]

1.1K likes, 22 replies, 256 retweets

Brendan O'Donoghue

@bodonoghue85

2 years ago

Excited to present the 'epistemic-risk-seeking actor-critic' - an actor-critic / policy gradient algorithm that performs deep exploration. arxiv.org/abs/2302.09339 Accepted at #ICML2023

113 likes, 2 replies, 22 retweets

Tom Zahavy

@tzahavy

2 years ago

I'm super excited to share AlphaZeroᵈᵇ, a team of diverse #AlphaZero agents that collaborate to solve #Chess puzzles and demonstrate increased creativity. Check out our paper to learn more! arxiv.org/abs/2308.09175 A quick 🧵(1/n)

327 likes, 5 replies, 68 retweets

Brendan O'Donoghue

@bodonoghue85

2 years ago

Really excited about this work! We consider the notion of Bayesian optimality in RL. Computing the exact posterior is computationally intractable, so we derive a new variational approximation that maintains the same low regret. Great work led by Jean Tarbouriech!

48 likes, 2 replies, 8 retweets

Brendan O'Donoghue

@bodonoghue85

6 months ago

Excited to share what my team has been working on lately - Gemini diffusion! We bring diffusion to language modeling, yielding more power and blazing speeds! 🚀🚀🚀 Gemini diffusion is especially strong at coding. In this example the model generates at 2000 tokens/sec,

2.2K likes, 89 replies, 258 retweets

John Lindquist

@johnlindquist

6 months ago

The Future of Development: Gemini Diffusion

934 likes, 18 replies, 88 retweets

Brendan O'Donoghue

@bodonoghue85

4 months ago

We're looking for people to join us to work on Gemini Diffusion and help revolutionize language modeling! Details below: job-boards.greenhouse.io/deepmind/jobs/…

267 likes, 7 replies, 43 retweets