Théo Vincent @ ICLR 2025 (@theo_vincent_) Twitter Tweets • TwiCopy

Théo Vincent @ ICLR 2025

@theo_vincent_

PhD student at @dfki | @ias_tudarmstadt, working on RL 🤖 Previously master student at MVA @ENS_ParisSaclay | ENPC 🎓

ID: 1760086152374063105

Link: https://www.ias.informatik.tu-darmstadt.de/Team/TheoVincent

Joined: 20-02-2024 23:37:10

55 Tweets

105 Followers

236 Following

Yogesh Tripathi

@yogeshtrip7354

2 months ago

Théo Vincent and I recently dived into BBF, a fascinating algorithm that is still poorly understood. Feel free to check out the results of our analysis:

4 likes · 0 replies · 1 retweet

Khurram Javed

I don't disagree that data will fix many current limitations, but unless Figure is willing to visit people's apartments to periodically collect data for their evolving use cases, we won't see a PMF. The scalable fix is to learn in deployment, which requires new algorithms.

17 likes · 0 replies · 1 retweet

Puze LIU

@liu_puze

2 months ago

How do robots thrive in dynamic worlds? Join us at IROS LeapRiDE 2025 Full-day Workshop Oct 20, ROOM 102A, Hangzhou HIEC, China 🎯 From quadruped to humanoid ping-pong & robot soccer 🎓 Speakers from MIT, Berkeley, DeepMind, NTU, NUS, Monash, Tongji, & KU Leuven #IROS25

15 likes · 0 replies · 5 retweets

Cohere Labs

@cohere_labs

2 months ago

Our Reinforcement Learning Group is looking forward to hearing from Calarina Muslimani next week on October 23rd as she presents "Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners". Thanks to Rahul and Gusti T. Winata for organizing.

18 likes · 0 replies · 2 retweets

Nico Bohlinger

@nicobohlinger

2 months ago

I'm presenting four different works at IROS 2025 this week in Hangzhou 🤖

102 likes · 4 replies · 12 retweets

Pablo Samuel Castro

@pcastr

2 months ago

🔊Simplicial Embeddings (SEMs) Improve Sample Efficiency in Actor-Critic Agents🔊 In our recent preprint we demonstrate that the use of well-structured representations (SEMs) can dramatically improve sample efficiency in RL agents. 1/X

68 likes · 3 replies · 15 retweets

Roger Creus Castanyer

@creus_roger

2 months ago

1/9 Can we leverage foundation models for better RL agents? Yes! We introduce Language-Aligned Reward Machines (LARMs) and a framework that uses Foundation Models (FMs) to automatically generate them. This work enables new ways to train RL agents efficiently on complex tasks 🧵

43 likes · 1 reply · 12 retweets

Théo Vincent @ ICLR 2025

@theo_vincent_

2 months ago

Very happy to share that "iterated Q-Network (i-QN)" received a J2C Certification from TMLR🥳 What is i-QN about?👇 x.com/Theo_Vincent_/…

27 likes · 0 replies · 5 retweets

Certified papers at TMLR

@tmlrcert

2 months ago

New #J2CCertification: Iterated $Q$-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning Théo Vincent, Daniel Palenicek, Boris Belousov, Jan Peters, Carlo D'Eramo openreview.net/forum?id=Lt2H8… #reinforcement #iterative #iterations

1 like · 0 replies · 1 retweet

Georgia Chalvatzaki

@georgiachal

2 months ago

I am sharing this post with a heart full of gratitude. I am deeply humbled and honored to be named a recipient of this year's Alfried Krupp Förderpreis. Thank you to the Krupp Stiftung for this prestigious award and for a truly memorable and beautifully organized ceremony. This

44 likes · 3 replies · 3 retweets

Bram Grooten

@bramgrooten

a month ago

The Gran Turismo research that I've worked on at Sony AI just got accepted at #AAAI as an oral! We'll publish it on arXiv soon. Looking forward to seeing you in Singapore!

9 likes · 0 replies · 1 retweet

Bram Grooten

@bramgrooten

a month ago

The SPARC paper is online! (link below) During my internship at Sony AI, we created a policy that can generalize across all cars in Gran Turismo 7. Even to unseen cars, without knowing any vehicle details! Co-authors: Patrick MacAlpine, Kaushik Subramanian, Peter Stone, Peter Wurman

13 likes · 1 reply · 3 retweets

Jie Wang

@jiewang_zjui

a month ago

We are very happy to welcome Jan Peters at GRASP Laboratory, UPenn, to give a talk on “Inductive Biases for Robot Learning.” After the talk, students like me were fortunate to have lunch together. To be honest, at first, I expected heavy math/physics priors, but his message was

45 likes · 1 reply · 4 retweets
