Mariano Phielipp
@mphielipp
Driving the Development of Visual Language Action Models for Next-Generation Humanoid Robots. Views are my own.
ID: 22180100
http://thehumanoid.ai 27-02-2009 19:42:03
748 Tweet
127 Followers
517 Following
Are we using the best representations for OfflineRL? We found that using latent diffusion models work better at capturing the complex multi-modal distribution of Q-values in Offline RL datasets. Learn about the details from Siddarth Venkatraman tomorrow at ICLR 2026 4:30 in Halle B #157
we've shown MoEs help deep RL agents, but what if we turn up non-stationarity to 11 with multi-task and continual RL? We explore this in our paper, led by Timon Willi & Johan Obando-Ceron ππ½ , & w/ Jakob Foerster & Gintare Karolina Dziugaite , accepted RL_Conference ! paper: arxiv.org/abs/2406.18420 1/8