Mariano Phielipp (@mphielipp) Twitter Tweets • TwiCopy

Mariano Phielipp

@mphielipp

+ Follow

Driving the Development of Visual Language Action Models for Next-Generation Humanoid Robots. Views are my own.

ID: 22180100

linkhttp://thehumanoid.ai calendar_today27-02-2009 19:42:03

748 Tweet

127 Followers

517 Following

Glen Berseth

@glenberseth

2 years ago

Are we using the best representations for OfflineRL? We found that using latent diffusion models work better at capturing the complex multi-modal distribution of Q-values in Offline RL datasets. Learn about the details from Siddarth Venkatraman tomorrow at ICLR 2026 4:30 in Halle B #157

thumb_up_off_alt20

chat_bubble_outline0

repeat3

shareShare

Unitree

@unitreerobotics

2 years ago

Daily Training of Robots Driven by RL Segments of daily training for robots driven by reinforcement learning. Multiple tests done in advance for friendly service humans.😊 The training includes some extreme tests, please do not imitate. #AI #Unitree #AGI #EmbodiedIntelligence

thumb_up_off_alt1,1K

chat_bubble_outline103

repeat403

shareShare

Patrick Collison

@patrickc

2 years ago

This morning, Nature published two papers on bridge editing, the new genome engineering technology from @ArcInstitute: nature.com/articles/s4158…, nature.com/articles/s4158…. I'm quite excited about its potential! Since the whole thing is pretty arcane, I fed the blog post

thumb_up_off_alt7,7K

chat_bubble_outline523

repeat1,1K

shareShare

Pablo Samuel Castro

@pcastr

2 years ago

we've shown MoEs help deep RL agents, but what if we turn up non-stationarity to 11 with multi-task and continual RL? We explore this in our paper, led by Timon Willi & Johan Obando-Ceron 👍🏽 , & w/ Jakob Foerster & Gintare Karolina Dziugaite , accepted RL_Conference ! paper: arxiv.org/abs/2406.18420 1/8

thumb_up_off_alt87

chat_bubble_outline5

repeat23

shareShare

Mariano Phielipp

@mphielipp

a year ago

Popular information nobelprize.org/prizes/physics…

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Mariano Phielipp

@mphielipp

a year ago

nGPT: Normalized Transformer with Representation Learning on the Hypersphere. arxiv.org/abs/2410.01131. Remarkable efficient. (reducing the number of training steps required to achieve the same accuracy by a factor of 4 to 20)

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Mariano Phielipp

@mphielipp

a year ago

import brain brain.loading("executive_function") # DEBUG: Insufficient sleep detected. Retrying... # DEBUG: Compensatory mechanisms activated (Efficiency -20%) brain.run("today_tasks") ....

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Mariano Phielipp

@mphielipp

a year ago

🚀 Exciting Opportunity! 🚀 linkedin.com/posts/mariano-…

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Mariano Phielipp

@mphielipp

9 months ago

Please robot… I’m out of toilet paper. No yelling. No awkward moments. Just a smooth, silent rescue. 🧻🤖😂 #robotics #AI #robotsdoingthings #vla #funrobotics

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Nando de Freitas

@nandodf

9 months ago

RL is not all you need, nor attention nor Bayesianism nor free energy minimisation, nor an age of first person experience. Such statements are propaganda. You need thousands of people working hard on data pipelines, scaling infrastructure, HPC, apps with feedback to drive

thumb_up_off_alt1,1K

chat_bubble_outline31

repeat193

shareShare

Generalist

@generalistai_

2 months ago

Read more in our blog post, including early notes from large-scale ablations on our pretraining data. Blog: generalistai.com/blog/nov-04-20…

thumb_up_off_alt74

chat_bubble_outline1

repeat7

shareShare

Sunday

@sundayrobotics

2 months ago

After 18 months in stealth, dozens of prototypes, millions of real-home demonstrations, and one final all-nighter, we’re thrilled for you to say hello to Memo

thumb_up_off_alt2,2K

chat_bubble_outline200

repeat284

shareShare

Xuanbin Peng

@xuanbin_peng

2 months ago

What if a humanoid robot could choose how to interact with the environment 🤖 — soft when it needs compliance, stiff when it needs precision, and force-aware when it must push/pull? That’s exactly what our Heterogeneous Meta-Control (HMC) framework enables. Our new framework

thumb_up_off_alt154

chat_bubble_outline2

repeat45

shareShare

Mariano Phielipp

@mphielipp

2 months ago

"We do this not because it is easy, but because we thought it would be easy" Sunday 😂

"We do this not because it is easy, but because we thought it would be easy" <a href="/sundayrobotics/">Sunday</a> 😂

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare