Johan S. Obando 👍🏽 (@johanobandoc)'s Twitter Profile

Graduate student @Mila_Quebec @UMontrealDIRO | RL/Deep Learning/AI | From Cali/Colombia to the world 🇨🇴 | #JuntosProsperamos ⚡ #TogetherWeThrive | 🌱🌎

ID: 833395922646233088

Link: https://johanobandoc.github.io · Joined: 19-02-2017 19:20:46

2.2K Tweets

1.1K Followers

2.2K Following

Milad Aghajohari (@maghajohari)

Multi-Agent RL fails in real life. Agents cooperating to solve tasks remains a utopia.

-No scalable algorithms for general-sum games. 
-In a simple apple-harvesting game, PPO agents overharvest and ruin bushes.

Advantage Alignment (ICLR 2025 Oral📢) is a huge step forward. 1/n
Milad Aghajohari (@maghajohari)

Want to hear more? Come to our Oral at ICLR. My first co-author Juan is presenting on April 26th.
Paper: arxiv.org/abs/2406.14662
Work done at Mila - Institut québécois d'IA with my great collaborators: Juan Agustin Duque, Tim Cooijmans (@koning_robot), Razvan, Gauthier Gidel and Aaron Courville 8/
Reza Bayat (@reza_byt)

Mohammad, my amazing co-author, will present our work today at ICLR (#304 at 3 p.m.). Don’t miss it, you can learn far more about fundamental research from him than from any paper. I was really fortunate to work with him and to experience a glimpse of the fundamental research

Johan S. Obando 👍🏽 (@johanobandoc)

At #ICLR2025 and interested in the science of deep RL?
2 great papers are being presented today from 3–5:30 PM. 

Don't Flatten, Tokenize! - Spotlight presentation at #363.
Neuroplastic Expansion - Poster presentation at #361.

Don’t miss it, go chat with amazing co-authors!🥳
Roberta Raileanu (@robertarail)

With a stellar lineup of speakers and panelists, including Yoshua Bengio 🙀, the Scaling Self-Improving Foundation Models at ICLR 2025 promises to be 🔥

⏰ Sunday, April 27
📍 Garnet 214-215
Oumar Kaba (@sekoumarkaba)

Happy to have presented this work with Hannah Lawrence @ PetExpo, Vasco Portilheiro and Yan! Thanks to those who came!

Check out the paper to learn about the link between symmetry breaking, equivariant distributions and positional encodings (+experiments on Ising models)
arxiv.org/abs/2503.21985
Sarthak Mittal (@sarthmit)

Come check out the workshop and hear about novel works and contributions from an exciting lineup of speakers and panelists!

Lynn Cherif (@lynncherif)

Come check out our poster at #ICLR2025 at The Third Workshop in Deep Learning for Code Workshop (@DL4C)!

⏰ Mon, Apr 28  3pm-4:30pm – Garnet 218-219

I unfortunately won’t be there but my fantastic advisor @Khimya will be presenting our new paper – Cracking the Code of Action: a
Tim Rocktäschel (@_rockt)

Jack Parker-Holder on how we scaled up foundation world models at Google DeepMind at the World Model Understanding, Modelling, and Scaling workshop at ICLR 2025 sites.google.com/view/worldmode…
Johan S. Obando 👍🏽 (@johanobandoc)

🥳 Come chat with Brian Bartoldson and Moksh Jain at our TBA poster at the #ICLR25 workshop on Open Science for Foundation Models (SCI-FM). The workshop will be held in EXPO Hall 4 #5 on Monday, April 28th.
Shivalika Singh (@singhshiviii)

LMArena is widely used for model evaluation, but is it measuring true progress? 🔮

In our work, "The Leaderboard Illusion", we reveal:
🔒 Private testing
📊 Data access asymmetries
⚠️ Overfitting risks
🚫 Silent deprecations

Despite best intentions, arena policies favor a few!
Stephanie Chan (@scychan_brains)

Some years ago, I got trapped in a Massive Trough of Imposter Syndrome. It took more than a year to dig myself out of it, but the following framework really helped me. It feels a bit vulnerable to share, but I hope it might help a few others too! A short thread 🧵🙂

Johan S. Obando 👍🏽 (@johanobandoc)

🚨 Reminder: Submissions for RL_Conference's Finding the Frame are due May 30 (AoE)! Don’t miss your chance to be part of this unique workshop! 🤖🧠