Dane Malenfant (@dvnxmvl_hdf5)'s Twitter Profile
Dane Malenfant

@dvnxmvl_hdf5

MSc. Computer Science @Mila_Quebec & @mcgillu in the LiNC lab | Currently distracted with multi-agent RL and neuroAI | Restless | Ēka ē-akimiht

ID: 1733325744187539456

Link: https://danemalenfant.com/ · Joined: 09-12-2023 03:21:01

128 Tweets

101 Followers

218 Following

NYU Center for Data Science (@nyudatascience):

CDS Professor Yann LeCun sees the end of large language models, claiming they'll be obsolete in five years. In Newsweek, he explains why current AI lacks real-world understanding, and what a smarter system could look like. newsweek.com/ai-impact-inte…

Pablo Samuel Castro (@pcastr):

The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks

Thrilled to share our #ICML2025 paper, led by Walter Mayor-Toro & Johan S. Obando 👍🏽, with Aaron Courville, where we explore how data collection affects agents in parallelized setups.
1/
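
As context for what "parallelized data collection" means mechanically, here is a minimal on-policy rollout loop using Gymnasium's vector API. The environment, worker count, and the random action sampler standing in for the policy are illustrative assumptions, not the paper's setup.

import gymnasium as gym

num_envs = 8  # more parallel workers -> larger, more decorrelated on-policy batches
envs = gym.vector.SyncVectorEnv(
    [lambda: gym.make("CartPole-v1") for _ in range(num_envs)]
)

obs, info = envs.reset(seed=0)
batch = []
for _ in range(128):  # fixed-length on-policy rollout
    actions = envs.action_space.sample()  # stand-in for sampling from the current policy
    next_obs, rewards, terminated, truncated, info = envs.step(actions)
    batch.append((obs, actions, rewards, terminated | truncated))
    obs = next_obs  # vector envs auto-reset finished episodes

envs.close()
print(f"collected {len(batch) * num_envs} transitions from {num_envs} workers")
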
Nanda H Krishna (@nandahkrishna):

New preprint! 🧠🤖
How do we build neural decoders that are:
⚡️ fast enough for real-time use
🎯 accurate across diverse tasks
🌍 generalizable to new sessions, subjects, and species?
We present POSSM, a hybrid SSM architecture that optimizes for all three of these axes!
🧵1/7
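
For readers unfamiliar with SSM decoders, here is a toy sketch of the general idea: a cheap linear state-space recurrence (constant cost per step, hence real-time friendly) followed by a small readout. All names, shapes, and the single-layer design are illustrative assumptions; POSSM's actual hybrid architecture is specified in the preprint.

import torch
import torch.nn as nn

class TinySSMDecoder(nn.Module):
    def __init__(self, n_channels: int, d_state: int, n_outputs: int):
        super().__init__()
        self.in_proj = nn.Linear(n_channels, d_state)
        self.log_a = nn.Parameter(torch.zeros(d_state))  # learnable diagonal transition
        self.readout = nn.Sequential(
            nn.Linear(d_state, d_state), nn.GELU(), nn.Linear(d_state, n_outputs)
        )

    def forward(self, x):  # x: (batch, time, n_channels), e.g. binned spike counts
        a = torch.sigmoid(self.log_a)   # keep the decay in (0, 1) for stability
        u = self.in_proj(x)
        h = torch.zeros_like(u[:, 0])
        outs = []
        for t in range(u.shape[1]):     # streamable scan: O(1) state per step
            h = a * h + u[:, t]         # linear SSM state update
            outs.append(self.readout(h))
        return torch.stack(outs, dim=1)  # (batch, time, n_outputs)

decoder = TinySSMDecoder(n_channels=96, d_state=128, n_outputs=2)
y = decoder(torch.randn(4, 50, 96))  # fake spike features -> e.g. 2-D cursor velocity
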
Majdi Hassan (@majdi_has):

(1/n) 🚨 You can train a model to solve DFT for any geometry almost without training data! 🚨 Introducing Self-Refining Training for Amortized Density Functional Theory, a variational framework for learning a DFT solver that predicts the ground-state solutions for different…
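
A heavily simplified sketch of the self-refining idea described above: the model's own predictions are scored by a differentiable variational objective, and that score is the training signal, so no labeled reference solutions are needed. The tiny network and the placeholder variational_energy function are stand-ins, not the paper's DFT objective.

import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(16, 64), nn.SiLU(), nn.Linear(64, 8))
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

def variational_energy(geometry, solution):
    # Placeholder for a differentiable energy functional E[geometry, solution];
    # minimizing it drives the amortized solver toward ground-state solutions.
    return ((solution - geometry.mean(dim=-1, keepdim=True)) ** 2).sum(-1)

for step in range(1000):
    geometry = torch.randn(32, 16)  # sample geometries; no labels required
    solution = model(geometry)      # amortized prediction of the solution
    loss = variational_energy(geometry, solution).mean()
    opt.zero_grad()
    loss.backward()  # training signal comes from the variational objective,
    opt.step()       # not from supervised reference data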

Emiliano Penaloza (@emilianopp_):

Excited that our paper "Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization" was accepted to ICML 2025! We show how Preference Optimization can reduce the impact of noisy concept labels in CBMs. 🧵/9
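
To make the mechanism concrete, here is a generic preference-optimization loss over concept predictions: rather than regressing each (possibly noisy) concept label, the model is trained to prefer one annotation over another. The Bradley-Terry-style scoring and the random toy data are illustrative assumptions, not the paper's exact objective.

import torch
import torch.nn.functional as F

def concept_preference_loss(concept_logits, preferred, dispreferred):
    def score(labels):  # score an annotation by its log-likelihood under the model
        return -F.binary_cross_entropy_with_logits(
            concept_logits, labels, reduction="none"
        ).sum(dim=-1)
    # Push score(preferred) above score(dispreferred).
    return -F.logsigmoid(score(preferred) - score(dispreferred)).mean()

logits = torch.randn(4, 10, requires_grad=True)  # 10 binary concepts
clean = torch.randint(0, 2, (4, 10)).float()     # preferred annotation
noisy = torch.randint(0, 2, (4, 10)).float()     # dispreferred (noisy) annotation
concept_preference_loss(logits, clean, noisy).backward()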

Benno Krojer (@benno_krojer):

Excited to share the results of my internship research with AI at Meta, as part of a larger world modeling release!

What subtle shortcuts are VideoLLMs taking on spatio-temporal questions?

And how can we instead curate shortcut-robust examples at large scale?

Details 👇🔬
Benjamin Thérien (@benjamintherien):

Tired of tuning hyperparameters? Introducing PyLO! We're bringing hyperparameter-free learned optimizers to PyTorch with drop-in torch.optim support and faster step times thanks to our custom CUDA kernels. Check out our code here: github.com/Belilovsky-Lab…
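
"Drop-in torch.optim support" means the learned optimizer slots into an ordinary training loop. The import and class name below are hypothetical placeholders (check the linked repo for the real ones); the loop itself is the standard torch.optim contract.

import torch
import torch.nn as nn
# from pylo import LearnedOptimizer  # hypothetical name; see the repo for the real import

model = nn.Linear(32, 1)
# opt = LearnedOptimizer(model.parameters())  # no lr/betas to tune, by design
opt = torch.optim.SGD(model.parameters(), lr=0.1)  # stand-in so this sketch runs

x, y = torch.randn(64, 32), torch.randn(64, 1)
for _ in range(10):
    loss = nn.functional.mse_loss(model(x), y)
    opt.zero_grad()
    loss.backward()
    opt.step()  # same step/zero_grad contract as any torch.optim optimizer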

Wilka Carvalho (@cogscikid):

To help computational cognitive scientists engage with more naturalistic experiments, I've made NiceWebRL, a Python library for designing human-subject experiments that leverage machine reinforcement learning environments. github.com/KempnerInstitu…
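
The pattern NiceWebRL targets, in miniature: put a human in the agent's seat of a machine-RL environment and log the trajectory. This console-input sketch shows only the underlying idea; NiceWebRL's actual web-based API is in the linked repo and is not reproduced here.

import gymnasium as gym

env = gym.make("CartPole-v1")
obs, info = env.reset(seed=0)
trajectory = []
done = False
while not done:
    action = int(input("action (0=left, 1=right): "))  # the human is the policy
    obs, reward, terminated, truncated, info = env.step(action)
    trajectory.append((obs, action, reward))  # behavioral data for later analysis
    done = terminated or truncated
env.close()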

Roger Creus Castanyer (@creus_roger):

🚨 Excited to share our new work: "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning"! 📈

We propose gradient interventions that enable stable, scalable learning, achieving significant performance gains across agents and environments!

Details below 👇
Johan S. Obando 👍🏽 (@johanobandoc):

🚨 Excited to share our "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning" work. 🥳 We tackle gradient instability in large deep RL networks, enabling stable and scalable learning with strong performance across the board. 📄 Paper: arxiv.org/abs/2506.15544
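
The paper's specific interventions are in the arXiv link above; the sketch below shows only where a gradient intervention sits in a deep RL update: measure gradient statistics, rescale, then step. Global-norm clipping is used here purely as a familiar example, and the network shapes are arbitrary.

import torch
import torch.nn as nn

net = nn.Sequential(nn.Linear(128, 512), nn.ReLU(), nn.Linear(512, 18))
opt = torch.optim.Adam(net.parameters(), lr=3e-4)

def update(loss, max_grad_norm=0.5):
    opt.zero_grad()
    loss.backward()
    # Intervention point: rescale gradients before the optimizer step so a
    # single batch cannot destabilize a large network.
    total_norm = nn.utils.clip_grad_norm_(net.parameters(), max_grad_norm)
    opt.step()
    return float(total_norm)  # worth logging: spikes here flag instability

loss = net(torch.randn(256, 128)).pow(2).mean()  # stand-in for an actual RL loss
print("pre-clip grad norm:", update(loss))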