
Daniel Jiang
@danielrjiang
Research Scientist @Meta, Adjunct Professor at University of Pittsburgh. PhD from @Princeton ORFE. Decision making under uncertainty.
ID: 23724401
http://danielrjiang.github.io 11-03-2009 04:50:06
197 Tweet
774 Followers
1,1K Following

Excited to share "Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank", with my former Meta team (Wenhao Zhan, Yonathan Efroni, Daniel Jiang) in collaboration with FAIR (Scott Fujimoto) & Princeton (Jason Lee), is accepted to ICLR 2025!


💫Accepted to ICLR25! 💫 We investigate a special MARL structure in which agents weakly interact. This, we show, makes MARL much more tractable. Led by Wenhao Zhan in his summer internship + it was a delight working on this, and expect to see cool extensions ahead!






we actually started by asking this question in the multi-armed / tabular RL, and after spending some time on it realized it has been explored already by Chris Dann, Yishay Mansour, Mehryar Mohri: proceedings.mlr.press/v202/dann23a.h…




Wrapped up Stanford CS336 (Language Models from Scratch), taught with an amazing team Tatsunori Hashimoto Marcel Rød Neil Band Rohith Kuditipudi. Researchers are becoming detached from the technical details of how LMs work. In CS336, we try to fix that by having students build everything:


Honored that our RL_Conference paper won the Outstanding Paper Award on Empirical Reinforcement Learning Research! 📜Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-Functions 📎openreview.net/forum?id=H3jcT… Grateful to my advisors Joseph Lim and Erdem Bıyık!
