Daniel Jiang (@danielrjiang) Twitter Tweets • TwiCopy

Daniel Jiang

@danielrjiang

+ Follow

Research Scientist @Meta, Adjunct Professor at University of Pittsburgh. PhD from @Princeton ORFE. Decision making under uncertainty.

ID: 23724401

linkhttp://danielrjiang.github.io calendar_today11-03-2009 04:50:06

197 Tweet

774 Takipçi

1,1K Takip Edilen

Zheqing (Bill) Zhu

@zheqingzhu

a year ago

Excited to share "Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank", with my former Meta team (Wenhao Zhan, Yonathan Efroni, Daniel Jiang) in collaboration with FAIR (Scott Fujimoto) & Princeton (Jason Lee), is accepted to ICLR 2025!

Excited to share "Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank", with my former Meta team (<a href="/zhan_wenhao/">Wenhao Zhan</a>, <a href="/EfroniYonathan/">Yonathan Efroni</a>, <a href="/danielrjiang/">Daniel Jiang</a>) in collaboration with FAIR (Scott Fujimoto) & Princeton (<a href="/jasondeanlee/">Jason Lee</a>), is accepted to ICLR 2025!

thumb_up_off_alt15

chat_bubble_outline1

repeat4

shareShare

Yonathan Efroni

@efroniyonathan

a year ago

💫Accepted to ICLR25! 💫 We investigate a special MARL structure in which agents weakly interact. This, we show, makes MARL much more tractable. Led by Wenhao Zhan in his summer internship + it was a delight working on this, and expect to see cool extensions ahead!

thumb_up_off_alt39

chat_bubble_outline0

repeat3

shareShare

Daniel Russo

@danielruss0

9 months ago

There are multiple postdoc positions available as part of an exciting new AI-agent initiative at Columbia that tackles challenges at the frontier of agentic systems and sequential decision-making. I am not very active here so please help me spread the word!

thumb_up_off_alt56

chat_bubble_outline1

repeat19

shareShare

Yonathan Efroni

@efroniyonathan

7 months ago

Meet us (but not me sadly) at the poster session: iclr.cc/virtual/2025/p… #ICLR2025 (Also, much more interesting things to explore in MARL and offline MARL imo)

thumb_up_off_alt32

chat_bubble_outline1

repeat5

shareShare

Daniel Jiang

@danielrjiang

7 months ago

At ICLR 2025 in Singapore, my co-authors and I presented two papers on RL. Feel free to let us know of any feedback and let me know if you'd like to chat! - openreview.net/forum?id=AOlm4… - openreview.net/forum?id=BfUDZ…

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

Dr. Karen Ullrich

@karen_ullrich

7 months ago

Aligned Multi-Objective Optimization (A-🐮) has been accepted at #ICML2025! 🎉 We explore optimization scenarios where objectives align rather than conflict, introducing new scalable algorithms with theoretical guarantees. #MachineLearning #AIResearch #Optimization #MLCommunity

thumb_up_off_alt86

chat_bubble_outline3

repeat12

shareShare

Yonathan Efroni

@efroniyonathan

7 months ago

accepted to #ICML25🍁 we asked 🤔 how can we improve gradient-descent in the presence of multiple aligned or similar objectives?🤔 this becomes increasingly important when having access to multiple reward functions / datasets / tasks

thumb_up_off_alt36

chat_bubble_outline1

repeat6

shareShare

Yonathan Efroni

@efroniyonathan

7 months ago

we actually started by asking this question in the multi-armed / tabular RL, and after spending some time on it realized it has been explored already by Chris Dann, Yishay Mansour, Mehryar Mohri: proceedings.mlr.press/v202/dann23a.h…

thumb_up_off_alt5

chat_bubble_outline1

repeat1

shareShare

Zheqing (Bill) Zhu

@zheqingzhu

7 months ago

Excited to share that the paper I worked with my previous team on at Meta was accepted to #ICML2025 ! It’s important to explore multi-objective optimization especially in the aligned goal setting. This can be particularly useful in the LLM and agent context where the models are

thumb_up_off_alt6

chat_bubble_outline0

repeat1

shareShare

AI at Meta

@aiatmeta

7 months ago

CATransformers is a carbon-driven neural architecture and system hardware co-design framework. Using CATransformers, we discover greener CLIP models that achieve an average of 9.1% reduction potential in total lifecycle carbon emissions while maintaining accuracy (or increasing

thumb_up_off_alt324

chat_bubble_outline15

repeat69

shareShare

Mina Lee

@minalee__

6 months ago

What does it mean to write and think with AI? What new possibilities and challenges does that bring? I spoke with THE AI (in Korean) about our group's research and the future of writing with AI. 👩🤖✍️ newstheai.com/news/articleVi…

thumb_up_off_alt61

chat_bubble_outline2

repeat10

shareShare

Percy Liang

@percyliang

6 months ago

Wrapped up Stanford CS336 (Language Models from Scratch), taught with an amazing team Tatsunori Hashimoto Marcel Rød Neil Band Rohith Kuditipudi. Researchers are becoming detached from the technical details of how LMs work. In CS336, we try to fix that by having students build everything:

thumb_up_off_alt3,3K

chat_bubble_outline31

repeat323

shareShare

John Horton

@johnjhorton

4 months ago

Use the same Expected Parrot: Open-source tools for AI research survey you ran w/ LLMs with actual humans via our Prolific integration (or bring your own panel) - it works!

Use the same <a href="/ExpectedParrot/">Expected Parrot: Open-source tools for AI research</a> survey you ran w/ LLMs with actual humans via our <a href="/Prolific/">Prolific</a> integration (or bring your own panel) - it works!

thumb_up_off_alt7

chat_bubble_outline2

repeat2

shareShare

Ayush Jain

@ayushj240

4 months ago

Honored that our RL_Conference paper won the Outstanding Paper Award on Empirical Reinforcement Learning Research! 📜Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-Functions 📎openreview.net/forum?id=H3jcT… Grateful to my advisors Joseph Lim and Erdem Bıyık!

Honored that our <a href="/RL_Conference/">RL_Conference</a> paper won the Outstanding Paper Award on Empirical Reinforcement Learning Research!

📜Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-Functions
📎openreview.net/forum?id=H3jcT…

Grateful to my advisors <a href="/JosephLim_AI/">Joseph Lim</a> and <a href="/ebiyik_/">Erdem Bıyık</a>!

thumb_up_off_alt72

chat_bubble_outline9

repeat9

shareShare