Mickel Liu (@mickel_liu)'s Twitter Profile
Mickel Liu

@mickel_liu

PhD student @uwcse/@uwnlp · Incoming @AIatMeta FAIR · I do LLM+RL · Prev: @pkucfcs2017, @uoftengineering

ID: 1551803307622305793

https://mickel-liu.github.io/ · 26-07-2022 05:35:28

289 Tweets

275 Followers

378 Following

Mickel Liu (@mickel_liu)

I will present this work at the ICML Multi-Agent System (MAS) workshop during the poster sessions. If you are interested in this work or self-play LLMs in general, please feel free to come chat with me!

Multi-Turn Interaction LLM Workshop @ NeurIPS 2025 (@mti_neurips)

🚀 Call for Papers — NeurIPS Conference 2025 Workshop Multi-Turn Interactions in LLMs 📅 December 6/7 · 📍 San Diego Convention Center Join us to shape the future of interactive AI. Topics include but are not limited to: 🧠 Multi-Turn RL for Agentic Tasks (e.g., web & GUI agents,

Mickel Liu (@mickel_liu)

If you are interested in becoming a reviewer for the workshop, feel free to sign up here (expected workload: 2~3): docs.google.com/forms/d/e/1FAI…

Zhengzhong Tu (@_vztu)

Dear NeurIPS Conference PCs, I don't understand why we still need reviewers and area chairs if PCs are finally going to take over and overturn the AC decision without providing any reason, whereby our weeks of effort spent on rebuttals (both authors and reviewers) have been ignored.

Shangbin Feng (@shangbinfeng)

Seems to me the "findings of *ACL" decision is really wise and visionary (5 years ago), compared to the "had to reject because of venue limit" thing

Stanford NLP Group (@stanfordnlp)

Hi everyone! We're looking forward to the first NLP Seminar of the year! For this week's seminar, we are excited to host Tong Chen (@tomchen0) from University of Washington! If you are interested in attending remotely, please fill out the form below: forms.gle/E1iL719njyG1Nf…

Bo Liu (Benjamin Liu) (@benjamin_eecs)

Thanks for the tweet AK! We designed a game that works with ANY image pairs - synthetic scenes, charts, real photos. Self-play on these arbitrary visual inputs improves reasoning across the board. Scalable visual reasoning improvement without manual curation :)

Stella Li (@stellalisy)

🚨What if solving a problem correctly isn't enough—cuz the WAY to reason about it based on your audience matters just as much⁉️ We introduce ✨personalized reasoning✨: proactively asking user preferences and adapting HOW models think Frontier models are not doing well at this!🧵

Kunal Jha (@kjha02)

Forget modeling every belief and goal! What if we represented people as following simple scripts instead (e.g., "cross the crosswalk")? Our new paper shows AI which models others’ minds as Python code 💻 can quickly and accurately predict human behavior! shorturl.at/siUYI 🧵

Zichen Liu @ ICLR2025 (@zzlccc)

6 months after our paper release, I still recall the debates on removing the length normalization term in DrGRPO. And people gradually think DrGRPO is just about removing the std, ignoring the most important and subtle (length) bias we tried to point out to the community. Even

Joongwon Kim (@danieljwkim)

Excited to share Prompt Curriculum Learning (PCL) from AI at Meta - we improve performance-efficiency tradeoffs for reasoning RL by predicting prompt difficulty with a value model updated on-policy, and selecting intermediate-difficulty prompts that yield high effective ratios.

Mickel Liu (@mickel_liu)

I am at #COLM2025 🇨🇦 this week. Happy to chat about RL fine-tuning of LLMs, multiagent learning, and multi-LLM training. I will be giving a talk at the AI Agent workshop on Friday afternoon.

Mickel Liu (@mickel_liu)

Great paper! Thanks for citing our Self-play work [arxiv.org/pdf/2506.07468]! Glad to see the continued endeavor of multiagent training for safety alignment.

Asli Celikyilmaz (@real_asli)

🚀 Exciting opportunity! We are hiring research interns (current PhD students) at Meta FAIR to advance multi-agent, multimodal AI! Work on text, audio, images & more, collaborate with top mentors, and help shape the future of AI at scale. Apply: metacareers.com/jobs/182171308…

Guy Davidson (@guyd33)

My team at FAIR at Meta is recruiting interns for next summer! If you're a PhD student interested in questions around theory of mind in language models for social, multi-agent settings, and have relevant background and/or experience: metacareers.com/jobs/182171308…

Natasha Jaques (@natashajaques)

Our latest work proposes a new metric for measuring deception in LLMs, based on whether the LLM causes the listener's beliefs to become less accurate. This correlates more strongly with human judgements of deception than 4 existing methods. We find that LLMs frequently engage in