Mufei Li (@mufei_li) Twitter Tweets • TwiCopy

Michael Galkin

4 months ago

📣 Our spicy ICML 2025 position paper: “Graph Learning Will Lose Relevance Due To Poor Benchmarks”. Graph learning is less trendy in the ML world than it was in 2020-2022. We believe the problem is in poor benchmarks that hold the field back - and suggest ways to fix it! 🧵1/10

Likes: 284 · Replies: 4 · Retweets: 51

Xinyu Yang

@xinyu2ml

4 months ago

🌟 Announcing the 2nd Workshop on Reliable and Responsible Foundation Models (R2-FM) at @icml_conf 2025 (July 13–19, Vancouver)! 📣 We welcome submissions! Submit your work here: openreview.net/group?id=ICML.… 🗓️ Deadline: May 29th, 2025 (AOE) 🔗 Website: r2-fm.github.io 💬

Likes: 23 · Replies: 0 · Retweets: 6

Xavier Bresson

@xbresson

4 months ago

Bill Clinton invested $3B in 1990 for the first de novo human genome ~95%. In 2022, T2T achieved the first 100% assembly. Our AI assembler is the first of its kind - demonstrating this challenge can be solved w/ more data and compute. A call to industry: let's finish the job!

Likes: 59 · Replies: 2 · Retweets: 11

Alex Dimakis

@alexgdimakis

4 months ago

"RL with only one training example" and "Test-Time RL" are two recent papers that I found fascinating. In the "One Training example" paper the authors find one question and ask the model to solve it again and again. Every time, the model tries 8 times (the Group in GRPO), and

Likes: 1.1K · Replies: 38 · Retweets: 190

NVIDIA AI Developer

@nvidiaaidev

4 months ago

🎉 Congratulations to the FlashInfer team – their technical paper, "FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving," just won best paper at #MLSys2025. 🏆 🙌 We are excited to share that we are now backing FlashInfer – a supporter and

Likes: 202 · Replies: 4 · Retweets: 46

Omar Khattab

@lateinteraction

3 months ago

Sigh, it's a bit of a mess. Let me just give you guys the full nuance in one stream of consciousness since I think we'll continue to get partial interpretations that confuse everyone. All the little things I post need to always be put together in one place. First, I have long

Likes: 573 · Replies: 18 · Retweets: 79

Yuchen Zhu

@yuchen4975

3 months ago

🚀 New Paper at #icml2025🚀 Diffusion models have succeeded in handling data of various modalities, such as 🖼️images (continuous), 💬languages (discrete), and 🌐manifolds (Riemannian), etc. 🤔 But how about data of 𝐦𝐢𝐱𝐞𝐝-𝐦𝐨𝐝𝐚𝐥𝐢𝐭𝐲? We investigate this question in

Likes: 145 · Replies: 9 · Retweets: 30

Michael Galkin

@michael_galkin

2 months ago

Hey, we built a Graph Foundation Model at Google and it's showing some very promising results! Read more in the blogpost and also catch me and Bryan Perozzi at the ICML Expo Talk next Monday. Happy to carry the Graph Learning flag ⛳️

Likes: 258 · Replies: 3 · Retweets: 37

Zhengzhong Tu

@_vztu

2 months ago

I had a feeling, based on my past years of AI R&D experience in both industry and academia, that the AI technology stack evolves at an increasingly faster rate as the AI technology itself advances. This 'feeling' now has a formal definition - AI 4 S̶c̶i̶e̶n̶t̶i̶f̶i̶c̶AI Research

Likes: 19 · Replies: 1 · Retweets: 1

Guohao Li (Hiring!) 🐫

@guohao_li

a month ago

Introducing Eigent — the first multi-agent workforce on your desktop. Eigent is a team of AI agents collaborating to complete complex tasks in parallel. It is your long-term working partner with fully customizable workers and MCPs. Public beta available to download for macOS,

Likes: 672 · Replies: 135 · Retweets: 136

Yuanqi Du

@yuanqid

a month ago

We are calling for reviewers! forms.gle/oFrpazyy2WKFGF… Spread to the expert in generative modeling and probabilistic inference!

Likes: 9 · Replies: 2 · Retweets: 3

Gabriele Berton

@gabriberton

a month ago

Some advice to anyone starting a PhD in ML, or things that I heard from more experienced researchers and I tried to follow: 1) focus on a real problem. Something tangible, that can benefit people. Talk to industry folks if you're looking for open problems. Talk to the end (1/8)

Likes: 938 · Replies: 7 · Retweets: 74

Zihao Ye

@ye_combinator

a month ago

🚀 Excited to announce day-0 support from NVIDIA AI Developer for OpenAI's gpt-oss model in flashinfer v0.2.10! github.com/flashinfer-ai/… ✅ Speed-of-light Blackwell mxfp4/mxfp8 MoE kernels + attention-sink from trtllm-gen ✅ FA2/FA3 template-based attention-sink support for earlier

Likes: 73 · Replies: 1 · Retweets: 8

Francesco Locatello

@francescolocat8

a month ago

Xin Eric Wang PC of the DB track here! Please kindly remind the reviewer that contributing new methods is not necessary for the DB track and ask the AC to take a closer look in a private message. If they don’t reply by the end of the discussion, shoot us an email and we can ping them/check too

Likes: 103 · Replies: 3 · Retweets: 3

Molei Tao

@moleitaomath

a month ago

Georgia Tech AI4Science Center is soft launched, and I'm excited to be an Associate Director. ai4science.ai.gatech.edu Collaboration+Participation of all kinds are welcomed. Please get in touch! Thanks to gtsciences for supports. Retweets appreciated! Georgia Tech #AI4Science

Likes: 116 · Replies: 4 · Retweets: 18

Dimitris Papailiopoulos

@dimitrispapail

24 days ago

Thinking about model generalization is quite painful. We observe empirically that models trained with SGD on cross-entropy generalize, instead of just memorize the train data. Even when they have sufficient capacity to memorize. We do not---i repeat--- we. do. not. have a

Likes: 728 · Replies: 74 · Retweets: 57

jack morris

@jxmnop

13 days ago

first i thought scaling laws originated in OpenAI (2020) then i thought they came from Baidu (2017) now i am enlightened: Scaling Laws were first explored at Bell Labs (1993)

Likes: 1.1K · Replies: 39 · Retweets: 97

Guohao Li (Hiring!) 🐫

@guohao_li

12 days ago

The challenge for starting agent RL research is that very few are willing to do the less glamorous but essential work. Students I worked with usually want to dive straight into training agents or experimenting with RL algorithms. They want to invent the most beautiful new “PPO,

Likes: 417 · Replies: 20 · Retweets: 36

Zifan (Sail) Wang

@_zifan_wang

10 days ago

Excited to share this new paper led by my intern Neil Kale on evaluating the adversarial robustness of the monitoring system in detecting misbehavior from autonomous agents' trajectories (e.g., CoTs + actions). It is a quite long paper with detailed setup and many empirical

Likes: 24 · Replies: 1 · Retweets: 3

Sean Welleck

@wellecks

3 days ago

Excited to teach Advanced NLP at CMU again this semester! Slides are on the course page as the course proceeds: cmu-l3.github.io/anlp-fall2025/ Lectures will be uploaded to Youtube: youtube.com/playlist?list=…

Likes: 569 · Replies: 5 · Retweets: 91