Marvin Li (@marvin_li03) 's Twitter Profile
Marvin Li

@marvin_li03

Harvard '25 | Building theory for generative models

ID: 4858886861

http://marvinfli.com · Joined 29-01-2016 03:50:38

70 Tweets

183 Followers

524 Following

Cheng Lu (@clu_cheng) 's Twitter Profile Photo

Excited to share our latest research progress (joint work with Yang Song): Consistency models can now scale stably to ImageNet 512x512 with up to 1.5B parameters using a simplified algorithm, and our 2-step samples closely approach the quality of diffusion models.
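For readers wondering what "2-step samples" means operationally: generic multistep consistency sampling applies the consistency function once from pure noise, re-noises the estimate to an intermediate noise level, and applies it again. A minimal sketch under that assumption; `consistency_fn` and the noise levels below are placeholders, not this paper's actual interface.

```python
import numpy as np

def two_step_consistency_sample(consistency_fn, shape, sigma_max=80.0,
                                sigma_mid=0.8, sigma_min=0.002, seed=0):
    """Generic two-step consistency sampling (multistep sampling with one
    intermediate noise level). `consistency_fn(x, sigma)` is assumed to map a
    noisy input at noise level sigma directly to a clean sample."""
    rng = np.random.default_rng(seed)

    # Step 1: start from pure noise at the highest noise level and denoise once.
    x = sigma_max * rng.standard_normal(shape)
    x0 = consistency_fn(x, sigma_max)

    # Step 2: re-noise the estimate to an intermediate level and denoise again.
    z = rng.standard_normal(shape)
    x_mid = x0 + np.sqrt(sigma_mid**2 - sigma_min**2) * z
    return consistency_fn(x_mid, sigma_mid)
```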

Max Simchowitz (@max_simchowitz) 's Twitter Profile Photo

There’s a lot of awesome research about LLM reasoning right now. But how is learning in the physical world 🤖 different from learning in language 📚? In a new paper, we show that imitation learning in continuous spaces can be exponentially harder than for discrete state spaces, even when

Demi Guo (@demi_guo_) 's Twitter Profile Photo

We had this vision a year ago, and it’s hard to believe how many dreams have come true since we filmed this video last summer. So much has changed—but one thing has stayed the same: our commitment to building a product for everyone, and giving people the power to create their own

Ilia Shumailov🦔 (@iliaishacked) 's Twitter Profile Photo

Are modern large language models (LLMs) vulnerable to privacy attacks that can determine whether given data was used for training? Models and datasets are quite large; what should we even expect? Our new paper looks into this exact question. 🧵 (1/10)
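The tweet does not spell out the attack family, but the simplest membership-inference baseline in this setting is a loss threshold: score a candidate sequence by the model's average negative log-likelihood and flag unusually low-loss text as a likely training member. A minimal sketch of that baseline (not necessarily the paper's method); `log_probs` is assumed to be per-token log-likelihoods you have already computed.

```python
import numpy as np

def mia_loss_score(log_probs):
    """Loss-based membership-inference score: the average negative
    log-likelihood the model assigns to a candidate sequence. Lower values
    (the model finds the text unusually easy) are treated as evidence the
    sequence was in the training set."""
    return -float(np.mean(log_probs))

def predict_member(log_probs, threshold):
    """Flag the sequence as a training member if its score falls below a
    threshold calibrated on known non-member data."""
    return mia_loss_score(log_probs) < threshold
```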

Aayush Karan (@aakaran31) 's Twitter Profile Photo

Steering diffusion models with external rewards has recently led to exciting results, but what happens when the reward is inherently difficult? Introducing ReGuidance: a simple algorithm to (provably!) boost your favorite guidance method on hard problems! 🚀🚀🚀 A thread: (1/n)

Rylan Schaeffer (@rylanschaeffer) 's Twitter Profile Photo

A bit late to the party, but our paper on predictable inference-time / test-time scaling was accepted to #icml2025 🎉🎉🎉 TLDR: Best of N was shown to exhibit power (polynomial) law scaling (left), but the maths suggests one should expect exponential scaling (center). We show how to
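The power-law-vs-exponential tension can be seen from the elementary best-of-N identity (my gloss of the standard argument, not the paper's derivation): with independent samples that each succeed with probability p,

```latex
\Pr[\text{all } N \text{ samples fail}] = (1 - p)^{N} = e^{N \log(1 - p)},
```

so the failure rate should decay exponentially in N, whereas empirical fits often report a power law in N.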

Giannis Daras (@giannis_daras) 's Twitter Profile Photo

Announcing Ambient Diffusion Omni — a framework that uses synthetic, low-quality, and out-of-distribution data to improve diffusion models. State-of-the-art ImageNet performance. Strong text-to-image results in just 2 days on 8 GPUs. Filtering ❌ Clever data use ✅

Ed Turner (@edturner42) 's Twitter Profile Photo

1/8: The Emergent Misalignment paper showed LLMs trained on insecure code then want to enslave humanity...?! We're releasing two papers exploring why! We:
- Open source small clean EM models
- Show EM is driven by a single evil vector
- Show EM has a mechanistic phase transition
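The thread does not say how the "single evil vector" is found; a common way such directions are extracted in interpretability work is a mean difference of hidden activations between misaligned and aligned completions, which can then be added or ablated to test whether it drives the behavior. A generic sketch of that recipe (not necessarily these papers' method); the activation arrays are assumed to be collected beforehand.

```python
import numpy as np

def mean_difference_direction(acts_misaligned, acts_aligned):
    """Candidate 'misalignment' direction: difference of mean hidden states
    between misaligned and aligned completions, normalized to unit length."""
    direction = acts_misaligned.mean(axis=0) - acts_aligned.mean(axis=0)
    return direction / np.linalg.norm(direction)

def steer(hidden_states, direction, alpha):
    """Add (alpha > 0) or subtract (alpha < 0) the direction from a batch of
    hidden states to test its causal effect on model behavior."""
    return hidden_states + alpha * direction
```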

Sham Kakade (@shamkakade6) 's Twitter Profile Photo

1/6 Infinite-dim SGD in linear regression is the strawman model for studying scaling laws, critical batch sizes, and LR schedules. We revisit (and simplify) its analysis using just linear algebra, making it easier to derive and reason about. No PSD operators. No tensor calculus.
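For a concrete reference point, the finite-dimensional version of the object being analyzed is just SGD on squared loss with the update w ← w − η(xᵀw − y)x. A toy sketch of that recursion follows; the paper's infinite-dimensional analysis is not reproduced here.

```python
import numpy as np

def sgd_linear_regression(X, y, lr=0.01, epochs=5, seed=0):
    """One-sample-at-a-time SGD on the squared loss 0.5 * (x^T w - y)^2.
    The update is w <- w - lr * (x^T w - y) * x."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        for i in rng.permutation(n):
            residual = X[i] @ w - y[i]
            w -= lr * residual * X[i]
    return w
```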

Joseph Suarez (e/🐡) (@jsuarez5341) 's Twitter Profile Photo

PufferLib 3.0: We trained reinforcement learning agents on 1 Petabyte / 12,000 years of data with 1 server. Now you can, too! Our latest release includes algorithmic breakthroughs, massively faster training, and 10 new environments. Live demos on our site. Volume on for trailer!

Jaeyeon Kim (@jaeyeon_kim_0) 's Twitter Profile Photo

Excited to share that I’ll be presenting two oral papers in this ICML—see u guys in Vancouver!!🇨🇦
1️⃣ arxiv.org/abs/2502.06768 Understanding Masked Diffusion Models theoretically/scientifically
2️⃣ arxiv.org/abs/2502.09376 Theoretical analysis on LoRA training
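For context on the second paper: LoRA freezes the pretrained weight and learns a low-rank update, so the analysis concerns the standard parameterization below (the usual formulation, not a claim about the paper's exact setup).

```latex
% LoRA: freeze W_0 \in \mathbb{R}^{d \times k}; learn B \in \mathbb{R}^{d \times r},
% A \in \mathbb{R}^{r \times k} with rank r \ll \min(d, k). The adapted layer computes
h = W_0 x + \frac{\alpha}{r} B A x .
```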

Marvin Li (@marvin_li03) 's Twitter Profile Photo

Ecstatic to present an oral paper at ICML this year!!🎉
📚 “Blink of an Eye: a simple theory for feature localization in generative models”
🔗 arxiv.org/abs/2502.00921
Catch me at the poster session right after! See you there! 🚀

Naomi Saphra hiring a lab 🧈🪰 (@nsaphra) 's Twitter Profile Photo

🚨 New preprint! 🚨 Everyone loves causal interp. It’s coherently defined! It makes testable predictions about mechanistic interventions! But what if we had a different objective: predicting model behavior not under mechanistic interventions, but on unseen input data?

Cengiz Pehlevan (@cpehlevan) 's Twitter Profile Photo

Great to see this one finally out in PNAS!
Asymptotic theory of in-context learning by linear attention: pnas.org/doi/10.1073/pn…
Many thanks to my amazing co-authors Yue Lu, Mary Letey, Jacob Zavatone-Veth and Anindita Maiti
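"Linear attention" here means attention with the softmax removed, so an in-context prediction for a query is a key-weighted sum of the in-context values; the generic form is below (the paper's asymptotic setting adds scaling assumptions not reproduced here).

```latex
% Linear attention over an in-context set \{(k_i, v_i)\}_{i=1}^{n} with query q
% (normalization conventions vary):
\mathrm{LinAttn}\big(q; \{(k_i, v_i)\}\big) = \frac{1}{n} \sum_{i=1}^{n} (q^\top k_i)\, v_i .
```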

Nicholas Boffi (@nmboffi) 's Twitter Profile Photo

🧵generative models are sweet, but navigating existing repositories can be overwhelming, particularly when starting a new research project

so i built jax-interpolants, a clean & flexible implementation of the stochastic interpolant framework in jax

github.com/nmboffi/jax-in…
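For anyone new to the framework the repo implements: a stochastic interpolant connects a base sample x0 and a data sample x1 via x_t = α(t)·x0 + β(t)·x1 (optionally plus a noise term), and a velocity field is regressed onto the time derivative of x_t. A minimal numpy sketch of the linear case, not the repo's actual API.

```python
import numpy as np

def linear_interpolant(x0, x1, t):
    """Linear (deterministic) interpolant x_t = (1 - t) * x0 + t * x1,
    which satisfies x_0 = x0 and x_1 = x1."""
    return (1.0 - t) * x0 + t * x1

def velocity_target(x0, x1):
    """Time derivative of the linear interpolant, d/dt x_t = x1 - x0.
    A learned velocity field b(t, x_t) is regressed onto this target:
    loss = E || b(t, x_t) - (x1 - x0) ||^2."""
    return x1 - x0
```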

Miles Turpin (@milesaturpin) 's Twitter Profile Photo

New @Scale_AI paper! 🌟 LLMs trained with RL can exploit reward hacks but not mention this in their CoT. We introduce verbalization fine-tuning (VFT)—teaching models to say when they're reward hacking—dramatically reducing the rate of undetected hacks (6% vs. baseline of 88%).

Kulin Shah (@shahkulin98) 's Twitter Profile Photo

Thrilled to share that our work received the Outstanding Paper Award at ICML! I will be giving the oral presentation on Tuesday at 4:15 PM. Jaeyeon (Jay) Kim @ICML and I will both be at the poster session shortly after the oral presentation. Please attend if possible!