Michael Poli (@michaelpoli6) 's Twitter Profile
Michael Poli

@michaelpoli6

AI, numerics and systems @StanfordAILab. Founding Scientist @LiquidAI_

ID: 1027766058390716416

Link: https://zymrael.github.io/ | Joined: 10-08-2018 03:58:18

379 Tweets

2.2K Followers

347 Following

Michael Poli (@michaelpoli6) 's Twitter Profile Photo

Excited about this line of work. Pretrained models are surprisingly resilient to architecture modification, opening up entirely new options for customization and optimization: operator swapping, rewiring depth into width, and more.
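
Below is a minimal, illustrative PyTorch sketch of the operator-swapping idea (this is not code from the referenced work): a toy "pretrained" block whose sequence-mixing operator is replaced by a linear-time mixer while its existing projection weights are reused. The names Block and linear_attention are hypothetical.

```python
# Toy operator swap: keep the pretrained projections, replace the sequence mixer.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Block(nn.Module):
    """A toy 'pretrained' block: fixed projections + a pluggable sequence mixer."""
    def __init__(self, d):
        super().__init__()
        self.qkv = nn.Linear(d, 3 * d)
        self.out = nn.Linear(d, d)

    def mix(self, q, k, v):
        # Default operator: softmax attention.
        return F.scaled_dot_product_attention(q, k, v)

    def forward(self, x):
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        return self.out(self.mix(q, k, v))

def linear_attention(q, k, v):
    """A linear-time mixer that reuses the same q/k/v projections."""
    q, k = q.softmax(-1), k.softmax(-2)
    return q @ (k.transpose(-2, -1) @ v)

block = Block(64)                  # pretend these weights are pretrained
x = torch.randn(2, 128, 64)
y_attn = block(x)

block.mix = linear_attention       # swap the operator, keep the weights
y_linear = block(x)
print(y_attn.shape, y_linear.shape)  # both (2, 128, 64)
```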

Zhihao Jia (@jiazhihao) 's Twitter Profile Photo

One of the best ways to reduce LLM latency is by fusing all computation and communication into a single GPU megakernel. But writing megakernels by hand is extremely hard.

🚀Introducing Mirage Persistent Kernel (MPK), a compiler that automatically transforms LLMs into optimized
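
As a conceptual analogy only (this is not MPK's API or generated code), the persistent-kernel idea can be pictured as one long-lived loop draining a fused task graph in which compute and communication tasks run back-to-back, instead of one kernel launch per operator. All names below are illustrative.

```python
# Toy persistent loop over a fused task graph (conceptual analogy, not MPK).
from collections import deque

def run_persistent(task_queue):
    # One long-lived loop drains fine-grained tasks; any task whose dependencies
    # are ready (compute or communication alike) runs immediately, with no
    # per-op launch or synchronization in between.
    results = {}
    while task_queue:
        name, fn, deps = task_queue.popleft()
        if any(d not in results for d in deps):
            task_queue.append((name, fn, deps))   # dependencies not ready yet
            continue
        results[name] = fn(*[results[d] for d in deps])
    return results

# Toy fused graph: a compute task, a communication stand-in, and an activation.
tasks = deque([
    ("matmul", lambda: 3.0 * 2.0,     []),
    ("allred", lambda y: y + y,       ["matmul"]),  # stands in for communication
    ("act",    lambda s: max(s, 0.0), ["allred"]),
])
print(run_persistent(tasks)["act"])   # 12.0
```
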
Tri Dao (@tri_dao) 's Twitter Profile Photo

Getting mem-bound kernels to speed-of-light isn't a dark art, it's just about getting a couple of details right. We wrote a tutorial on how to do this, with code you can directly use. Thanks to the new CuTe-DSL, we can hit speed-of-light without a single line of CUDA C++.
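
As a rough illustration of what "speed of light" means for a memory-bound kernel (plain PyTorch, not the tutorial's CuTe-DSL code), one can compare achieved bytes per second against an assumed peak DRAM bandwidth. The peak value below is an assumption you would set for your own GPU.

```python
# Estimate achieved DRAM bandwidth of a memory-bound op vs. an assumed peak.
import torch

PEAK_BW_GB_S = 3350.0   # assumed peak (e.g. an H100 SXM); set for your GPU

def achieved_bandwidth(n=1 << 26, iters=50):
    x = torch.randn(n, device="cuda", dtype=torch.float16)
    y = torch.empty_like(x)
    for _ in range(5):                 # warm-up
        y.copy_(x)
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        y.copy_(x)                     # memory-bound: reads and writes n*2 bytes
    end.record()
    torch.cuda.synchronize()
    seconds = start.elapsed_time(end) / 1e3 / iters
    gb = 2 * x.numel() * x.element_size() / 1e9
    return gb / seconds

bw = achieved_bandwidth()
print(f"achieved {bw:.0f} GB/s, {100 * bw / PEAK_BW_GB_S:.0f}% of assumed peak")
```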

clem 🤗 (@clementdelangue) 's Twitter Profile Photo

It’s time for the American AI community to wake up, drop the "open is not safe" bullshit, and return to its roots: open science and open-source AI, powered by an unmatched community of frontier labs, big tech, startups, universities, and non‑profits. If we don’t, we’ll be forced

Shengjia Zhao (@shengjia_zhao) 's Twitter Profile Photo

I am very excited to take up the role of Chief Scientist for Meta Superintelligence Labs. Looking forward to building ASI and aligning it to empower people with the amazing team here. Let's build!

Binyuan Hui (@huybery) 's Twitter Profile Photo

I believe LLMs will inevitably surpass humans in coding. Consider how humans actually learn to code: it happens in two stages. First comes memorization and imitation: learning syntax and copying good projects. Then comes trial and error: writing code,
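
A toy sketch of the trial-and-error stage described above: propose code, run it against tests, and turn pass/fail into a reward signal. Here generate_candidates() is a hypothetical stand-in for sampling from a code model.

```python
# Toy trial-and-error loop: generate candidates, execute tests, reward passes.
import subprocess, sys, tempfile, textwrap

TESTS = textwrap.dedent("""
    assert add(2, 3) == 5
    assert add(-1, 1) == 0
""")

def generate_candidates():
    # Hypothetical: in practice, sample these from a language model.
    return ["def add(a, b):\n    return a - b\n",
            "def add(a, b):\n    return a + b\n"]

def passes_tests(candidate: str) -> bool:
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(candidate + "\n" + TESTS)
        path = f.name
    return subprocess.run([sys.executable, path], capture_output=True).returncode == 0

for i, cand in enumerate(generate_candidates()):
    reward = 1.0 if passes_tests(cand) else 0.0   # signal for filtering or RL
    print(f"candidate {i}: reward {reward}")
```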

Brian Hie (@brianhie) 's Twitter Profile Photo

Welcome to the age of generative genome design!

In 1977, Sanger et al. sequenced the first genome—of phage ΦX174.

Today, led by Samuel King, we report the first AI-generated genomes. Using ΦX174 as a template, we made novel, high-fitness phages with genome language models. 🧵

Michael Poli (@michaelpoli6) 's Twitter Profile Photo

A careful study of architecture transfer dynamics, to be presented as an oral at NeurIPS 2025 (top 0.3% of all submissions). Advances in grafting methods significantly accelerate research on architecture design, scaling recipes, and even post-training.

Michael Poli (@michaelpoli6) 's Twitter Profile Photo

We just released the largest open-source diffusion language model (RND1). RND1 is important to me on a personal level: it symbolizes our commitment to open-source exploration of radically different designs for AI at scale — training objectives, architectures, domains. There is

Radical Numerics (@radicalnumerics) 's Twitter Profile Photo

Sliding window attention (SWA) is powering frontier hybrid models for efficiency. Is there something better?

Introducing Phalanx, a faster and better quality drop-in replacement for sliding window attention (SWA).

Phalanx is a new family of hardware and numerics-aware windowed
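
For reference, here is a minimal PyTorch sketch of the sliding window attention baseline that Phalanx targets; Phalanx's own kernel and numerics are not shown in this thread, so this only illustrates the SWA pattern in which each query attends to a fixed-size causal window.

```python
# Sliding window attention as a banded causal mask over standard attention.
import torch
import torch.nn.functional as F

def sliding_window_attention(q, k, v, window: int):
    # q, k, v: (batch, heads, seq, dim). Each query attends to itself and the
    # previous `window - 1` positions only.
    seq = q.shape[-2]
    i = torch.arange(seq, device=q.device)
    band = (i[:, None] - i[None, :]).clamp(min=0) < window   # within the window
    causal = i[:, None] >= i[None, :]                        # no future positions
    mask = band & causal
    return F.scaled_dot_product_attention(q, k, v, attn_mask=mask)

q = k = v = torch.randn(1, 4, 256, 64)
out = sliding_window_attention(q, k, v, window=128)
print(out.shape)   # (1, 4, 256, 64)
```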