We are excited to announce FlashInfer v0.2!
Core contributions of this release include:
- Block/Vector Sparse (Paged) Attention on FlashAttention-3 (a conceptual sketch follows this list)
- JIT compilation for customized attention variants
- Fused Multi-head Latent Attention (MLA) decoding kernel
- Lots of bug fixes and other improvements
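To make the first item concrete, here is a minimal PyTorch sketch of what block-sparse attention computes: each query block attends only to the key/value blocks selected by a block mask, instead of the full sequence. This is a conceptual reference, not FlashInfer's API; the function name, block size, and mask layout are illustrative assumptions.

```python
import torch

def block_sparse_attention(q, k, v, block_mask, block_size=64):
    """Reference block-sparse attention for a single head.

    q: [M, d], k/v: [N, d]
    block_mask: [M // block_size, N // block_size] boolean; True means the
    query block attends to that key/value block.
    """
    M, d = q.shape
    scale = d ** -0.5
    out = torch.zeros_like(q)
    for qb in range(M // block_size):
        q_blk = q[qb * block_size:(qb + 1) * block_size]  # [block_size, d]
        # Keep only the key/value blocks this query block attends to.
        kv_blocks = block_mask[qb].nonzero(as_tuple=False).flatten().tolist()
        if not kv_blocks:
            continue
        k_sel = torch.cat([k[i * block_size:(i + 1) * block_size] for i in kv_blocks])
        v_sel = torch.cat([v[i * block_size:(i + 1) * block_size] for i in kv_blocks])
        attn = torch.softmax((q_blk @ k_sel.T) * scale, dim=-1)
        out[qb * block_size:(qb + 1) * block_size] = attn @ v_sel
    return out

# Tiny usage example: 2 query blocks x 4 key/value blocks with a random mask.
q = torch.randn(128, 64)
k = torch.randn(256, 64)
v = torch.randn(256, 64)
mask = torch.rand(2, 4) > 0.5
o = block_sparse_attention(q, k, v, mask, block_size=64)
```

The payoff of the fused kernels in this release is that the sparse gather and the attention computation happen in one pass on the GPU, rather than materializing the selected blocks as in this reference loop.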