Zihao Ye (@ye_combinator) 's Twitter Profile
Zihao Ye

@ye_combinator

Building flashinfer (github.com/flashinfer-ai/…)

ID: 916605919210827777

linkhttps://homes.cs.washington.edu/~zhye/ calendar_today07-10-2017 10:07:35

118 Tweet

1,1K Followers

511 Following

Zihao Ye (@ye_combinator) 's Twitter Profile Photo

We are excite to announce FlashInfer v0.2! Core contributions of this release include: - Block/Vector Sparse (Paged) Attention on FlashAttention-3 - JIT compilation for customized attention variants - Fused Multi-head Latent Attention (MLA) decoding kernel - Lots of bugfix and

We are excite to announce FlashInfer v0.2!

Core contributions of this release include:
- Block/Vector  Sparse (Paged) Attention on FlashAttention-3 
- JIT compilation for customized attention variants
- Fused Multi-head Latent Attention (MLA) decoding kernel
- Lots of bugfix and