shuo yang
@randwalk0
ID: 1629407863440539649
25-02-2023 09:08:19
9 Tweet
37 Followers
51 Following
π SANA-Video: Linear Attention + Constant-Memory KV Cache = Fast Long Videos π₯ Key Features π π§ Linear DiT everywhere β O(N) complexity on video-scale tokens π§° Constant-memory Block KV cache β store cumulative states only (no growing KV) π π― Temporal Mix-FFN + 3D RoPE
π Come check out our Spotlight Poster @Neurips 2025! π Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation π Exhibit Hall C,D,E β #3508 ποΈ Fri, Dec 5 | π 4:30β7:30 PM PST β‘ Sparse VideoGen2 boosts video generation efficiency
See you at our poster session at NeurIPS Conference! π Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation 4:30 - 7:30pm, Exhibit Hall, CDE, id = 5414 Come talk with us if you are interested in efficient ML & VideoGen π₯³
TurboDiffusion: 100β205Γ faster video generation on a single RTX 5090 π Only takes 1.8s to generate a high-quality 5-second video. The key to both high speed and high quality? πSageAttention + Sparse-Linear Attention (SLA) + rCM Github: github.com/thu-ml/TurboDiβ¦ Technical
πSonicMoEπ: a blazingly-fast MoE implementation optimized for NVIDIA Hopper GPUs. SonicMoE reduces activation memory by 45% and is 1.86x faster on H100 than previous SOTAπ Paper: arxiv.org/abs/2512.14080 Work with Mayank Mishra, Xinle Cheng, Ion Stoica, Tri Dao