Guangxuan Xiao (@guangxuan_xiao)'s Twitter Profile
Guangxuan Xiao

@guangxuan_xiao

Ph.D. student at @MITEECS. Prev: CS & Finance @Tsinghua_Uni

ID: 1230368227336654849

Link: https://guangxuanx.com

Joined: 20-02-2020 05:47:08

87 Tweets

1.1K Followers

556 Following

Chenyu Wang (@chenyuw64562111)'s Twitter Profile Photo

Excited to share: "Fine-tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design"

With my amazing coauthors Masatoshi Uehara, Yichun He, Amy Wang, Tommaso Biancalani, @lal_avantika, Tommi Jaakkola, Sergey Levine, Hanchen Wang, Aviv Regev
机器之心 JIQIZHIXIN (@synced_global)'s Twitter Profile Photo

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads arxiv.org/abs/2410.10819 github.com/mit-han-lab/du… #MIT Song Han
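
For context: DuoAttention's core idea is to classify attention heads as retrieval heads, which need the full KV cache, or streaming heads, which only need the attention sinks plus recent tokens. Below is a minimal sketch of the resulting per-head KV keep-mask, assuming the head classification is already given (in the paper it is learned via an optimization procedure); `duo_kv_mask` and its parameters are illustrative names, not the repo's API.

```python
import torch

def duo_kv_mask(is_retrieval, seq_len, n_sink=4, recent=256):
    # Boolean keep-mask over cached positions, one row per head.
    # Retrieval heads keep the full KV cache; streaming heads keep only
    # the initial "sink" tokens plus the most recent window.
    pos = torch.arange(seq_len)
    stream = (pos < n_sink) | (pos >= seq_len - recent)
    full = torch.ones(seq_len, dtype=torch.bool)
    return torch.stack([full if r else stream for r in is_retrieval])

# e.g. two retrieval heads and two streaming heads over a 4096-token cache
mask = duo_kv_mask([True, False, False, True], seq_len=4096)
```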

Muyang Li (@lmxyy1999)'s Twitter Profile Photo

🚀 The 4-bit era has arrived! Meet #SVDQuant, our new W4A4 quantization paradigm for diffusion models. Now, 12B FLUX can run on a 16GB 4090 laptop without offloading, with 3x speedups over W4A16 models (like NF4) while maintaining top-tier image quality. #AI #Quantization 1/7
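
The key trick in SVDQuant is to absorb weight outliers into a 16-bit low-rank branch obtained by SVD, so the residual has a much narrower dynamic range and quantizes cleanly to 4 bits. A minimal weight-side sketch follows; the real method also migrates activation outliers into the weights, quantizes activations to 4 bits, and fuses the two branches into one kernel, none of which is shown here, and the function names are illustrative.

```python
import torch

def svdquant_decompose(W, rank=32):
    # Split the weight into a 16-bit low-rank branch (absorbs outliers)
    # plus a 4-bit residual on a much narrower dynamic range.
    U, S, Vh = torch.linalg.svd(W.float(), full_matrices=False)
    L1 = U[:, :rank] * S[:rank]          # [out, rank]
    L2 = Vh[:rank]                       # [rank, in]
    R = W - L1 @ L2                      # residual weight
    scale = R.abs().max() / 7            # symmetric int4 grid: [-8, 7]
    Rq = (R / scale).round().clamp(-8, 7)
    return L1, L2, Rq, scale

def linear_forward(x, L1, L2, Rq, scale):
    # y = x W^T  ~=  (x L2^T) L1^T + x (dequantized residual)^T
    return (x @ L2.T) @ L1.T + x @ (Rq * scale).T
```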

Scale ML (@scaleml)'s Twitter Profile Photo

Hello everyone, this week at 3pm EST Nov 20 (Wed) we will be having Guangxuan Xiao present his work on efficient/effective long sequence modeling!

Sign up via scale-ml.org to join our mailing list and get Zoom access.
Tianwei Yin (@tianweiy)'s Twitter Profile Photo

Video diffusion models generate high-quality videos but are too slow for interactive applications. We (MIT CSAIL & Adobe Research) introduce CausVid, a fast autoregressive video diffusion model that starts playing the moment you hit "Generate"! A thread 🧵

Haocheng Xi (@haochengxiucb)'s Twitter Profile Photo

🚀 We're excited to open source an FP8 training technique, COAT: Compressing Optimizer states and Activation for memory-efficient FP8 Training. COAT is accepted by ICLR 2025!

FP8 training effectively improves training efficiency. DeepSeek-V3 is a successful example of
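
For intuition on the "compressing optimizer states" part: storing Adam moments in FP8 needs careful scaling, since e4m3 covers only about three decades of magnitude. A minimal sketch of per-group FP8 storage, assuming PyTorch's `torch.float8_e4m3fn` dtype and a tensor whose size divides evenly by the group size; COAT's actual method additionally applies dynamic range expansion before quantization, which is omitted here.

```python
import torch

def quantize_fp8(x, group=128):
    # Per-group scaling maps each group into the FP8 e4m3 range (max 448),
    # storing 1 byte per element plus one scale per group.
    g = x.reshape(-1, group)              # assumes numel % group == 0
    scale = g.abs().amax(dim=1, keepdim=True).clamp(min=1e-12) / 448.0
    return (g / scale).to(torch.float8_e4m3fn), scale

def dequantize_fp8(q, scale, shape):
    # Recover a float32 view of the state for the optimizer update.
    return (q.to(torch.float32) * scale).reshape(shape)
```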
Shang Yang (@shang_mit)'s Twitter Profile Photo

🎉 Excited to share that LServe, our efficient long-sequence LLM serving framework, is accepted by #MLSys'25! 🔥

⚡ Up to 2.9× faster prefilling & 1.3-2.1× faster decoding over vLLM
🔋 Hybrid attention kernels unifying static & dynamic sparsity

🔗 hanlab.mit.edu/projects/lserve (1/5)
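
The "dynamic sparsity" half refers to query-aware KV page selection: score each page of the KV cache cheaply from per-page key summaries and attend only to the top-scoring pages, while the static half fixes some heads to a sink-plus-local streaming pattern. Below is a sketch of the selection step in the spirit of the Quest-style upper bound that this line of work builds on; the names and shapes are assumptions, not LServe's actual API.

```python
import torch

def select_kv_pages(q, page_key_max, page_key_min, top_pages=64):
    # Dynamic sparsity: bound q.k over each KV page using channel-wise
    # per-page key max/min summaries, then keep the highest-scoring pages.
    # q: [d]; page_key_max, page_key_min: [n_pages, d]
    upper = torch.maximum(q * page_key_max, q * page_key_min).sum(dim=-1)
    k = min(top_pages, upper.numel())
    return upper.topk(k).indices
```

Because the bound is computed per page rather than per token, selection cost grows with the number of pages, not the sequence length, which is what keeps decoding fast at long contexts.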
Muyang Li (@lmxyy1999)'s Twitter Profile Photo

🚀 Meet #RadialAttention: a static sparse attention mechanism with O(n log n) complexity for long video generation!
✅ Plug-and-play: works with pretrained models like #Wan, #HunyuanVideo, #Mochi
✅ Speeds up both training & inference by 2-4×, without quality loss
🧵1/4
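
One way to picture an O(n log n) static sparse mask: a dense local band plus exponentially wider distance bands, each subsampled with a stride that doubles per band, so every query attends to O(window) tokens per band and O(window log n) tokens overall. The sketch below is an illustrative dyadic construction under that reading, not the exact RadialAttention mask, which decays along spatiotemporal distance in the video token layout.

```python
import torch

def radial_mask(n, window=64):
    # Band k covers distances [window * 2^(k-1), window * 2^k) and is
    # subsampled with stride 2^k, giving O(n log n) nonzeros in total.
    i = torch.arange(n).unsqueeze(1)
    j = torch.arange(n).unsqueeze(0)
    d = (i - j).abs()
    mask = d < window                     # dense local band
    k = 1
    while (window << k) <= 2 * n:
        band = (d >= (window << (k - 1))) & (d < (window << k))
        mask |= band & ((i - j) % (1 << k) == 0)   # strided sampling
        k += 1
    return mask
```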

Guangxuan Xiao (@guangxuan_xiao)'s Twitter Profile Photo

The release of the GPT-OSS-120B & GPT-OSS-20B models today incorporates my Attention Sink work (github.com/mit-han-lab/st…).

Exciting to see this come to life! 🎉 Looking forward to more progress in this space. 😁
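
The Attention Sink work (StreamingLLM) observed that models dump attention mass on the first few tokens, so evicting those "sinks" breaks generation. Its cache policy keeps the initial sink tokens plus a rolling recent window, giving constant memory on arbitrarily long streams. A minimal sketch of that eviction rule; the real implementation also re-indexes RoPE positions within the cache, which is omitted here.

```python
import torch

class SinkKVCache:
    # Keep the first n_sink tokens (attention sinks) plus a rolling
    # window of recent tokens; memory stays constant as the stream grows.
    def __init__(self, n_sink=4, window=1020):
        self.n_sink, self.window = n_sink, window
        self.k = self.v = None

    def append(self, k_new, v_new):  # tensors shaped [t, n_heads, head_dim]
        self.k = k_new if self.k is None else torch.cat([self.k, k_new])
        self.v = v_new if self.v is None else torch.cat([self.v, v_new])
        if self.k.shape[0] > self.n_sink + self.window:
            self.k = torch.cat([self.k[:self.n_sink], self.k[-self.window:]])
            self.v = torch.cat([self.v[:self.n_sink], self.v[-self.window:]])
        return self.k, self.v
```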
Graham Neubig (@gneubig)'s Twitter Profile Photo

Summary of GPT-OSS architectural innovations:
1. sliding window attention (ref: arxiv.org/abs/1901.02860)
2. mixture of experts (ref: arxiv.org/abs/2101.03961)
3. RoPE w/ YaRN (ref: arxiv.org/abs/2309.00071)
4. attention sinks (ref: StreamingLLM, arxiv.org/abs/2309.17453)
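
On item 4: in GPT-OSS the sink is not a cached token but a learned per-head logit appended to the attention softmax, letting each head dump probability mass on "nothing" instead of being forced to attend somewhere. A minimal sketch under that reading (causal masking omitted for brevity; the names are illustrative, not the released code's API):

```python
import torch

def attention_with_sinks(q, k, v, sink_logits):
    # q, k, v: [n_heads, t, d]; sink_logits: [n_heads] learned scalars.
    scores = q @ k.transpose(-2, -1) / q.shape[-1] ** 0.5
    # Append the sink logit as an extra "column" that competes in the
    # softmax but contributes no value vector.
    sink = sink_logits.view(-1, 1, 1).expand(-1, scores.shape[1], 1)
    probs = torch.softmax(torch.cat([scores, sink], dim=-1), dim=-1)
    return probs[..., :-1] @ v  # drop the sink column before mixing values
```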

Ryan Hanrui Wang (@hanrui_w)'s Twitter Profile Photo

Announcing Eigen AI, the world's first company dedicated to AEI (Artificial Efficient Intelligence). 🚀 The future of AI is already here; it's simply not evenly distributed. Our mission is to close that gap by driving radical efficiency so that every person and