Perry Zhang (@py_z001)'s Twitter Profile
Perry Zhang

@py_z001

PhD student at UCSD CSE

ID: 1058755338889916416

Link: https://veiled-texture-20c.notion.site/Peiyuan-Zhang-ab24b48621c9491db767a76df860873a
Joined: 03-11-2018 16:18:38

160 Tweets

862 Followers

362 Following

Hao AI Lab (@haoailab):

Thrilled to share recent research from our fascinating lab members and collaborators at #ICLR2025! 🚀✨ Come say hi in our poster sessions and dive into discussions on LLM agents, reasoning, long-context training, efficient inference, and more. We’re excited to share, learn and

Hao AI Lab (@haoailab):

Announcing FastVideo V1, a unified framework for accelerating video generation. FastVideo V1 offers:
- A simple, consistent Python API
- State-of-the-art model performance optimizations
- Optimized implementations of popular models
Blog: hao-ai-lab.github.io/blogs/fastvide…
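The announcement highlights a simple, consistent Python API. The snippet below is only an illustrative sketch of what calling such an API could look like; the `VideoGenerator` class, `from_pretrained` constructor, `num_gpus`/`output_path` arguments, and the model ID are assumptions rather than confirmed FastVideo V1 interfaces, so consult the linked blog and docs for the actual usage.

```python
# Illustrative sketch only -- the names below (VideoGenerator, from_pretrained,
# generate_video, num_gpus, output_path, the model id) are assumptions about
# what a simple Python API for accelerated video generation could look like.
from fastvideo import VideoGenerator  # assumed import path

# Load an accelerated model checkpoint onto one GPU (hypothetical arguments).
generator = VideoGenerator.from_pretrained(
    "FastVideo/FastHunyuan-diffusers",  # placeholder model id
    num_gpus=1,
)

# Generate a short clip from a text prompt and save it to disk.
video = generator.generate_video(
    "A timelapse of clouds rolling over a mountain ridge",
    output_path="outputs/",
)
```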

Perry Zhang (@py_z001):

I will be giving a talk in GPU MODE tomorrow (May 31 12pm PST) about FastVideo/STA/VSA. Come if you're interested! youtube.com/watch?v=x44iGp…

Hao AI Lab (@haoailab):

🔧🤖 New wave of open-source LLMs like DeepSeek-R1-0528 and Qwen3-235B-A22B are leveling up with stronger agentic performance. We test them in head-to-head gameplay — the upgraded DeepSeek-R1-0528 outsmarts strong reasoning models like o4-mini across several games and it nearly

Perry Zhang (@py_z001):

🚀 Attention is the bottleneck in video DiTs—5s of 720p = 100K+ tokens, and the quadratic cost blows up fast. Sparse/linear attention is 🔑 for long-context world models. 🧠 Track relevant papers in our awsome-video-attention repo → github.com/hao-ai-lab/Aws… #WorldModel #VideoAI
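To make the quadratic blow-up concrete, here is a rough back-of-the-envelope comparison of dense self-attention against a sparse/windowed pattern. Only the ~100K-token figure comes from the tweet; the head dimension and the per-query window size below are my own illustrative assumptions.

```python
# Back-of-the-envelope cost comparison (illustrative assumptions, not
# measurements): dense self-attention vs. a sparse/windowed pattern.
N = 100_000        # tokens for ~5s of 720p video latents (figure from the tweet)
d = 128            # assumed attention head dimension
window = 4_096     # assumed number of keys each query attends to (sparse case)

# Matmul FLOPs per head per layer: QK^T plus the attention-weighted V,
# each costing roughly 2 * (#queries) * (#keys) * d.
dense_flops = 2 * (2 * N * N * d)
sparse_flops = 2 * (2 * N * window * d)

print(f"dense : {dense_flops:.2e} FLOPs/head/layer")
print(f"sparse: {sparse_flops:.2e} FLOPs/head/layer")
print(f"reduction: {dense_flops / sparse_flops:.0f}x")  # ~N/window ≈ 24x here
```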

Hao AI Lab (@haoailab):

📣 We’ve had three papers accepted at #ICML2025, and Hao-AI-Lab is sending Hao Zhang to attend ICML in person 😂! If you're around, please find Hao at the venue and chat with him about video diffusion, LLM agents, and efficient attention 👋🧠 🎬 Fast Video Generation with Sliding

Ali Hassani (@alihassanijr):

Watch my talk about NATTEN on GPU MODE this Saturday at 3PM ET / noon PT. I'll go over all the exciting new features we shipped very recently, especially our Hopper and Blackwell FNA kernels, now speeding up video / world models by up to 2.6X e2e! youtube.com/watch?v=mF_H_J

Hao AI Lab (@haoailab):

(1/n) 🚀 With FastVideo, you can now generate a 5-second video in 5 seconds on a single H200 GPU! Introducing the FastWan series, a family of fast video generation models trained via a new recipe we term “sparse distillation”, which speeds up video denoising time by 70X! 🖥️ Live

Perry Zhang (@py_z001):

Simple design often wins in the long run. GPT-OSS uses sliding window attention. Our Sliding Tile Attention brings efficient window attention to video generation: arxiv.org/abs/2502.04507
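For readers unfamiliar with the idea, here is a minimal sketch of plain 1-D sliding-window attention implemented via masking. This is not the Sliding Tile Attention kernel from the paper: the window size and tensor shapes are illustrative assumptions, and a real kernel would never materialize the full score matrix.

```python
import torch
import torch.nn.functional as F

def sliding_window_attention(q, k, v, window: int = 256):
    """Minimal masked sliding-window attention (illustration only).

    Each query position i attends only to keys j with |i - j| <= window // 2.
    Efficient implementations (tiled/fused kernels) avoid building the full
    N x N score matrix; this sketch builds it purely for clarity.
    """
    n, d = q.shape[-2], q.shape[-1]
    scores = q @ k.transpose(-2, -1) / d ** 0.5           # (..., n, n)
    idx = torch.arange(n, device=q.device)
    outside = (idx[None, :] - idx[:, None]).abs() > window // 2
    scores = scores.masked_fill(outside, float("-inf"))   # mask far-away keys
    return F.softmax(scores, dim=-1) @ v

# Tiny usage example with random tensors.
q = k = v = torch.randn(1, 1024, 64)
out = sliding_window_attention(q, k, v, window=128)
print(out.shape)  # torch.Size([1, 1024, 64])
```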

Hao AI Lab (@haoailab):

[Lmgame Bench] 🔥 We tested OpenAI’s GPT-5-thinking-high and two recent open-source models in our Lmgame Bench! Across 26 models and 6 games (Sokoban, Tetris, 2048, Candy Crush, Mario, Ace Attorney), here’s where they landed: GPT-5-thinking-high → #2

Hao AI Lab (@haoailab):

[Lmgame Bench] 🤔 Ever wondered how to evaluate different games in Lmgame-Bench or even add your own, but don’t know where to start? We’ve made it super easy to run evaluations and integrate new games. Our latest blog walks you through a few key features from Lmgame Bench
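The linked blog covers the actual integration path. Purely to illustrate the general shape of plugging a game into an LLM game-evaluation harness, here is a hypothetical sketch: the game class, method names, and scoring loop below are invented for this example and are not Lmgame-Bench code.

```python
# Hypothetical sketch -- NOT the actual Lmgame-Bench API. It only shows the
# general pattern: a game exposes a text observation, accepts a text action
# from the model, and reports a final score.
from dataclasses import dataclass

@dataclass
class ToySlidingGame:            # invented toy "game" for illustration
    score: int = 0
    steps: int = 0
    done: bool = False

    def observation(self) -> str:
        return f"step={self.steps} score={self.score}. Reply with: up/down/left/right"

    def step(self, action: str) -> None:
        self.steps += 1
        if action.strip().lower() in {"up", "down", "left", "right"}:
            self.score += 4      # toy reward for a legal move
        if self.steps >= 10:
            self.done = True

def run_episode(game, llm_move) -> int:
    """Drive one episode: show the observation, apply the model's move."""
    while not game.done:
        game.step(llm_move(game.observation()))
    return game.score

# Usage with a trivial stand-in "model" that always moves left.
print(run_episode(ToySlidingGame(), lambda obs: "left"))  # -> 40
```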

Yichao Fu (@fuyichao123):

Excited to share my 1st project as a Research Scientist Intern at Meta FAIR! Grateful to my mentor Jiawei Zhao for guidance, and to Yuandong Tian & Xuewei for their valuable advice and collaboration. Our work DeepConf explores local confidence for more accurate & efficient LLM reasoning!

Hao AI Lab (@haoailab):

[1/5] [Lmgame Bench] 🎮 Question: Can RL-based LLM post-training on games generalize to other tasks?

We shared a preliminary study to explore this question:
- Same-family (in-domain): Training on 6×6 Sokoban → 8×8 and Tetris (1 block type) → Tetris (2 block types) transfers,

Hao AI Lab (@haoailab):

🚀 Thrilled to share that our lab has THREE papers accepted at #NeurIPS2025 on AI efficiency, from reasoning to video generation. Come hang out with us; it's going to be a lot of fun this year with NeurIPS local to UCSD! 😎

📊 Efficiently Scaling LLM Reasoning with Certaindex Introduces