We'll be presenting the Forgetting Transformer during the poster session at 3 pm on April 25th at #ICLR2025 (board number 282). Come and chat with us!
• Poster info: iclr.cc/virtual/2025/p…
• Paper: arxiv.org/abs/2503.02130
• Code: github.com/zhixuan-lin/fo…
#COLM2025 We introduce Adaptive Computation Pruning (ACP) for the Forgetting Transformer (FoX): a provably safe pruning method that significantly speeds up our Forgetting Attention kernel, especially for long-context pretraining. Our simple Triton kernel with ACP is 1.7x to 2.4x faster.
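For context, here is a minimal, non-fused PyTorch sketch of forgetting attention with a pruning check, assuming the FoX formulation where each attention logit receives a decay bias equal to the sum of log forget gates between the key and query positions. This is an illustrative sketch only: the function name `forgetting_attention`, the `log_fgate` argument, and the `prune_threshold` value are placeholders, and the real speedups come from the fused Triton kernel skipping whole key/value blocks rather than masking individual entries.

```python
import torch

def forgetting_attention(q, k, v, log_fgate, prune_threshold=-20.0):
    # Conceptual sketch (not the actual fused kernel).
    # q, k, v: (T, d); log_fgate: (T,), holding log f_t for each position.
    T, d = q.shape
    # Cumulative log forget gates; the decay bias for query i and key j (j <= i)
    # is D[i, j] = sum_{l=j+1..i} log f_l = c[i] - c[j].
    c = torch.cumsum(log_fgate, dim=0)
    D = c.unsqueeze(1) - c.unsqueeze(0)                  # (T, T)
    causal = torch.ones(T, T).tril().bool()
    scores = q @ k.T / d ** 0.5 + D
    # Pruning check: entries whose decay bias falls below the threshold
    # contribute negligibly to the softmax and are dropped. In the fused
    # kernel, whole blocks are skipped before any QK^T work is done,
    # which is where the wall-clock savings come from.
    keep = causal & (D > prune_threshold)
    scores = scores.masked_fill(~keep, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

# Example usage with random inputs (shapes chosen arbitrarily):
T, d = 128, 64
q, k, v = torch.randn(3, T, d).unbind(0)
log_fgate = torch.nn.functional.logsigmoid(torch.randn(T))  # log f_t <= 0
out = forgetting_attention(q, k, v, log_fgate)              # (T, d)
```

Because log f_t is always non-positive, the decay bias only becomes more negative as keys get farther from the query, so dropping entries below a sufficiently negative threshold changes the output by a bounded (negligible) amount, which is the sense in which the pruning is safe.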
Going beyond dormancy and into gradient activity for identifying neuron activity: check out our work led by Jason Liu, Zihao Wu, and Johan Obando-Ceron 👍🏽.
And if you'll be in San Diego for #NeurIPS2025, come by our poster to chat!