Huanxuan Liao (@xn_hyacinth) 's Twitter Profile
Huanxuan Liao

@xn_hyacinth

M.S. student in @UCAS1978

ID: 1497629518357745665

linkhttps://xnhyacinth.github.io/ calendar_today26-02-2022 17:48:17

6 Tweet

12 Takipçi

120 Takip Edilen

Huanxuan Liao (@xn_hyacinth) 's Twitter Profile Photo

🚀 Excited to share our new GitHub repo featuring must-read papers & blogs on Efficient Transformers, Length Extrapolation, Long Term Memory, Retrieval Augmented Generation (RAG), and Evaluation for Long Context Modeling! 🌟🔥 📚 [github.com/Xnhyacinth/Awe…]

Dawei Zhu (@dwzhu128) 's Twitter Profile Photo

[1/n] Super excited to introduce our comprehensive survey on Long Context Language Models (LCLM), a collaborative effort between amazing researchers from Nanjing University , Peking University , CASIA, Alibaba Group , ByteDance , Tencent 腾讯 & Kuaishou! 🚀 Our survey covers 3 core

[1/n]
Super excited to introduce our comprehensive survey on Long Context Language Models (LCLM), a collaborative effort between amazing researchers from <a href="/NJU1902/">Nanjing University</a> , <a href="/PKU1898/">Peking University</a> , CASIA, <a href="/AlibabaGroup/">Alibaba Group</a> , <a href="/BytedanceTalk/">ByteDance</a> , <a href="/TencentGlobal/">Tencent 腾讯</a>  &amp; Kuaishou! 🚀

Our survey covers 3 core
𝚐𝔪𝟾𝚡𝚡𝟾 (@gm8xx8) 's Twitter Profile Photo

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Zero RL training is evaluated across 10 base models using rule-based rewards. Key strategies improve reasoning and response length, with model-specific dynamics and “aha moments”

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Zero RL training is evaluated across 10 base models using rule-based rewards. Key strategies improve reasoning and response length, with model-specific dynamics and “aha moments”