Zeyu Huang (@zeroyuhuang) 's Twitter Profile
Zeyu Huang

@zeroyuhuang

PhD @ EdinburghNLP
Working on LLM + RL

ID: 1538913436138057728

linkhttps://scholar.google.com/citations?hl=en&user=EWU88_YAAAAJ calendar_today20-06-2022 15:55:49

9 Tweet

54 Takipçi

119 Takip Edilen

Zeyu Huang (@zeroyuhuang) 's Twitter Profile Photo

Excited that our paper: "Transformer-Patcher: One mistake worth one neuron" got accepted at ICLR 2023! #ICLR2023 This work focused on the continuous and timely fixing of large Pretrained Language Models after deployment. The full paper: arxiv.org/abs/2301.09785

Qwen (@alibaba_qwen) 's Twitter Profile Photo

🚀 New Approach to Training MoE Models! We’ve made a key change: switching from micro-batches to global-batches for better load balancing. This simple tweak lets experts specialize more effectively, leading to: ✅ Improved model performance ✅ Better handling of real-world

🚀 New Approach to Training MoE Models! We’ve made a key change: switching from micro-batches to global-batches for better load balancing. This simple tweak lets experts specialize more effectively, leading to: 
✅ Improved model performance  
✅ Better handling of real-world
Zeyu Huang (@zeroyuhuang) 's Twitter Profile Photo

Two papers got accepted at #ICLR2025 and one at #NAACL2025! One for calibrating the RM bias: openreview.net/forum?id=Iu8Ry… Two for MoE: openreview.net/forum?id=eWNEq…; arxiv.org/abs/2406.18219 Thanks to my great supervisors Ivan Titov Edoardo Ponti and my excellent co-author Zihan Qiu!

Zeyu Huang (@zeroyuhuang) 's Twitter Profile Photo

Thrilled that we're 2 for 2 at NeurIPS 2025, with one Oral and one Spotlight !!! 🎉 If there's one lesson I've learned, it's that solid work and controlled studies truly pay off. Massive congrats to all my amazing collaborators! #NeurIPS2025

Thrilled that we're 2 for 2 at NeurIPS 2025, with one Oral and one Spotlight !!! 🎉 If there's one lesson I've learned, it's that solid work and controlled studies truly pay off.

Massive congrats to all my amazing collaborators! #NeurIPS2025