Sagnik Mukherjee (@saagnikkk) 's Twitter Profile
Sagnik Mukherjee

@saagnikkk

CS PhD student at @IllinoisCDS @convai_uiuc

ID: 1617896838950187009

linkhttps://sagnikmukherjee.github.io/ calendar_today24-01-2023 14:47:35

46 Tweet

107 Followers

159 Following

Sagnik Mukherjee (@saagnikkk) 's Twitter Profile Photo

🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models” From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮 And this isn’t a one-off. The pattern holds across RL algorithms and models. 🧵A Deep Dive

🚨 Paper Alert: “RL Finetunes Small Subnetworks in Large Language Models”

From DeepSeek V3 Base to DeepSeek R1 Zero, a whopping 86% of parameters were NOT updated during RL training 😮😮
And this isn’t a one-off. The pattern holds across RL algorithms and models.
🧵A Deep Dive