
Yufan Zhuang
@yufan_zhuang
LLM & GenAI Researcher | phd student @UCSanDiego | prev @AMD @Meta @MSFTResearch @IBMResearch
ID: 973591762756280320
https://evanzhuang.github.io/ 13-03-2018 16:08:59
886 Tweet
246 Followers
275 Following




Our results overall suggest that we can effectively separate harmful from harmless data and use pretraining data filtering to improve model safety without compromising usefulness. Big thanks to the team! 🙏 Mycal Tucker, Nina, Tony Wang 🐨, Francesco Mosconi,






appreciate Thinking Machines taking an open research approach! excited to see the first blog mentioned our work! truly on-policy RL is like RTX3090 for gamers in 2020 - you really want it, but the blockers make your head itch… kernel mismatches, parallelism mismatches, etc. etc.









