Shujian Zhang (@zhang_shujian) 's Twitter Profile
Shujian Zhang

@zhang_shujian

Research Scientist @GoogleDeepmind | Ph.D. @UTAustin

ID: 1282129122333007872

linkhttps://szhang42.github.io/ calendar_today12-07-2020 01:46:24

20 Tweet

39 Takipรงi

49 Takip Edilen

Wenxuan Zhou (@wenxuanzhou_96) 's Twitter Profile Photo

Introducing WPO: Enhancing RLHF with Weighted Preference Optimization ๐ŸŒŸ Our new preference optimization method reweights preference data to simulate on-policy preference optimization using off-policy data, combining efficiency with high performance. โœ… up to 5.6% better than

Yuxin Xiao (@yuxinxiao6) 's Twitter Profile Photo

๐Ÿšจ Excited to share our new preprint with Shujian Zhang, Wenxuan Zhou, Marzyeh, and Sanqiang Zhao. ๐Ÿš€ TLDR: We propose SFTMix, a novel recipe that elevates language model instruction tuning without relying on expensive, well-curated datasets. arxiv.org/abs/2410.05248

Tong Wu (@tongwu_pton) 's Twitter Profile Photo

How can LLM architecture recognize Instruction Hierarchy? ๐Ÿš€ Excited to share our latest work on Instructional Segment Embedding (ISE)! A technique embeds Instruction Hierarchy directly into LLM architecture, significantly boosting LLM safety. ๐Ÿงต[1/n]

How can LLM architecture recognize Instruction Hierarchy?

๐Ÿš€ Excited to share our latest work on Instructional Segment Embedding (ISE)! A technique embeds Instruction Hierarchy directly into LLM architecture, significantly boosting LLM safety. ๐Ÿงต[1/n]
Tong Wu (@tongwu_pton) 's Twitter Profile Photo

๐Ÿ“ For more details and experiments, please check our arxiv: arxiv.org/abs/2410.09102 Thanks to all of my amazing collaborators ๐Ÿ™Œ Wenxuan Zhou Shujian Zhang KaiqiangSong Silei Xu xcc Prof. Prateek Mittal and more colleagues from @zoom. ๐Ÿงต[7/n]

John Lambert (@jlambert_) 's Twitter Profile Photo

I am hiring a Student Researcher at Google DeepMind for 2025! ๐Ÿ‘ฉ๐Ÿ”ฌ Interested in improving multi-turn optimization and reasoning capabilities of LLMs? ๐Ÿง‘โ€๐ŸŽ“ Currently studying for a Bachelor's/Master's/PhD? ๐Ÿง‘โ€๐Ÿ’ป Have solid engineering and research skills? ๐ŸŒŸWe want to hear from you!

I am hiring a Student Researcher at Google DeepMind for 2025!

๐Ÿ‘ฉ๐Ÿ”ฌ Interested in improving multi-turn optimization and reasoning capabilities of LLMs?
๐Ÿง‘โ€๐ŸŽ“ Currently studying for a Bachelor's/Master's/PhD?
๐Ÿง‘โ€๐Ÿ’ป Have solid engineering and research skills?

๐ŸŒŸWe want to hear from you!