Shujian Zhang (@zhang_shujian) Twitter Tweets • TwiCopy

Shujian Zhang

@zhang_shujian

Research Scientist @GoogleDeepmind | Ph.D. @UTAustin

ID: 1282129122333007872

Website: https://szhang42.github.io/

Joined: 12-07-2020 01:46:24

20 Tweets

39 Followers

49 Following

Wenxuan Zhou

@wenxuanzhou_96

a year ago

Introducing WPO: Enhancing RLHF with Weighted Preference Optimization 🌟 Our new preference optimization method reweights preference data to simulate on-policy preference optimization using off-policy data, combining efficiency with high performance. ✅ up to 5.6% better than

Likes: 20 | Replies: 5 | Reposts: 10

Yuxin Xiao

@yuxinxiao6

a year ago

🚨 Excited to share our new preprint with Shujian Zhang, Wenxuan Zhou, Marzyeh, and Sanqiang Zhao. 🚀 TLDR: We propose SFTMix, a novel recipe that elevates language model instruction tuning without relying on expensive, well-curated datasets. arxiv.org/abs/2410.05248

Likes: 7 | Replies: 1 | Reposts: 4

Tong Wu

@tongwu_pton

a year ago

How can LLM architecture recognize Instruction Hierarchy? 🚀 Excited to share our latest work on Instructional Segment Embedding (ISE)! A technique embeds Instruction Hierarchy directly into LLM architecture, significantly boosting LLM safety. 🧵[1/n]

Likes: 9 | Replies: 1 | Reposts: 4

Tong Wu

@tongwu_pton

a year ago

📝 For more details and experiments, please check our arxiv: arxiv.org/abs/2410.09102 Thanks to all of my amazing collaborators 🙌 Wenxuan Zhou Shujian Zhang KaiqiangSong Silei Xu xcc Prof. Prateek Mittal and more colleagues from @zoom. 🧵[7/n]

Likes: 4 | Replies: 1 | Reposts: 3

John Lambert

@jlambert_

10 months ago

I am hiring a Student Researcher at Google DeepMind for 2025! 👩‍🔬 Interested in improving multi-turn optimization and reasoning capabilities of LLMs? 🧑‍🎓 Currently studying for a Bachelor's/Master's/PhD? 🧑‍💻 Have solid engineering and research skills? 🌟 We want to hear from you!

Likes: 845 | Replies: 21 | Reposts: 90

Shujian Zhang

@zhang_shujian

5 months ago

Our Gemini 2.5 tech report is out on Arxiv (arxiv.org/pdf/2507.06261)! Nice work from the team! 🌟

Likes: 0 | Replies: 0 | Reposts: 0