Crick Wu
@crickwu
Software Engineer @ Google
ID: 2839636832
04-10-2014 04:34:38
8 Tweets
22 Followers
42 Following
This work is the result of a wonderful collaboration with Junwen Bai, @sidbrahma, Joshua Ainslie, Kenton Lee, Zhou Yanqi, Nan Du, Vincent Y. Zhao, Crick Wu, Bo Li, Yu Zhang, and Ming-Wei Chang. We believe CoDA and CoLT5 highlight the potential of conditional computation for efficiency.
MoE Meets Instruction Tuning: A Winning Combination for Large Language Models [1/3] arxiv.org/pdf/2305.14705 Sheng Shen, Zhou Yanqi, Barret Zoph, William Fedus, Jason Wei, Hyung Won Chung, Shayne Longpre, Xinyun Chen, @tuvuumass, Crick Wu, Albert Webson, Vincent Y. Zhao, Denny Zhou