Yuqing Yang
@yyqcode
First-year PhD student @CSatUSC @nlp_usc.
ID: 1670754896352784391
https://ayyyq.github.io/ 19-06-2023 11:26:45
24 Tweets
189 Followers
347 Following
I'll present a poster for Lifelong ICL and Task Haystack at #NeurIPS2024! ⏰ Wednesday 11am-2pm 📍 East Exhibit Hall A-C #2802 📜 arxiv.org/abs/2407.16695 My co-first author Xiaoyue Xu is applying to PhD programs and I am looking for jobs in industry! Happy to connect at NeurIPS!
Running your model on multiple GPUs but often finding the speed unsatisfactory? We introduce Ladder-residual, a parallelism-aware architecture modification that makes 70B Llama with tensor parallelism ~30% faster! Work done at Together AI. Co-first author with Mayank Mishra.