Colin Raffel
@colinraffel
nonbayesian parameterics, sweet lessons, and random birds.
Friend of @srush_nlp
ID:837133583558987776
http://www.colinraffel.com 02-03-2017 02:52:54
1,5K Tweets
30,2K Followers
654 Following
🚀 Introducing Pile-T5!
🔗 We (EleutherAI) are thrilled to open-source our latest T5 model trained on 2T tokens from the Pile using the Llama tokenizer.
✨ Featuring intermediate checkpoints and a significant boost in benchmark performance.
Work done by Lintang Sutawika, me…
I love music most when it’s live, in the moment, and expressing something personal.
This is why I’m psyched about the new “DJ mode” we developed for MusicFX: aitestkitchen.withgoogle.com/tools/music-fx…
It’s an infinite AI jam that you control 🎛️. Try mixing your unique 🌀 of instruments, genres,…
Crowd-sourcing human feedback for open-source LLMs? 💬🤖
Let's make it happen together! 💪
chromewebstore.google.com/detail/sharelm…
W ♻️ Leshem Choshen ♻️ Omri Abend
I'll be at #NeurIPS2023 supporting my collaborators who are presenting arxiv.org/abs/2306.01708, arxiv.org/abs/2305.16264, arxiv.org/abs/2302.00674, and neurips.cc/virtual/2023/p…. Find me to chat about decentralizing/democratizing/de-risking ML!
New preprint! Introducing MaTS - a new framework for merging individual task models into a multitask model by matching them in their task subspace
Work done w/ Mohit Bansal Colin Raffel
📄 arxiv.org/abs/2312.04339
💾 github.com/r-three/mats
🧵 ⬇️
Presenting ComPEFT 🗜!
We compress parameter updates to facilitate efficient communication of expert models for compositional generalization. ComPEFT improves perf. 📈, while reducing storage/communication costs 📉
buff.ly/49Qaryo
♻️ Leshem Choshen ♻️ Colin Raffel Mohit Bansal
🧵
Our work on Data Augmentation for Learning from Limited Data has been accepted to #TACL ! We are presenting it at #ACL2023 on Wed 11:00-12:30 in Session 7.
Paper: transacl.org/index.php/tacl…
Poster + Video: virtual2023.aclweb.org/paper_T4291.ht…
Jiaao Chen Colin Raffel Mohit Bansal Diyi Yang
We just pushed a new update adding support for the (very impressive) safetensors library from our friends at Hugging Face!
Git-Theta's plug-in system meant that we spent more time waiting on CI/CD than actually adding support (I'll get off my soapbox now 🧼📦).