Bowen Peng
@bloc97_
ID: 1699560720738787328
06-09-2023 23:10:35
6 Tweet
481 Followers
67 Following
Announcing Yarn-Mistral-7b-128k! You heard right, 128k (and 64k) context length for Mistral 🥳 🤗128k: huggingface.co/NousResearch/Y… 📜v2: arxiv.org/abs/2309.00071 Special thanks to LAION for the compute support via FZ Jülich-JSC Along with Bowen Peng EnricoShippole Honglu Fan
Just finished recording a 2 hr podcast with the Nous Research DisTrO team about their upcoming paper. Haven't been this excited in a while. We are entering a new era in distributed systems H/t to Teknium (e/λ) for putting this on my radar!