Bowen Peng (@bloc97_) 's Twitter Profile
Bowen Peng

@bloc97_

ID: 1699560720738787328

calendar_today06-09-2023 23:10:35

6 Tweet

481 Followers

67 Following

emozilla (@theemozilla) 's Twitter Profile Photo

Announcing Yarn-Mistral-7b-128k! You heard right, 128k (and 64k) context length for Mistral 🥳 🤗128k: huggingface.co/NousResearch/Y… 📜v2: arxiv.org/abs/2309.00071 Special thanks to LAION for the compute support via FZ Jülich-JSC Along with Bowen Peng EnricoShippole Honglu Fan

Announcing Yarn-Mistral-7b-128k!

You heard right, 128k (and 64k) context length for Mistral 🥳

🤗128k: huggingface.co/NousResearch/Y…
📜v2: arxiv.org/abs/2309.00071

Special thanks to <a href="/laion_ai/">LAION</a> for the compute support via <a href="/fzj_jsc/">FZ Jülich-JSC</a> 

Along with <a href="/bloc97_/">Bowen Peng</a> <a href="/EnricoShippole/">EnricoShippole</a> <a href="/Void13950782/">Honglu Fan</a>
Together AI (@togethercompute) 's Twitter Profile Photo

Announcing StripedHyena 7B — an open source model using an architecture that goes beyond Transformers achieving faster performance and longer context. It builds on the lessons learned in past year designing efficient sequence modeling architectures. together.ai/blog/stripedhy…

Announcing StripedHyena 7B — an open source model using an architecture that goes beyond Transformers achieving faster performance and longer context.

It builds on the lessons learned in past year  designing efficient sequence modeling architectures.

together.ai/blog/stripedhy…
emozilla (@theemozilla) 's Twitter Profile Photo

Proud to announce that YaRN (arxiv.org/abs/2309.00071) got accepted to ICLR 2024! Our very own Bowen Peng will be in Vienna to present it 🥳 To celebrate, we're releasing YaRN versions of Upstage's SOLAR model 🤗 huggingface.co/NousResearch/Y… 🤗 huggingface.co/NousResearch/Y…

Proud to announce that YaRN (arxiv.org/abs/2309.00071) got accepted to ICLR 2024! Our very own <a href="/bloc97_/">Bowen Peng</a> will be in Vienna to present it 🥳

To celebrate, we're releasing YaRN versions of <a href="/upstageai/">Upstage</a>'s SOLAR model

🤗 huggingface.co/NousResearch/Y…
🤗 huggingface.co/NousResearch/Y…
Nous Research (@nousresearch) 's Twitter Profile Photo

What if you could use all the computing power in the world to train a shared, open source AI model? Preliminary report: github.com/NousResearch/D… Nous Research is proud to release a preliminary report on DisTrO (Distributed Training Over-the-Internet) a family of

What if you could use all the computing power in the world to train a shared, open source AI model?

Preliminary report: github.com/NousResearch/D…

Nous Research is proud to release a preliminary report on DisTrO (Distributed Training Over-the-Internet) a family of
Anjney Midha 🇺🇸 (@anjneymidha) 's Twitter Profile Photo

Just finished recording a 2 hr podcast with the Nous Research DisTrO team about their upcoming paper. Haven't been this excited in a while. We are entering a new era in distributed systems H/t to Teknium (e/λ) for putting this on my radar!