
Marija Stanojevic
@mstanojevic118
ML, NLP, ML4Health, MultiModality, and STEM geek. Travel enthusiast. [email protected]
ID: 150602726
https://marija-stanojevic.github.io/ 01-06-2010 10:18:23
305 Tweet
287 Followers
110 Following




Dimitris Papailiopoulos I'm biased but I think this paper is pretty cool too arxiv.org/abs/2311.13647 (at ICLR this year)











a tiny bit of a cat is out now; we train our own large (medium) sized LM on our own proprietary data from scratch ourselves at Prescient Design and Genentech . very easy in my opinion, and Keunwoo Choi hates it whenever i say this 😂


Introducing Jamba, our groundbreaking SSM-Transformer open model! As the first production-grade model based on Mamba architecture, Jamba achieves an unprecedented 3X throughput and fits 140K context on a single GPU. 🥂Meet Jamba ai21.com/jamba 🔨Build on Hugging Face



