AI21 Labs (@AI21Labs)'s Twitter Profile
AI21 Labs

@AI21Labs

AI21 Labs builds Foundation Models and AI Systems for the enterprise that accelerate the use of GenAI in production.

🥂Meet Jamba
https://t.co/xUBjKZHKVH

ID: 1166332664569368576

Link: http://www.ai21.com · Joined: 27-08-2019 12:52:51

259 Tweets

6.2K Followers

90 Following

NYSE 🏛 (@NYSE)

Ori Goshen, Co-Founder and Co-CEO of AI21 Labs, talks about growth opportunities for the company following a $208 million Series C funding round, and shares his perspective on the future of AI with Judy Khan Shaw.

AI21 Labs (@AI21Labs)

Building a RAG solution is easy. Building a great one is not.

In our guest blog on Streamlit, our team explores the intricacies of how AI21's Contextual Answers Task-Specific Model & our RAG Engine generate context-based answers grounded in your proprietary organizational data.…
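For a concrete feel of what a context-grounded call looks like, here is a minimal sketch of hitting a Contextual Answers style endpoint over plain HTTP. The endpoint path, request fields, and response shape are written from memory of AI21 Studio's public docs and should be treated as assumptions to verify against the current API reference; the document text and question are invented.

```python
# Hedged sketch: asking a question that must be answered only from the
# supplied context. URL and field names are assumptions, not verified.
import os

import requests

API_KEY = os.environ["AI21_API_KEY"]  # your AI21 Studio API key

context = (
    "Acme Corp travel policy: economy class is required for flights under "
    "six hours; business class may be booked for longer flights."
)
question = "Can I book business class for a four-hour flight?"

resp = requests.post(
    "https://api.ai21.com/studio/v1/answer",          # assumed endpoint path
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={"context": context, "question": question},  # answer is grounded in `context`
    timeout=30,
)
resp.raise_for_status()
print(resp.json().get("answer"))  # expected to be empty/None if the context has no answer
```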

meng shao (@shao__meng)

The Jamba paper: A Hybrid Transformer-Mamba Language Model

Jamba is a novel architecture that combines Attention and Mamba layers with MoE modules, together with an open implementation, reaching state-of-the-art performance and supporting long contexts.

We showed how Jamba provides…

AK (@_akhaliq)

Jamba

A Hybrid Transformer-Mamba Language Model

We present Jamba, a new base large language model based on a novel hybrid Transformer-Mamba mixture-of-experts (MoE) architecture. Specifically, Jamba interleaves blocks of Transformer and Mamba layers, enjoying the benefits of…

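To make the interleaving concrete, here is a small, purely illustrative Python sketch of the layer layout such a hybrid stack implies. The ratios used (8 layers per block, 1 attention layer to 7 Mamba layers, MoE in place of the dense MLP on every other layer, 16 experts with top-2 routing) are the figures reported for the released Jamba checkpoint; the exact position of the attention layer inside a block is an assumption made here for illustration.

```python
# Illustrative layout plan only -- not AI21's implementation.
LAYERS_PER_BLOCK = 8   # layers in one Jamba block
NUM_BLOCKS = 4         # blocks stacked in the released model
ATTENTION_INDEX = 4    # assumed slot of the single attention layer per block
MOE_EVERY = 2          # MoE replaces the dense MLP on every other layer


def jamba_layer_plan():
    """Label every layer in the stack with its (mixer, mlp) combination."""
    plan = []
    for _ in range(NUM_BLOCKS):
        for i in range(LAYERS_PER_BLOCK):
            mixer = "attention" if i == ATTENTION_INDEX else "mamba"
            mlp = "moe(16 experts, top-2)" if i % MOE_EVERY == 1 else "dense-mlp"
            plan.append((mixer, mlp))
    return plan


if __name__ == "__main__":
    for idx, (mixer, mlp) in enumerate(jamba_layer_plan()):
        print(f"layer {idx:02d}: {mixer:9s} + {mlp}")
```

Printed out, the plan makes the key property visible: most layers are Mamba (constant-size state, so long contexts stay cheap), with an occasional attention layer and roughly half the MLPs swapped for sparse MoE capacity.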
Philipp Schmid (@_philschmid)

Last week, AI21 Labs released the production-scale Mamba implementation, and today they released their paper. 🧐
Jamba introduces a new hybrid Transformer-Mamba mixture-of-experts architecture offering state-of-the-art performance but with significant improvements on long…

Philipp Schmid (@_philschmid)

Yesterday AI21 Labs released Jamba, the first production-scale Mamba implementation, as a hybrid SSM-Transformer MoE 🐍 And today, you can already fine-tune it with Hugging Face TRL.

Alexander Doria shared a working QLoRA script using 4-bit quantization on an A100 GPU (for now,…

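As a rough sketch of what that TRL + QLoRA setup can look like, the snippet below loads the public ai21labs/Jamba-v0.1 checkpoint in 4-bit, attaches a small LoRA adapter, and runs TRL's SFTTrainer on a tiny stand-in dataset. The LoRA target modules, hyperparameters, and demo dataset are assumptions chosen for illustration, not the script referenced in the tweet, and the SFTTrainer keyword arguments follow the TRL API current at the time.

```python
# Hedged sketch of QLoRA fine-tuning Jamba with Hugging Face TRL.
# Assumptions: ~80GB of GPU memory (e.g. an A100), and that the listed
# target_modules match the attention projections in the HF Jamba port.
import torch
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from trl import SFTTrainer

model_id = "ai21labs/Jamba-v0.1"

# 4-bit NF4 quantization so the 52B-parameter MoE fits on a single card
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
    trust_remote_code=True,  # needed before Jamba landed natively in transformers
)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Small LoRA adapter on the attention projections (module names are an assumption)
peft_config = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

# Tiny public dataset used purely as a placeholder for your own SFT data
dataset = load_dataset("Abirate/english_quotes", split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    peft_config=peft_config,
    dataset_text_field="quote",
    max_seq_length=512,
)
trainer.train()
```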
1LittleCoder💻 (@1littlecoder)

🥹 Jamba is truly amazing!

Everyone talks about long context. But so far it has mostly been useful for ingesting in-context learning examples.

Jamba seems to be the first model offering great throughput even at higher context lengths!

AI21 Labs (@AI21Labs)

This Jamba overview is a great way to quickly understand the novel features Jamba brings to the dev community. Thanks Ai Flux!

Maxime Labonne (@maximelabonne)

I played a little with Jamba: it looks like an amazing model.

In terms of architecture, the MoE implementation is very close to Mixtral's. What's great about it is that it hasn't been fine-tuned. Curious to see how much improvement we can get through SFT.

I made a little…

Aleksa Gordić 🍿🤖 (@gordic_aleksa)

Extremely cool new model release from AI21 Labs: Jamba. And it's not even a transformer! It's a hybrid model that combines Mamba (a structured state space model), transformer layers, and the MoE technique, and it's the first production-grade Mamba-based model!

* It's a 52B MoE with 12B…

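The "52B MoE with 12B…" line is the total-versus-active parameter distinction: with top-2 routing over 16 experts, only a fraction of the expert weights participate in any one forward pass. The toy arithmetic below uses a made-up split between shared and expert parameters purely to show the mechanism; it is not Jamba's real breakdown.

```python
# Toy illustration of total vs. active parameters in a top-2-of-16 MoE.
# The shared/expert split below is invented for the example.
TOTAL_EXPERTS = 16
ACTIVE_EXPERTS = 2            # top-2 routing

shared_params_b = 6.0         # hypothetical non-expert parameters (billions)
expert_params_b = 46.0        # hypothetical parameters across all experts (billions)

total_b = shared_params_b + expert_params_b
active_b = shared_params_b + expert_params_b * ACTIVE_EXPERTS / TOTAL_EXPERTS

print(f"total parameters:  {total_b:.1f}B")   # 52.0B in this toy split
print(f"active per token:  {active_b:.1f}B")  # 11.8B in this toy split
```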
swyx (@swyx)

incredibly impressed by AI21 Labs' Jamba today. This is the first legitimate Mixtral-killer we've seen and it came out of 'nowhere':

buttondown.email/ainews/archive…

They've helped me redefine my idea of a model 'weight class' from 'number of parameters' (increasingly outdated with MoEs…
