BigScience Large Model Training (@bigsciencellm)'s Twitter Profile
BigScience Large Model Training

@bigsciencellm

Follow the training of "BLOOM 🌸", the @BigScienceW multilingual 176B parameter open-science open-access language model, a research tool for the AI community.

ID: 1502036410081202180

Link: https://bigscience.notion.site/BigScience-176B-Model-Training-ad073ca07cdf479398d5f95d88e218c4 | Joined: 10-03-2022 21:40:30

129 Tweets

8.8K Followers

1 Following

Saulnier Lucile (@lucilesaulnier)'s Twitter Profile Photo

🌸 BigScience Research Workshop BLOOM's intermediate checkpoints have already shown some very cool capabilities! What's great about BLOOM is that you can ask it to generate the rest of a text, even if it is not yet fully trained! 👶 🧵 A thread with some examples

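A minimal sketch of the kind of prompt continuation described above, assuming the Hugging Face `transformers` library; the small `bigscience/bloom-560m` checkpoint is used as a stand-in for an intermediate checkpoint of the full model:

```python
# Minimal sketch: asking a BLOOM checkpoint to continue a text.
# The checkpoint below is a small released variant used as a stand-in;
# any BLOOM checkpoint follows the same pattern.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "bigscience/bloom-560m"  # stand-in; swap in another checkpoint if desired

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

prompt = "The BigScience workshop is training a multilingual language model because"
inputs = tokenizer(prompt, return_tensors="pt")

# Sample a continuation of the prompt
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
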
BigScience Large Model Training (@bigsciencellm)'s Twitter Profile Photo

For 111 days, we've enjoyed world-class hardware stability and throughput thanks to the hard work of our friends at Genci, @INS2I_CNRS, Megatron & DeepSpeed. Having reached our objective earlier than expected, we'll keep training for a few more days. Stay tuned, more soon ;)

BigScience Research Workshop (@bigsciencew)'s Twitter Profile Photo

BLOOM is here. The largest open-access multilingual language model ever. Read more about it or get it at bigscience.huggingface.co/blog/bloom hf.co/bigscience/bloโ€ฆ

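For reference, a hedged sketch of one way to pull the released weights from the Hub with `transformers` (the full 176B model needs several hundred GB of memory; the smaller released variants use the same call):

```python
# Sketch: downloading BLOOM weights from the Hugging Face Hub.
# Loading the full 176B model requires very large amounts of memory;
# smaller variants (e.g. bloom-1b7, bloom-7b1) follow the same pattern.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "bigscience/bloom"  # or a smaller variant such as "bigscience/bloom-7b1"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",           # shard layers across available devices (needs `accelerate`)
    torch_dtype=torch.bfloat16,  # match the released weight precision
)
```
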
Hugging Face (@huggingface)'s Twitter Profile Photo

The Technology Behind BLOOM Training 🌸 Discover how BigScience Research Workshop used Microsoft Research DeepSpeed + NVIDIA Megatron-LM technologies to train the World's Largest Open Multilingual Language Model (BLOOM): huggingface.co/blog/bloom-megโ€ฆ
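The linked post covers the actual Megatron-DeepSpeed setup; purely as an illustration of the DeepSpeed side, a training script wraps the model with a small config like the placeholder below (values are illustrative, not BLOOM's real settings):

```python
# Rough sketch of wrapping a model with DeepSpeed; run under the `deepspeed`
# launcher. Config values are placeholders for illustration only; BLOOM's
# actual training used the Megatron-DeepSpeed fork with 3D parallelism.
import deepspeed
import torch

model = torch.nn.Linear(1024, 1024)  # stand-in for a real transformer

ds_config = {
    "train_micro_batch_size_per_gpu": 2,
    "gradient_accumulation_steps": 8,
    "bf16": {"enabled": True},
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    "zero_optimization": {"stage": 1},  # ZeRO stage 1: shard optimizer states
}

# Returns the wrapped engine plus optimizer / dataloader / scheduler handles
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```
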

clem 🤗 (@clementdelangue)'s Twitter Profile Photo

What do Stability AI Emad #stablediffusion & BigScience Research Workshop Bloom - aka the coolest new models ;) - have in common? They both use a new gen of ML licenses aimed at making ML more open & inclusive while keeping it harder to do harm with them. So cool! huggingface.co/blog/open_rail

Niklas Muennighoff (@muennighoff)'s Twitter Profile Photo

Crosslingual Generalization through Multitask Finetuning 🌸 Demo: huggingface.co/bigscience/bloโ€ฆ 📜 arxiv.org/abs/2211.01786 💻 github.com/bigscience-worโ€ฆ We present BLOOMZ & mT0, a family of models w/ up to 176B params that follow human instructions in >100 languages zero-shot. 1/7

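A hedged sketch of the zero-shot instruction following described in the thread, assuming the `transformers` text-generation pipeline and one of the smaller released BLOOMZ checkpoints:

```python
# Sketch: zero-shot instruction following with a small BLOOMZ checkpoint.
# Prompts can be written in any of the supported languages; the checkpoint
# name below is one of the smaller released variants, used for illustration.
from transformers import pipeline

generator = pipeline("text-generation", model="bigscience/bloomz-560m")

prompt = "Translate to Spanish: I love open science."
print(generator(prompt, max_new_tokens=20)[0]["generated_text"])
```
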
clem 🤗 (@clementdelangue)'s Twitter Profile Photo

The Bloom paper is out. Looks like it's doing worse than the current GPT-3 API on zero-shot generation tasks in English, but better than other open-source LLMs & better than all in zero-shot multilingual (which was the main goal). Proud of the work from the community! arxiv.org/abs/2211.05100
