Sam Ade Jacobs (@samadejacobs) 's Twitter Profile
Sam Ade Jacobs

@samadejacobs

PhD in Computer Science (Texas A&M University); R&D expertise and experience in advanced large-scale big-data (graph) analytics, machine (deep) learning, and robotics

ID: 190997720

Joined: 15-09-2010 11:00:57

196 Tweets

97 Followers

118 Following

Brian Spears (@bkspears9) 's Twitter Profile Photo

Exhausted by an amazing day 1. Fusion ignition announced 2. Watch party with the team 3. Friends and colleagues sharing the excitement with the world 4. Daughters sending me screenshots with breaking news about mom and dad’s work! 5. Being a part of something amazing for humanity

OpenAI (@openai) 's Twitter Profile Photo

We trained an AI using process supervision — rewarding the thought process rather than the outcome — to achieve a new state-of-the-art in mathematical reasoning. Encouraging sign for alignment of advanced AIs: …openai.com/research/impro…
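The distinction the tweet draws can be made concrete with a toy sketch. All names and scores below are hypothetical, for intuition only: outcome supervision scores only the final answer, while process supervision scores each reasoning step (in practice, via a learned process reward model), so a flawed intermediate step is penalized even when the final answer happens to be right.

```python
# Toy contrast of outcome vs. process supervision (hypothetical scores).

def outcome_reward(final_answer, correct_answer):
    """Outcome supervision: one reward based only on the final answer."""
    return 1.0 if final_answer == correct_answer else 0.0

def process_reward(step_scores):
    """Process supervision: each reasoning step is scored individually;
    here we simply average the per-step scores."""
    return sum(step_scores) / len(step_scores)

# A solution whose middle step is wrong but whose final answer is right:
steps = ["2 + 2 = 4", "4 * 3 = 13", "13 - 1 = 12"]
print(outcome_reward(final_answer=12, correct_answer=12))  # 1.0: flaw invisible
print(process_reward([1.0, 0.0, 1.0]))                     # ~0.67: flaw penalized
```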

DeepSpeed (@deepspeedai) 's Twitter Profile Photo

Want to train 1 million token context lengths (all 7 of the Harry Potter books!📚) on a GPT-like model w. 64 GPUs? 

Announcing DeepSpeed-Ulysses🚀

This release enables highly efficient and scalable LLM training with extremely long sequence lengths🤯

github.com/microsoft/Deep…
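The core idea behind Ulysses can be checked with shape arithmetic. The sketch below uses the tweet's 1M tokens and 64 GPUs; the head count and head dimension are hypothetical. Each GPU holds N/P tokens outside attention; an all-to-all swaps the split axis from sequence to heads so every GPU sees the full sequence for H/P heads during attention, then a second all-to-all swaps back — per-GPU activation volume is identical in both layouts.

```python
# Shape sketch of DeepSpeed-Ulysses partitioning (H and d are hypothetical).
P = 64            # GPUs / sequence-parallel degree (from the tweet)
N = 1_000_000     # total sequence length (from the tweet)
H = 64            # attention heads (assumed)
d = 128           # per-head dimension (assumed)

tokens_per_gpu = N // P   # outside attention: 15_625 tokens, all H heads
heads_per_gpu = H // P    # inside attention: 1 head, all N tokens

# The all-to-all only re-partitions; per-GPU element count is unchanged:
assert tokens_per_gpu * H * d == N * heads_per_gpu * d
print(tokens_per_gpu, heads_per_gpu)
```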
DeepSpeed (@deepspeedai) 's Twitter Profile Photo

🚀Exciting new updates on #DeepSpeed ZeRO-Inference with 20X faster generation!  

- 4x lower memory usage through 4-bit weight quantization with no code change needed.

- 4x larger batch sizes through KV cache offloading.

Available in DeepSpeed v0.10.3: aka.ms/z3-inference
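The two 4x claims follow from simple arithmetic. A back-of-envelope sketch (the model size below is hypothetical): 16-bit weights shrink 4x when stored in 4 bits, and offloading the KV cache to CPU/NVMe frees GPU memory that can then hold roughly 4x more concurrent sequences.

```python
# Back-of-envelope for the 4x memory claim (70B is an assumed model size).
params = 70e9
fp16_bytes = params * 2.0    # 16-bit weights: 2 bytes each
int4_bytes = params * 0.5    # 4-bit weights: half a byte each
reduction = fp16_bytes / int4_bytes
print(reduction)  # 4.0: the 4x lower weight memory from 4-bit quantization
# With the KV cache offloaded off-GPU, the freed memory similarly admits
# ~4x larger batches, per the announcement.
```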
DeepSpeed (@deepspeedai) 's Twitter Profile Photo

🚀Introducing #DeepSpeed-VisualChat! 🖼📜

- Multi-image, multi-round #dialogues

- Novel #MultiModal causal attention

- Enriched training data via improved blending techniques

- Unmatched #scalability (>70B params)

Blog: github.com/microsoft/Deep…

Paper: arxiv.org/abs/2309.14327
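For readers new to the attention-masking vocabulary here, the sketch below shows the standard causal baseline over a mixed image+text token sequence. Note this is only the baseline: the paper's multimodal causal attention modifies how image and text tokens attend to one another (see arxiv.org/abs/2309.14327 for the actual scheme).

```python
import numpy as np

# Standard causal mask: token i may attend to token j only when j <= i.
def causal_mask(n):
    return np.tril(np.ones((n, n), dtype=bool))

m = causal_mask(4)
print(m[2, 1], m[1, 2])  # True False: later tokens see earlier, not vice versa
```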
DeepSpeed (@deepspeedai) 's Twitter Profile Photo

Introducing DeepSpeed-FastGen 🚀

Serve LLMs and generative AI models with
- 2.3x higher throughput
- 2x lower average latency 
- 4x lower tail latency
w. Dynamic SplitFuse batching

Auto TP, load balancing w. perfect linear scaling, plus easy-to-use API

github.com/microsoft/Deep…
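Dynamic SplitFuse, the batching scheme credited for the latency wins above, can be sketched in a few lines. The budget and request sizes below are toy values: long prompts are split into fixed-size chunks and fused with single-token decode requests, so every forward pass carries a near-constant token count — that uniformity is what smooths tail latency while keeping the GPU saturated.

```python
# Toy Dynamic SplitFuse scheduler (BUDGET and lengths are hypothetical;
# real systems use token budgets in the hundreds or thousands).
BUDGET = 8

def schedule(prompt_tokens_left, decoding_requests):
    """Fill one forward pass: decode tokens first, then one prompt chunk."""
    batch = [("decode", 1) for _ in range(min(decoding_requests, BUDGET))]
    room = BUDGET - len(batch)
    chunk = min(room, prompt_tokens_left)
    if chunk:
        batch.append(("prefill", chunk))
    return batch

passes = []
left, decoders = 20, 3           # one 20-token prompt, 3 streams decoding
while left > 0:
    batch = schedule(left, decoders)
    left -= sum(n for kind, n in batch if kind == "prefill")
    passes.append(batch)

# Every forward pass carries exactly BUDGET tokens until the prompt drains.
print(len(passes), [sum(n for _, n in b) for b in passes])
```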
OpenAI (@openai) 's Twitter Profile Photo

We're rolling out new features and improvements that developers have been asking for: 1. Our new model GPT-4 Turbo supports 128K context and has fresher knowledge than GPT-4. Its input and output tokens are respectively 3× and 2× less expensive than GPT-4. It’s available now to…

DeepSpeed (@deepspeedai) 's Twitter Profile Photo

🚀 Excited to announce our paper "ZeRO++: Extremely Efficient Collective Communication for Large Model Training" has been accepted at #ICLR2024! 🔍 ZeRO++ significantly reduces communication volume by 4x, achieving up to 3.3x speedup. microsoft.com/en-us/research… #DeepSpeed #AI
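ZeRO++'s three communication reducers are switched on through the DeepSpeed config. The sketch below follows the key names in the public ZeRO++ tutorial (quantized weight all-gather, hierarchical secondary partitioning within a node, quantized gradient reduction); verify them against the docs for your DeepSpeed version.

```python
import json

# ZeRO++ knobs in a DeepSpeed config (names per the ZeRO++ tutorial;
# check against your installed version's schema).
ds_config = {
    "zero_optimization": {
        "stage": 3,
        "zero_quantized_weights": True,    # qwZ: quantize all-gathered weights
        "zero_hpz_partition_size": 8,      # hpZ: secondary shard within a node
        "zero_quantized_gradients": True,  # qgZ: quantized gradient reduction
    }
}
print(json.dumps(ds_config, indent=2))
```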

DeepSpeed (@deepspeedai) 's Twitter Profile Photo

Introducing Mixtral, Phi2, Falcon, and Qwen support in #DeepSpeed-FastGen! 

- Up to 2.5x faster LLM inference 
- Optimized SplitFuse and token sampling
- Exciting new features like RESTful API and more!

For more details: github.com/microsoft/Deep…

#DeepSpeed #AI
Stas Bekman (@stasbekman) 's Twitter Profile Photo

If you were holding off to try @MSFTDeepSpeed ZeRO++ it looks like deepspeed@master should work well now: github.com/microsoft/Deep…

ZeRO++'s main feature is allowing you to use a hybrid approach if you can fit a model on a single node of 8 GPUs. So it takes benefit of the super…

DeepSpeed (@deepspeedai) 's Twitter Profile Photo

Introducing Universal Checkpointing for boosting training efficiency.
- Change parallelism (PP, SP, TP, ZeRO-DP) or GPU count mid-stream
- Improve resilience by scaling down to healthy nodes💪
- Increase throughput by scaling up to elastic nodes🚀

Blog: rb.gy/aup3pn
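The mechanism that makes mid-stream parallelism changes possible can be sketched in miniature. The file layout and helper names below are invented for illustration: per-rank shards are first consolidated into one parallelism-agnostic copy of each parameter, which can then be re-sharded for any new GPU count.

```python
# Toy sketch of the universal-checkpoint idea (names/layout invented):
# consolidate old per-rank shards, then re-shard for a new world size.

def consolidate(shards):
    """Merge the flat shards saved by the old ranks into one full vector."""
    return [x for shard in shards for x in shard]

def reshard(full, new_world_size):
    """Split the consolidated parameter across the new number of ranks."""
    per_rank = len(full) // new_world_size
    return [full[i * per_rank:(i + 1) * per_rank] for i in range(new_world_size)]

old = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12]]  # saved on 4 GPUs
new = reshard(consolidate(old), 3)                     # resume on 3 GPUs
print(new)  # [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]]
```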
DeepSpeed (@deepspeedai) 's Twitter Profile Photo

Announcing that DeepSpeed now runs natively on Windows. This brings DeepSpeed optimizations to Windows users and empowers more people and organizations with AI innovations.
- HF Inference & Finetuning
- LoRA
- CPU Offload

Blog: shorturl.at/a7TF8
DeepSpeed (@deepspeedai) 's Twitter Profile Photo

🚀Introducing Ulysses-Offload🚀

- Unlock the power of long context LLM training and finetuning with our latest system optimizations 
- Train LLaMA3-8B on 2M tokens context using 4xA100-80GB
- Achieve over 55% MFU

Blog: shorturl.at/Spx6Y
Tutorial: shorturl.at/bAWu5
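Why 2M-token training needs offloading is visible in back-of-envelope memory math. Using LLaMA3-8B's public shapes (32 layers, 8 KV heads via grouped-query attention, head dimension 128) in fp16, the K/V activations alone at a 2M-token context approach the entire 320 GB of 4xA100-80GB, before counting weights, optimizer states, and other activations.

```python
# Back-of-envelope K/V activation memory for LLaMA3-8B at 2M tokens (fp16).
layers, kv_heads, head_dim, bytes_fp16 = 32, 8, 128, 2
kv_bytes_per_token = 2 * kv_heads * head_dim * bytes_fp16 * layers  # K and V
seq = 2_000_000
kv_total_gb = kv_bytes_per_token * seq / 1e9
print(kv_bytes_per_token, kv_total_gb)  # 131072 bytes/token -> ~262 GB
# ~262 GB of K/V alone vs. 320 GB total on 4xA100-80GB: hence moving
# long-context activations off the GPU, as Ulysses-Offload does.
```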