Naman Goyal (@namangoyal21) 's Twitter Profile
Naman Goyal

@namangoyal21

Research @thinkymachines, previously pretraining Llama at GenAI Meta

ID: 941156280

Joined: 11-11-2012 12:10:24

197 Tweets

1.1K Followers

591 Following

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

The False Promise of Imitating Proprietary LLMs

Open-sourced LLMs are adept at mimicking ChatGPT’s style but not its factuality. There exists a substantial capabilities gap, which requires better base LM.

arxiv.org/abs/2305.15717
Armen Aghajanyan (@armenagha) 's Twitter Profile Photo

I’m excited to release our most recent work setting a new SOTA FID of 4.88 on text-to-image generation we call CM3Leon (pronounced chameleon)! ai.meta.com/research/publi…

Naman Goyal (@namangoyal21) 's Twitter Profile Photo

Finished 30/30 radiation therapy sessions today. The past 3-4 months have been one of the most challenging parts of my life. Recovery from surgery and radiation therapy was quite physically and mentally challenging. With due respect, Cancer, please stay away from me from now on.

Mannat Singh (@mannat_singh) 's Twitter Profile Photo

Excited to share Emu Video, for high quality video generation! Our factorized {text}-to-image generation followed by {image, text}-to-video generation approach outperforms all prior work & commercial solutions in human evals. Demo + blog + paper: emu-video.metademolab.com #emuvideo

Mike Lewis (@ml_perception) 's Twitter Profile Photo

Excited to share a preview of Llama3, including the release of an 8B and 70B (82 MMLU, should be the best open weights model!), and preliminary results for a 405B model (still training, but already competitive with GPT4). Lots more still to come... ai.meta.com/blog/meta-llam…

Ahmad Al-Dahle (@ahmad_al_dahle) 's Twitter Profile Photo

It’s here! Meet Llama 3, our latest generation of models that is setting a new standard for state-of-the art performance and efficiency for openly available LLMs.

Key highlights

  • 8B and 70B parameter openly available pre-trained and fine-tuned models.
  • Trained on more
Naman Goyal (@namangoyal21) 's Twitter Profile Photo

Really proud of the work that went into making this possible; hope this helps the community push the field forward. Also, in case anyone missed it, there's a sneak peek of what's to come next at the end of the blog post ai.meta.com/blog/meta-llam…

Naman Goyal (@namangoyal21) 's Twitter Profile Photo

Got curious about this. Suggests an average case of reaching a 1e6 × GPT-4 (or ~3e31 FLOPs) model by 2028. At 2500 bf16 TFLOPS and 1.2 kW per B100, that would require roughly ~456 GW of power to train in 6 months. Which, afaik, is roughly the United States's entire electricity usage in 2023.
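The estimate above can be sketched directly. All inputs are the tweet's stated assumptions (~3e31 total FLOPs, 2500 bf16 TFLOPS and 1.2 kW per B100, a 6-month window, 100% utilization); note that taken at face value these inputs land near ~900 GW, so the quoted ~456 GW figure implicitly assumes roughly 2× the effective throughput or a doubled training window.

```python
# Back-of-envelope power estimate for a 1e6x-GPT-4-scale training run.
# Every constant below is an assumption taken from the tweet, not a
# measured figure.

TOTAL_FLOPS = 3e31            # ~1e6 x an assumed GPT-4-scale compute budget
TFLOPS_PER_GPU = 2500         # assumed bf16 throughput of one B100
WATTS_PER_GPU = 1200          # assumed per-B100 board power (1.2 kW)
SECONDS = 6 * 30 * 24 * 3600  # ~6-month training window

# FLOPs one GPU can deliver over the whole window, at full utilization
flops_per_gpu = TFLOPS_PER_GPU * 1e12 * SECONDS

# GPUs needed to hit the total budget, and their aggregate power draw
num_gpus = TOTAL_FLOPS / flops_per_gpu
power_gw = num_gpus * WATTS_PER_GPU / 1e9

print(f"{num_gpus:.2e} GPUs, ~{power_gw:.0f} GW sustained")
```

For reference, average US electricity consumption in 2023 was on the order of 450-470 GW, which is why the tweet compares the two.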

Naman Goyal (@namangoyal21) 's Twitter Profile Photo

This is extremely exciting; looking forward to the impact it will have on biology. The team behind EvolutionaryScale is one of the most talented and passionate groups of people I have interacted with.

Naman Goyal (@namangoyal21) 's Twitter Profile Photo

Very excited to release the technical report and the model weights for all 3 sizes of llama3 models. It has been an exciting past 12 months. Really looking forward to the incredible research this will unlock from the community. Now on to llama4 🚀

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Does style matter over substance in Arena? Can models "game" human preference through lengthy and well-formatted responses?

Today, we're launching style control in our regression model for Chatbot Arena — our first step in separating the impact of style from substance in
Naman Goyal (@namangoyal21) 's Twitter Profile Photo

Congrats to amazing friends and ex-colleagues on a killer release! Pushing the frontier of open-source models pushes the field collectively forward!

Vijay (@__tensorcore__) 's Twitter Profile Photo

🚨🔥 CUTLASS 4.0 is released 🔥🚨

pip install nvidia-cutlass-dsl

4.0 marks a major shift for CUTLASS: towards native GPU programming in Python

docs.nvidia.com/cutlass/media/…
Naman Goyal (@namangoyal21) 's Twitter Profile Photo

The past 4 months have been among the most rewarding of my career—filled with learning and building alongside some of the most talented ML research and infra folks I know. I truly believe magic happens when driven, talented people are aligned on a shared mission.