Afroz Mohiuddin (@afrozenator) 's Twitter Profile
Afroz Mohiuddin

@afrozenator

Llama at Meta🦙, ex-Google Brain 🧠. Interested in Science, Psychology, Investing and generally everything.

Good Thoughts, Good Words, Good Deeds.

ID: 32336786

Link: https://github.com/afrozenator · Joined: 17-04-2009 07:00:02

475 Tweets

1.1K Followers

4.4K Following

Anthropic (@anthropicai) 's Twitter Profile Photo

New Anthropic research: Adding Error Bars to Evals. AI model evaluations don’t usually include statistics or uncertainty. We think they should. Read the blog post here: anthropic.com/research/stati…
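
The linked post argues that eval scores should be reported with uncertainty, not as bare point estimates. As a minimal illustration of the idea (a sketch, not Anthropic's actual methodology), a binary-graded eval with independently sampled questions admits a normal-approximation confidence interval on mean accuracy; the `eval_accuracy_with_ci` helper below is hypothetical:

```python
import math

def eval_accuracy_with_ci(correct, total, z=1.96):
    """Mean accuracy with a normal-approximation confidence interval.

    For a binary-graded eval each question scores 0 or 1, so the
    standard error of the mean accuracy p is sqrt(p * (1 - p) / n).
    z = 1.96 gives an approximate 95% interval.
    """
    p = correct / total
    se = math.sqrt(p * (1 - p) / total)
    return p, (p - z * se, p + z * se)

# Example: 850 of 1000 questions answered correctly.
acc, (lo, hi) = eval_accuracy_with_ci(correct=850, total=1000)
print(f"accuracy = {acc:.3f}, 95% CI = [{lo:.3f}, {hi:.3f}]")
```

Even at 1000 questions the interval here spans roughly ±2.2 accuracy points, which is why small leaderboard gaps between models can be statistically meaningless.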

Lilly (@lillybilly299) 's Twitter Profile Photo

When my mother first moved to Japan, she tried to jaywalk while pushing a stroller on an empty residential street. She was immediately stopped by an old, well dressed Japanese man who solemnly told her in perfect English "the downfall of society begins with the individual"

Nikunj Kothari (@nikunj) 's Twitter Profile Photo

I feel a little bit for the Google DeepMind team... You build a world-changing model and everyone is posting Ghibli-fied pictures instead. But this is the core problem with Google - they can build the best models in the world but if they don’t focus on the consumer experience

Aston Zhang (@astonzhangaz) 's Twitter Profile Photo

Our Llama 4’s industry leading 10M+ multimodal context length (20+ hours of video) has been a wild ride. The iRoPE architecture I’d been working on helped a bit with the long-term infinite context goal toward AGI. Huge thanks to my incredible teammates!

🚀Llama 4 Scout
🔹17B
Sharan Narang (@sharan0909) 's Twitter Profile Photo

Very excited to share Llama 4 models with the world. The pre-training team has cooked over the past few months to launch Llama 4 Scout, Maverick, and Behemoth.

A 🧵about pretraining

Blog link:  ai.meta.com/blog/llama-4-m…
AI at Meta (@aiatmeta) 's Twitter Profile Photo

Today is the start of a new era of natively multimodal AI innovation.

Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick —  our most advanced models yet and the best in their class for multimodality.

Llama 4 Scout
• 17B-active-parameter model
Afroz Mohiuddin (@afrozenator) 's Twitter Profile Photo

Extremely proud to have pioneered large scale distillation for Maverick and really delighted to be working alongside an extremely talented team. We truly hope the OSS community enjoys the fruits of our labour.

Dieuwke Hupkes (@_dieuwke_) 's Twitter Profile Photo

So happy our new multilingual benchmark MultiLoKo is finally out (after some sweat and tears!)

arxiv.org/abs/2504.10356

Multilingual eval for LLMs... could be better, and I hope MultiLoKo will help fill some gaps in it + help study design choices in benchmark design

AI at Meta
Lukasz Kaiser (@lukaszkaiser) 's Twitter Profile Photo

o3: Leibniz wanted a single calculus, every thought settled by calculation. An LLM does just that: it turns all words into numbers and learns the patterns that link them. Its CoT is the arithmetic of reasoning. The old universal logic lives now in silicon, humming behind each reply

Afroz Mohiuddin (@afrozenator) 's Twitter Profile Photo

“The test of a first-rate intelligence is the ability to hold two opposing ideas in mind at the same time and still retain the ability to function. One should, for example, be able to see that things are hopeless yet be determined to make them otherwise.”

F. Scott Fitzgerald
Afroz Mohiuddin (@afrozenator) 's Twitter Profile Photo

"Raffiniert ist der Herrgott, aber boshaft ist er nicht"
(God is subtle*, but malicious he is not.)

— Albert Einstein

* Also translated as: tricky, crafty, shrewd, sophisticated

yung macro 宏观年少传奇 (@apralky) 's Twitter Profile Photo

this dostoevsky quote is a massive whitepill if your intuitions are statistically mature btw

if he perma grinded like a good boy instead of "idling around" and "making errors" the probability that he'd have died as a noname normie journalist rounds to 1 

if you pay attention
Jacob Austin (@jacobaustin132) 's Twitter Profile Photo

Today we're putting out an update to the JAX TPU book, this time on GPUs. How do GPUs work, especially compared to TPUs? How are they networked? And how does this affect LLM training? 1/n
