Rohit Patel (@_rohit_patel_) 's Twitter Profile
Rohit Patel

@_rohit_patel_

Director @Meta Superintelligence Labs

ID: 45159013

calendar_today06-06-2009 16:17:08

49 Tweet

140 Followers

47 Following

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet. Today we’re releasing a collection of new Llama 3.1 models including our long awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context

Rohit Patel (@_rohit_patel_) 's Twitter Profile Photo

We're open source releasing the the latest Llama models today. The largest of our models pushes new boundaries in many areas. Can't wait to see how the community will use these: github.com/meta-llama/lla…

Rohit Patel (@_rohit_patel_) 's Twitter Profile Photo

With the release of Meta Llama 3.1 we are putting out the evaluation data for anyone looking to replicate our evals: huggingface.co/meta-llama

Ahmad Al-Dahle (@ahmad_al_dahle) 's Twitter Profile Photo

The team at SambaNova just announced their new API offering: SambaNova Cloud. They’re achieving the fastest inference we’ve seen yet for Llama 70B (570 tokens/s) and 405B (132 tokens/s). Available for free via API with no waitlist today. 👏

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Due to strong community interest, we've collaborated with AI at Meta to compare the bf16 and fp8 versions of Llama-3.1-405b in Chatbot Arena! With over 5K community votes, both versions show similar performance across the board: - Overall: 1266 vs 1266 - Hard prompts: 1267 vs

Due to strong community interest, we've collaborated with <a href="/AIatMeta/">AI at Meta</a> to compare the bf16 and fp8 versions of Llama-3.1-405b in Chatbot Arena!

With over 5K community votes, both versions show similar performance across the board:
- Overall: 1266 vs 1266
- Hard prompts: 1267 vs
Rohit Patel (@_rohit_patel_) 's Twitter Profile Photo

Today we're open source releasing the latest versions of our Llama models, Llama 3.2. We have 1B/3B models for text and 11B/90B multimodal models: ai.meta.com/blog/llama-3-2…

Towards Data Science (@tdatascience) 's Twitter Profile Photo

Curious about Large Language Models but don't know where to start? Rohit Patel's latest article breaks it all down from the basics, requiring only your ability to add and multiply. #LLM #ML towardsdatascience.com/understanding-…

Rohit Patel (@_rohit_patel_) 's Twitter Profile Photo

We’re releasing 1B & 3B quantized Llama models with same quality as the original, while achieving 2-4x speedup. We used two techniques: Quantization-Aware Training with LoRA adaptors, and SpinQuant ai.meta.com/blog/meta-llam…

We’re releasing 1B &amp; 3B quantized Llama models with same quality as the original, while achieving 2-4x speedup. We used two techniques: Quantization-Aware Training with LoRA adaptors, and SpinQuant
ai.meta.com/blog/meta-llam…
Rohit Patel (@_rohit_patel_) 's Twitter Profile Photo

We are releasing Llama 3.3 today. An updated Lllama 70B open source instruct model which is comparable in performance to the 405B model. Happy holidays!!! 🥳 #llm #ai #llama One Meta: llama.com/llama-download… Oh Huggingface: huggingface.co/meta-llama/Lla…

Towards Data Science (@tdatascience) 's Twitter Profile Photo

From our #TDSBestOf2024 collection: Rohit Patel with a beginner-friendly primer on LLMs and how they work under the hood. towardsdatascience.com/understanding-…

Rohit Patel (@_rohit_patel_) 's Twitter Profile Photo

Our CRAG-MM Challenge (KDD Cup 2025) invites you to develop innovative multi-modal, multi-turn question-answering systems with a focus on RAG, using agentic tools to retrieve information. The goal is to improve visual reasoning: aicrowd.com/challenges/met…

Rohit Patel (@_rohit_patel_) 's Twitter Profile Photo

Negative log-likelihood, cross entropy, and KL divergence. Related, simple and extremely useful concepts worth fully internalizing.medium.com/data-science-c… hashtag#ml hashtag#ai hashtag#statistics

Rohit Patel (@_rohit_patel_) 's Twitter Profile Photo

Understanding reinforcement learning for model training from scratch. This took me a lot longer to write than anticipated, partly because the RLMT literature is not an easy read: rohit-patel.medium.com/understanding-…