Shantanu Sharma (@shantanu) 's Twitter Profile
Shantanu Sharma

@shantanu

he/him RT≠👍 NMLS # 1677482

ID: 16146256

linkhttps://www.linkedin.com/in/shantanu calendar_today05-09-2008 17:16:20

82 Tweet

657 Followers

102 Following

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet. Today we’re releasing a collection of new Llama 3.1 models including our long awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context

Yann LeCun (@ylecun) 's Twitter Profile Photo

A hugely important commitment to the openness of Meta's AI ecosystem by Mark: "Open Source AI Is the Path Forward " Llama 3.1 is free, open, and on par with the best proprietary systems. To maximize performance, safety, customizability, and efficiency, AI platforms must be open,

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Introducing Meta Segment Anything Model 2 (SAM 2) — the first unified model for real-time, promptable object segmentation in images & videos. SAM 2 is available today under Apache 2.0 so that anyone can use it to build their own experiences Details ➡️ go.fb.me/p749s5

Shantanu Sharma (@shantanu) 's Twitter Profile Photo

Survey paper on Large Language Models in Finance. Authors delve into current approaches employing LLMs in finance, from leveraging pretrained models via zero-shot or few-shot learning to training custom LLMs from scratch, addressing the industry's need for accuracy and fairness.

Shantanu Sharma (@shantanu) 's Twitter Profile Photo

Writer's Palmyra-Fin-70B-32K model has passed the CFA Level III exam, showcasing leading performance in their internal long-fin-eval benchmark. This model is designed to excel in analyzing and summarizing complex financial reports, market data, and economic indicators, providing

Writer's Palmyra-Fin-70B-32K model has passed the CFA Level III exam, showcasing leading performance in their internal long-fin-eval benchmark.

This model is designed to excel in analyzing and summarizing complex financial reports, market data, and economic indicators, providing
Shantanu Sharma (@shantanu) 's Twitter Profile Photo

Revolutionizing Finance with LLMs: An Overview of Applications and Insights. Large Language Models (LLMs) are reshaping the finance landscape, offering novel capabilities in processing textual data and zero-shot learning. From sentiment analysis to fraud detection, LLMs are

Revolutionizing Finance with LLMs: An Overview of Applications and Insights.

Large Language Models (LLMs) are reshaping the finance landscape, offering novel capabilities in processing textual data and zero-shot learning. From sentiment analysis to fraud detection, LLMs are
US-India Strategic Partnership Forum (@usispforum) 's Twitter Profile Photo

🇺🇸🤝🇮🇳| “The U.S.-India relationship is absolutely critical to both our nations, as well as to a free and open Indo-Pacific and the free and open international system upon which we both depend.” - Doug Beck, Director of DefenseInnovationUnit, delivered opening remarks at #INDUSXSummit2024

Shantanu Sharma (@shantanu) 's Twitter Profile Photo

"We have a great generic opioid overdose antidote, naloxone. It needs to be cheaper and available everywhere, not hidden behind pharmacy counters but placed near every defibrillator and in every first aid kit. Two medications — methadone and buprenorphine — have proved to cut

Shantanu Sharma (@shantanu) 's Twitter Profile Photo

Experimented with DeepSeek-R1 this weekend. DeepSeek-R1 migrated from supervised fine tuning to reinforcement learning for model training, enabling generating longer chain-of-thought (CoT). Using Group Relative Policy Optimization (GRPO) to save the training costs enables a

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

New 2h11m YouTube video: How I Use LLMs This video continues my general audience series. The last one focused on how LLMs are trained, so I wanted to follow up with a more practical guide of the entire LLM ecosystem, including lots of examples of use in my own life. Chapters

New 2h11m YouTube video: How I Use LLMs

This video continues my general audience series. The last one focused on how LLMs are trained, so I wanted to follow up with a more practical guide of the entire LLM ecosystem, including lots of examples of use in my own life.

Chapters
David Sacks (@davidsacks47) 's Twitter Profile Photo

Congrats to the AI at Meta team on the launch of their new Llama 4 open-weights models. For the U.S. to win the AI race, we have to win in open source too, and Llama 4 puts us back in the lead.

Shantanu Sharma (@shantanu) 's Twitter Profile Photo

Good read: The Leaderboard Illusion: alphaxiv.org/abs/2504.20879 Big Tech commercially dependent on marketing model performance for revenues putting their best models out on Chatbot Arena is not surprising. I would argue against prohibiting score retraction after submission and

Eric Topol (@erictopol) 's Twitter Profile Photo

A new cover for SUPER AGERS after making the NYT bestseller list. Thanks to you for making it the #1 ranked new non-fiction book on Amazon. amazon.com/gp/new-release…

A new cover for SUPER AGERS after making the NYT bestseller list. Thanks to you for making it the #1 ranked new non-fiction book on Amazon. 
amazon.com/gp/new-release…