Nisal Mihiranga (@nisalm) 's Twitter Profile
Nisal Mihiranga

@nisalm

Head of AI & Data Science @Zone24x7, Microsoft AI MVP, Microsoft Certified Trainer, Architect - AI and Data

ID: 69902249

linkhttps://nisaldataai.blogspot.com/ calendar_today29-08-2009 16:22:39

1,1K Tweet

440 Followers

1,1K Following

Peter Lee (@peteratmsr) 's Twitter Profile Photo

๐Ÿš€ Phi-4 is here! A small language model that performs as well as (and often better than) large models on certain types of complex reasoning tasks such as math. Useful for us in Microsoft Research, and available now for all researcher on the Azure AI Foundry! aka.ms/phi4blog

๐Ÿš€ Phi-4 is here! A small language model that performs as well as (and often better than) large models on certain types of complex reasoning tasks such as math. Useful for us in <a href="/MSFTResearch/">Microsoft Research</a>, and available now for all researcher on the Azure AI Foundry! aka.ms/phi4blog
Nisal Mihiranga (@nisalm) 's Twitter Profile Photo

Microsoft has unveiled Phi-4, a 14B parameter small language model (SLM) designed to excel in complex reasoning tasks. Currently Phi-4 is available on Azure AI Foundry under a Microsoft Research License Agreement (MSRLA), and will be available on Hugging Face next week.

Microsoft has unveiled Phi-4, a 14B parameter small language model (SLM) designed to excel in complex reasoning tasks.
Currently Phi-4 is available on Azure AI Foundry under a Microsoft Research License Agreement (MSRLA), and will be available on Hugging Face next week.
Azure SQL (@azuresql) 's Twitter Profile Photo

Wait โ€“ what is GraphQL? How do I use it? WHY would I use it? In this episode of Data Exposed, Buck Woody and Anna Hoffman break it down for you. If you know SQL, youโ€™ll love GraphQL! Watch ๐Ÿ“บ: bit.ly/4iDWjg7 #AzureSQL

Nisal Mihiranga (@nisalm) 's Twitter Profile Photo

It will require a combined effort involving energy-efficient chips, optimized model architectures, algorithms, and a transition to renewable and low-carbon energy sources like nuclear energy. linkedin.com/posts/bgamazayโ€ฆ

Shital Shah (@sytelus) 's Twitter Profile Photo

We have been completely amazed by the response to phi-4 release. A lot of folks had been asking us for weight release. Few even uploaded bootlegged phi-4 weights on HuggingFace๐Ÿ˜ฌ. Well, wait no more. We are releasing today official phi-4 model on HuggingFace! With MIT licence!!

We have been completely amazed by the response to phi-4 release. A lot of folks had been asking us for weight release. Few even uploaded bootlegged phi-4 weights on HuggingFace๐Ÿ˜ฌ.

Well, wait no more. We are releasing today official phi-4 model on HuggingFace!

With MIT licence!!
Tucker Carlson (@tuckercarlson) 's Twitter Profile Photo

Chamath Palihapitiya on the emptiness of Silicon Valley, the future of technology and the promise of the new Trump Administration. (0:00) The War Machine Takeover (3:22) Chamathโ€™s Dark Passenger (19:47) The Emptiness of Silicon Valley Elites (27:06) Is the US at Risk of Losing

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

We have to take the LLMs to school. When you open any textbook, you'll see three major types of information: 1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent

We have to take the LLMs to school.

When you open any textbook, you'll see three major types of information:

1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent
Victoria Slocum (@victorialslocum) 's Twitter Profile Photo

What's the difference between *just* AI and truly ๐—ฎ๐—ด๐—ฒ๐—ป๐˜๐—ถ๐—ฐ ๐˜„๐—ผ๐—ฟ๐—ธ๐—ณ๐—น๐—ผ๐˜„๐˜€? (And why does it even matter?) There's a lot of debate on what makes something an "agent" versus just another AI application. The key difference? ๐—ฆ๐˜๐—ฎ๐˜๐—ถ๐—ฐ ๐—”๐—œ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐˜ƒ๐˜€.

What's the difference between *just* AI and truly ๐—ฎ๐—ด๐—ฒ๐—ป๐˜๐—ถ๐—ฐ ๐˜„๐—ผ๐—ฟ๐—ธ๐—ณ๐—น๐—ผ๐˜„๐˜€?

(And why does it even matter?)

There's a lot of debate on what makes something an "agent" versus just another AI application. 

The key difference? ๐—ฆ๐˜๐—ฎ๐˜๐—ถ๐—ฐ ๐—”๐—œ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐˜ƒ๐˜€.
Elon Musk (@elonmusk) 's Twitter Profile Photo

230k GPUs, including 30k GB200s, are operational for training Grok @xAI in a single supercluster called Colossus 1 (inference is done by our cloud providers). At Colossus 2, the first batch of 550k GB200s & GB300s, also for training, start going online in a few weeks. As Jensen

Elon Musk (@elonmusk) 's Twitter Profile Photo

Due to laws against data export, Tesla achieved the top results in China despite having no local training data. Tesla is adding training data from our world simulator and test tracks to achieve 6/6.

pratham (@prathammittal) 's Twitter Profile Photo

This is the capital allocation framework to stay on top of your game: - Sustained innovation (70%) - Exploring adjacencies (20%) - Building new paradigms (10%) The percentage isn't a hard cap. It's a rule of thumb for those who think in decades, not quarters.

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

My pleasure to come on Dwarkesh last week, I thought the questions and conversation were really good. I re-watched the pod just now too. First of all, yes I know, and I'm sorry that I speak so fast :). It's to my detriment because sometimes my speaking thread out-executes my