Phani Srikanth (@phanisrikanth33) 's Twitter Profile
Phani Srikanth

@phanisrikanth33

❤️ @nimishasureka01, Principal Applied Scientist @NetApp 💻. Prev @Microsoft. 🏎 fan. Views mine. ✍️

ID: 56754153

14-07-2009 17:19:53

595 Tweets

1.1K Followers

527 Following

Phani Srikanth (@phanisrikanth33)

The AlphaGo moment for general intelligence is here. RL is mainstream now. It remains to be seen whether organizations can use RL to improve their top line, with past decisions and revenues as inputs.

Jim Fan (@drjimfan)

those who think RL use less compute don’t know RL at all 😅

SFT: human generates data and machine learns

RL: machine generates data and machine learns

Andrej Karpathy (@karpathy)

We have to take the LLMs to school. When you open any textbook, you'll see three major types of information: 1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent

Phani Srikanth (@phanisrikanth33)

Amazing journey from an OSS contributor to creating products (in public) and helping developers, enterprises & community! You’ve created so much impact not just with hard work & passion but with extreme ‘agency’. Super excited to see what you’ll do next!

Thomas Wolf (@thom_wolf)

After 6+ months in the making and burning over a year of GPU compute time, we're super excited to finally release the "Ultra-Scale Playbook" Check it out here: hf.co/spaces/nanotro… A free, open-source, book to learn everything about 5D parallelism, ZeRO, fast CUDA kernels,

Sohan (@hisohan)

Big congrats from team India@ML Lossfunk! 🎉🇮🇳 Absolutely thrilled to see 25 papers featuring brilliant researchers from India accepted at #ICLR2025! 🔥 Massive achievement & testament to the growing strength of AI/ML research in the country. A thread celebrating their

Hamel Husain (@hamelhusain)

TOC for the open book "Beyond Naive RAG: Practical Advanced Methods" from our RAG series. This condenses 5 hours of instruction into something you can read in ~30 minutes. Link: maven.com/p/945082/beyon… Ben Clavié Nandan Thakur Orion Weller Antoine Chaffin Bryan Bischof fka Dr. Donut

jack morris (@jxmnop)

curious about the training data of OpenAI's new gpt-oss models? i was too. so i generated 10M examples from gpt-oss-20b, ran some analysis, and the results were... pretty bizarre time for a deep dive 🧵

Phani Srikanth (@phanisrikanth33)

Hillclimbed my way from 83 to 95. Pretty sure I'm stuck at a local minimum now. Vibe coded the progress with Claude Code. binga.github.io/vibe-dat-creat…