Phani Srikanth (@phanisrikanth33) Twitter Tweets • TwiCopy

Phani Srikanth

@phanisrikanth33

a year ago

What a day to release this monster of a model. Open weights w/ a detailed report for reasoning models and MIT licensed.

Phani Srikanth

@phanisrikanth33

a year ago

The AlphaGo moment for general intelligence is here. RL is mainstream now. It remains to be seen whether organizations can use RL to improve their top line, with past decisions and revenues as inputs.

Phani Srikanth

@phanisrikanth33

a year ago

What are some of the best resources on the internet to learn the best and latest on reinforcement learning for LLMs?

Phani Srikanth

@phanisrikanth33

a year ago

Build your own Evals. This advice stands the test of time.

Jim Fan

@drjimfan

a year ago

those who think RL use less compute don’t know RL at all 😅
SFT: human generates data and machine learns
RL: machine generates data and machine learns

Andrej Karpathy

@karpathy

We have to take the LLMs to school. When you open any textbook, you'll see three major types of information: 1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent

Phani Srikanth

@phanisrikanth33

a year ago

Oopsie.

Phani Srikanth

@phanisrikanth33

a year ago

Amazing journey from an OSS contributor to creating products (in public) and helping developers, enterprises & community! You’ve created so much impact not just with hard work & passion but with extreme ‘agency’. Super excited to see what you’ll do next!

Thomas Wolf

@thom_wolf

a year ago

After 6+ months in the making and burning over a year of GPU compute time, we're super excited to finally release the "Ultra-Scale Playbook"
Check it out here: hf.co/spaces/nanotro…
A free, open-source, book to learn everything about 5D parallelism, ZeRO, fast CUDA kernels,

kepano

@kepano

a year ago

here you go... next version of Obsidian Web Clipper has nice Markdown export for Claude and ChatGPT out of the box

Yuvraj Singh

@yuvrajs9886

a year ago

Everyone use this tool to find relevant papers: paperfinder.allen.ai
It's sooo good

Nick Dobos

@nickadobos

a year ago

Teachers using prompt injection to catch students
Inadvertently this is the best course for ai security you could take

Phani Srikanth

@phanisrikanth33

a year ago

Fixed the o1 and o3 evals. Of course, using o4!

Sohan

@hisohan

a year ago

Big congrats from team India@ML Lossfunk! 🎉🇮🇳 Absolutely thrilled to see 25 papers featuring brilliant researchers from India accepted at #ICLR2025! 🔥 Massive achievement & testament to the growing strength of AI/ML research in the country. A thread celebrating their

Phani Srikanth

@phanisrikanth33

a year ago

Massive productivity gains for every torch user. The path to AGI is unblocked now.

Phani Srikanth

@phanisrikanth33

9 months ago

It was good while it lasted!

Hamel Husain

@hamelhusain

9 months ago

TOC for the open book "Beyond Naive RAG: Practical Advanced Methods" from our RAG series. This condenses 5 hours of instruction into something you can read in ~30 minutes. Link: maven.com/p/945082/beyon… Ben Clavié Nandan Thakur Orion Weller Antoine Chaffin Bryan Bischof fka Dr. Donut

jack morris

@jxmnop

9 months ago

curious about the training data of OpenAI's new gpt-oss models? i was too.
so i generated 10M examples from gpt-oss-20b, ran some analysis, and the results were... pretty bizarre
time for a deep dive 🧵

Phani Srikanth

@phanisrikanth33

8 months ago

Hillclimbed my way from 83 to 95. Pretty sure I'm stuck at a local minimum now. Vibe coded the progress with Claude Code. binga.github.io/vibe-dat-creat…

Phani Srikanth

@phanisrikanth33

7 months ago

A fabulous list of projects and super cool progress!
