Shaurya Rohatgi (@shauryr) Twitter Tweets • TwiCopy

Shaurya Rohatgi

@shauryr


🤖 LLMs/ML @allsci_corp
👍 Open Science | OSS
🎓 PhD @ISTatPENNSTATE
Ex @allen_ai @SemanticScholar @UChicago @AbvIiitm

Build what excites

ID: 563049272

http://shaurya.ai · 25-04-2012 17:25:55

598 Tweets

1.1K Followers

1.1K Following

Shaurya Rohatgi

@shauryr

4 months ago

👀👀 120b

Likes: 2 | Replies: 0 | Retweets: 0

GREG ISENBERG

A mini-essay on this moment in history: We're living through something our grandkids will study in history class, and we're experiencing it as just ANOTHER Thursday. You don't really feel paradigm shifts while you're in them. They feel like normal life with slightly better

Likes: 1.1K | Replies: 119 | Retweets: 166

Tim Dettmers

@tim_dettmers

4 months ago

It seems the closed-source vs open-weights landscape has been leveled. GPT-5 is just 10% better at coding than an open-weight model you can run on a consumer desktop and soon laptop. If Anthropic cannot come up with a good model, then we will probably not see AGI for a while.

Likes: 233 | Replies: 14 | Retweets: 29

basvanopheusden

@basvanopheusden

4 months ago

A few weeks ago, I started a new job at OpenAI. I wrote a document about my interview process and recommendations for anyone on the job market for AI research positions. I hope it's helpful! docs.google.com/document/d/1ZV…

Likes: 4.4K | Replies: 59 | Retweets: 356

⠕Talor

@talor_a

4 months ago

this is one of the most remarkable technical blog posts I’ve ever read

Likes: 11.11K | Replies: 74 | Retweets: 755

vLLM

@vllm_project

4 months ago

🚀 Amazing community project! vLLM CLI — a command-line tool for serving LLMs with vLLM: ✅ Interactive menu-driven UI & scripting-friendly CLI ✅ Local + HuggingFace Hub model management ✅ Config profiles for perf/memory tuning ✅ Real-time server & GPU monitoring ✅ Error

Likes: 841 | Replies: 9 | Retweets: 130

Jay Alammar

@jayalammar

4 months ago

The Illustrated GPT-OSS New post! A visual tour of the architecture, message formatting, and reasoning of the latest GPT. Link in 🧵

Likes: 557 | Replies: 2 | Retweets: 77

Shaurya Rohatgi

@shauryr

3 months ago

based "Incorporating synthetic instruction data into pretraining leads to improved performance on most benchmarks . . . . We also release Seed-OSS-36B-Base-woSyn trained without such data" huggingface.co/collections/By…

Likes: 1 | Replies: 0 | Retweets: 0

Ken Liu

@kenziyuliu

3 months ago

New paper! We explore a radical paradigm for AI evals: assessing LLMs on *unsolved* questions. Instead of contrived exams where progress ≠ value, we eval LLMs on organic, unsolved problems via reference-free LLM validation & community verification. LLMs solved ~10/500 so far:

Likes: 362 | Replies: 12 | Retweets: 72

Shaurya Rohatgi

@shauryr

3 months ago

my toxic trait is never letting the cluster take a break - sorry H200s, you cost more than my car so you're working 24/7

Likes: 4 | Replies: 2 | Retweets: 0

Taylor W. Killian

@tw_killian

3 months ago

#K2Think (🏔️💭) is now live. We're proud of this model that punches well above its weights, developed primarily for mathematical reasoning but has shown itself to be quite versatile. As a fully deployed reasoning system at k2think.ai you can test it for yourself!

Likes: 109 | Replies: 11 | Retweets: 20

clem 🤗

@clementdelangue

3 months ago

Tahnoon Bin Zayed Al Nahyan Thank you for open-sourcing on Hugging Face! Can't wait for more contributions to the world from the UAE! huggingface.co/LLM360/K2-Think

Likes: 12 | Replies: 1 | Retweets: 1

AK

@_akhaliq

3 months ago

K2-Think a reasoning system that achieves frontier performance with just a 32B parameter model, surpassing or matching much larger models such as GPT-OSS 120B and DeepSeek v3.1 vibe coded a chat app for it in anycoder

Likes: 564 | Replies: 7 | Retweets: 74

Mikhail Yurochkin

@yurochkin_m

3 months ago

Excited for our first big release since I joined IFM MBZUAI 🚀 The performance is impressive and the speed is 🤯 (thanks to our friends at Cerebras). Give it a try k2think.ai 🙂

Likes: 16 | Replies: 1 | Retweets: 2
