Shaurya Rohatgi (@shauryr)'s Twitter Profile
Shaurya Rohatgi

@shauryr

🤖 LLMs/ML @allsci_corp
👍 Open Science | OSS
🎓 PhD @ISTatPENNSTATE
Ex @allen_ai @SemanticScholar @UChicago @AbvIiitm

Build what excites

ID: 563049272

Link: http://shaurya.ai

Joined: 25-04-2012 17:25:55

598 Tweets

1.1K Followers

1.1K Following

GREG ISENBERG (@gregisenberg)

A mini-essay on this moment in history: We're living through something our grandkids will study in history class, and we're experiencing it as just ANOTHER Thursday. You don't really feel paradigm shifts while you're in them. They feel like normal life with slightly better

Tim Dettmers (@tim_dettmers)

It seems the closed-source vs open-weights landscape has been leveled. GPT-5 is just 10% better at coding than an open-weight model you can run on a consumer desktop and soon laptop. If Anthropic cannot come up with a good model, then we will probably not see AGI for a while.

basvanopheusden (@basvanopheusden)

A few weeks ago, I started a new job at OpenAI. I wrote a document about my interview process and recommendations for anyone on the job market for AI research positions. I hope it's helpful! docs.google.com/document/d/1ZV…

vLLM (@vllm_project)

🚀 Amazing community project!

vLLM CLI — a command-line tool for serving LLMs with vLLM:
✅ Interactive menu-driven UI & scripting-friendly CLI
✅ Local + HuggingFace Hub model management
✅ Config profiles for perf/memory tuning
✅ Real-time server & GPU monitoring
✅ Error

Jay Alammar (@jayalammar)

The Illustrated GPT-OSS

New post! A visual tour of the architecture, message formatting, and reasoning of the latest GPT.

Link in 🧵

Shaurya Rohatgi (@shauryr)

based "Incorporating synthetic instruction data into pretraining leads to improved performance on most benchmarks . . . . We also release Seed-OSS-36B-Base-woSyn trained without such data" huggingface.co/collections/By…

Ken Liu (@kenziyuliu)

New paper! We explore a radical paradigm for AI evals: assessing LLMs on *unsolved* questions.

Instead of contrived exams where progress ≠ value, we eval LLMs on organic, unsolved problems via reference-free LLM validation & community verification. LLMs solved ~10/500 so far:

Taylor W. Killian (@tw_killian)

#K2Think (🏔️💭) is now live. We're proud of this model that punches well above its weights, developed primarily for mathematical reasoning but has shown itself to be quite versatile. As a fully deployed reasoning system at k2think.ai you can test it for yourself!

AK (@_akhaliq)

K2-Think

a reasoning system that achieves frontier performance with just a 32B parameter model, surpassing or matching much larger models such as GPT-OSS 120B and DeepSeek v3.1

vibe coded a chat app for it in anycoder

Mikhail Yurochkin (@yurochkin_m)

Excited for our first big release since I joined IFM MBZUAI 🚀 The performance is impressive and the speed is 🤯 (thanks to our friends at Cerebras). Give it a try k2think.ai 🙂