Shashwat Verma (@therumsticks) 's Twitter Profile
Shashwat Verma

@therumsticks

Applied Scientist @ Amazon; Opinions my own

ID: 1484695567

calendar_today05-06-2013 10:33:57

316 Tweet

243 Followers

880 Following

clem 🤗 (@clementdelangue) 's Twitter Profile Photo

Is it time we stop using the word AI for everything and instead use words like "chatbots", "video generation", "recommendation engines", "cell prediction",...? Feels like as a society, we could have healthier debates like that.

Shashwat Verma (@therumsticks) 's Twitter Profile Photo

whoever added ABC_XYZ.md style docs to claude’s training set, may your socks get wet 10 minutes after putting them on

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,

Excited to release new repo: nanochat!
(it's among the most unhinged I've written).

Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,
elie (@eliebakouch) 's Twitter Profile Photo

Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Hugging…

Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably

huggingface.co/spaces/Hugging…