Rohan Pandey (@khoomeik)'s Twitter Profile
Rohan Pandey

@khoomeik

research @OpenAI || prev @CarnegieMellon '23 @ReworkdAI (YC S23) @AGIHouseSF

ID: 1228506265665462272

Link: https://rpandey.tech
Joined: 15-02-2020 02:28:25

4.4K Tweets

24.24K Followers

1.1K Following

Rohan Pandey (@khoomeik)

Once we solve math and the world doesn’t react, we’ll realize that most problems are empirically bound, not intelligence bound. All the clusters built in the 2020s will then migrate from learning to simulation.

William Merrill (@lambdaviking)

Padding a transformer’s input with blank tokens (...) is a simple form of test-time compute. Can it increase the computational power of LLMs? 👀 New work with Ashish Sabharwal addresses this with *exact characterizations* of the expressive power of transformers with padding 🧵

Rohan Pandey (@khoomeik)

Kevin Zhu incredibly important question and i still don't think there's a good go-to citation for this, but intuition is that RL on-policy update sparsity + CoT enables generalization rather than just memorizing the data as in SFT arxiv.org/abs/2501.17161 x.com/saagnikkk/stat…

Prithviraj (Raj) Ammanabrolu (@rajammanabrolu)

Half the projects for my undergrad Intro to Deep (LLM) RL pearls-lab.github.io/intro-deep-rl-… are exactly this. They came up with some pretty interesting stuff

Rohan Pandey (@khoomeik)

half the ppl pinging me about building novel RL envs want to RL web agents. sick release from the bros, now u can just use hud sdk to build, train, and eval on web RL envs 🔥🔥🔥

Rohan Pandey (@khoomeik)

capabilities-pilled interp drop: we now know why transformers outperform SSMs 👀 (tldr don’t make your mamba conv kernel size too small, and if you’re using non-convolutional SSMs, it’s joever buddy)

Rohan Pandey (@khoomeik)

yo @ the doomers: here's a great example of a less intelligent entity exercising control over a more intelligent one and by that i mean the woke right somehow convincing techno-libertarians to coalition with them

Rohan Pandey (@khoomeik)

white professor living in chicago presenting to a chinese audience in shanghai about indian ideas from the 7th century in relation to 21st century developments in ML i love globalism