Saurabh Shah (@saurabh_shah2) 's Twitter Profile
Saurabh Shah

@saurabh_shah2

training olmos @allen_ai prev @Apple @Penn 🎀dabbler of things🎸 🐈‍⬛enjoyer of cats 🐈 and mountains🏔️ he/him

ID: 1599170691714138114

https://learnycurve.substack.com
03-12-2022 22:36:25

957 Tweets

1.1K Followers

1.1K Following

mostly agree but you shouldn't *only* make decisions based on asymptotes. The constants matter. e.g. Cursor has already created a ridiculous amount of value and that will continue. They'll exit, pivot, or start training their own models before the asymptotic trends kick in

Why is it important that there exist leading open models built in the west? "We should have open models that reflect western values" can feel vague and hand-wavy. Here's a deepseek screenshot to make it concrete. This is not a model you can do e.g. factuality research on

Does Isomorphic Labs write anything in the public? Where can I learn more abt AI for drug discovery (or knowledge discovery in general)? Is anyone e.g. using alphafold as an RL env to derive rewards from?

life update: for those who don't know, i joined Ai2 a few months ago to work on open source AGI. incredibly excited about what we're building 🚀

Introducing synth-bench -> how good is your LM at generating data for other LM's? Olmo best is weirdly good at this, apparently!

david is one of the most thoughtful people I've met in general, and he's definitely the most thoughtful person I've met when it comes to how to interpret eval scores of language models. ty david + team for helping us make sound decisions!! (P.S. go follow David Heineman)

The space of sequential ops is more expressive, but parallel is more efficient. Beauty of the transformer is it kinda defies this tradeoff