Kasra (@kasra_danesh)'s Twitter Profile
Kasra

@kasra_danesh

Building @sfcompute, Product, AI, Prev led 4 products at @Rocketberlin

ID: 1008762447828803584

Joined: 18-06-2018 17:24:44

236 Tweets

254 Followers

399 Following

evan conrad (@evanjconrad)

We've partnered with Modular to create Large Scale Inference (LSI), a new OpenAI-compatible inference service. 

It's up to 85% cheaper than other offerings & can handle trillion-token scale.

We originally created it at the request of a major AI lab to do large scale multimodal

Chris Lattner (@clattner_llvm)

I'm very excited to partner with SFCompute - Evan and team are phenomenally driven and built a powerful platform for scaling GPU solutions like never before. Combined with Modular's high-performance inference solutions, they're able to deliver incredible TCO advantages! 👇

Victor Boyd (@victorwboyd)

4mo ago: We bought a used forklift & strapped an AI kit to it

4mo later: We're moving hundreds of pallets a day in a customer's warehouse

Yesterday I got 3 requests totaling 100+ forklifts

We have to scale right now

V2 coming soon

Kasra (@kasra_danesh)

I was really excited to use Silicon Data, but the index just doesn't make any sense and is very different from the actual market. $3.35 as the H100 hourly market price is just wrong, unless someone is breaking the law... 👀

evan conrad (@evanjconrad)

We're going to sublease our current office space in the center of Hayes Valley. It's a beautiful two-floor spot, about 2,900 sq ft in total, with room for about 20 desks. If you're interested, please DM!

San Francisco Compute (@sfcompute)

We're excited to combine the unbeatable engineering of Modular with the unbeatable prices of San Francisco to make the world's best-priced inference service. The world gets better when costs go down.

Sarah Chieng (@sarahchieng)

Qwen3-Coder is now available on Cerebras, 17x faster than on GPU providers. And it's completely free. Try it out directly in your developer flow, or sign up for our virtual hackathon tomorrow. It's a $5,000 prize :) Cerebras Cline

Kasra (@kasra_danesh)

In the GPU cloud business, utilization is a misleading metric; aim for revenue-based metrics, specifically cluster MRR and ARR. You can have lower revenue at higher utilization.
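
A rough sketch of the arithmetic behind that last point, with made-up numbers. The cluster size, utilization rates, prices, the 730-hour month, and the `cluster_mrr` helper are all assumptions for illustration, not figures from any real provider:

```python
HOURS_PER_MONTH = 730  # assumed average number of hours in a month


def cluster_mrr(gpus: int, utilization: float, price_per_gpu_hour: float) -> float:
    """Monthly revenue under a simple model:
    GPUs x hours in the month x fraction of hours sold x price per GPU-hour."""
    return gpus * HOURS_PER_MONTH * utilization * price_per_gpu_hour


# Two hypothetical 64-GPU clusters.
# One is nearly full at a discounted price, the other is less busy at a higher price.
discounted = cluster_mrr(gpus=64, utilization=0.95, price_per_gpu_hour=1.50)
premium = cluster_mrr(gpus=64, utilization=0.70, price_per_gpu_hour=2.50)

print(f"95% utilized at $1.50/hr: MRR ${discounted:,.0f}, ARR ${12 * discounted:,.0f}")
print(f"70% utilized at $2.50/hr: MRR ${premium:,.0f}, ARR ${12 * premium:,.0f}")
```

With these assumed inputs, the 95%-utilized cluster brings in less per month than the 70%-utilized one, which is why utilization alone says little about how the cluster is doing.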