Kasra (@kasra_danesh) Twitter Tweets • TwiCopy

evan conrad

5 months ago

We've partnered with Modular to create Large Scale Inference (LSI), a new OpenAI-compatible inference service. It's up to 85% cheaper than other offerings & can handle trillion-token scale. We originally created it at the request of a major AI lab to do large scale multimodal

thumb_up_off_alt454

chat_bubble_outline40

repeat37

shareShare

Kasra

@kasra_danesh

5 months ago

We're soon launching LSI, a product built with love between San Francisco Compute and Modular

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Chris Lattner

@clattner_llvm

5 months ago

I'm very excited to partner with SFCompute - Evan and team are phenomenally driven and built a powerful platform for scaling GPU solutions like never before. Combined with Modular's high-performance inference solutions, they're able to deliver incredible TCO advantages! 👇

thumb_up_off_alt186

chat_bubble_outline8

repeat6

shareShare

Kasra

@kasra_danesh

5 months ago

No two days are alike - new people - brilliant minds - everyday - from all over the world.

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Victor Boyd

@victorwboyd

5 months ago

4mo ago: We bought a used forklift & strapped an Ai kit to it 4mo later: We’re moving hundreds of pallets a day in a customers warehouse Yesterday I got 3 requests totaling 100+ forklifts We have to scale right now V2 coming soon

thumb_up_off_alt6,6K

chat_bubble_outline262

repeat365

shareShare

Kasra

@kasra_danesh

5 months ago

Cross company wins introduce the best slack emojis

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Kasra

@kasra_danesh

5 months ago

I was really excited to use Silicon Data but it's just the index don't make any sense and is way more different than what the actual market is. $3.35 as the H100 hourly market price is just wrong, unless someone is breaking the law... 👀

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

evan conrad

@evanjconrad

5 months ago

We're going to sublease our current office space in the center of Hayes Valley. It's a beautiful, two floor spot, at about 2900 sqft in total, with room for about 20 desks. If you're interested, please DM!

thumb_up_off_alt96

chat_bubble_outline5

repeat9

shareShare

Kasra

@kasra_danesh

5 months ago

Finally, Sidebar. Josh Miller youtube.com/watch?v=5vexXI…

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Kasra

@kasra_danesh

5 months ago

I just made a long form with Typeform and it faced technical issues and lost everything!!!!! never going back.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Kasra

@kasra_danesh

5 months ago

Keep an eye on Decart today, It has something to do with Magic 👀

Keep an eye on <a href="/DecartAI/">Decart</a> today, It has something to do with Magic 👀

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Kasra

@kasra_danesh

5 months ago

Go Decart ! Everyone at SFC is excited about this launch, this is huge for the industry, this is Magic.

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Kasra

@kasra_danesh

5 months ago

Is there a summer deal going on for the Nasdaq billboard in NYC or what?

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Kasra

@kasra_danesh

5 months ago

If I email people at 11:30 PM in San Francisco, they read within 5-10 minuets, no one stops working here.

thumb_up_off_alt6

chat_bubble_outline1

repeat0

shareShare

San Francisco Compute

@sfcompute

4 months ago

We're excited to combine the unbeatable engineering of Modular with the unbeatable prices of San Francisco to make the world's best priced inference service. The world gets better when costs go down.

thumb_up_off_alt147

chat_bubble_outline8

repeat16

shareShare

Sarah Chieng

@sarahchieng

4 months ago

Qwen3-Coder is now available on Cerebras, 17x faster than on GPU providers. And it's completely free. Try it out directly in your developer flow, or signup for our virtual hackathon tomorrow. It's a $5,000 prize :) Cerebras Cline

thumb_up_off_alt734

chat_bubble_outline169

repeat77

shareShare

Kasra

@kasra_danesh

4 months ago

In the GPU cloud business utilization is a misleading metric, aim for revenue based metrics. Specifically Cluster MRR and ARR. You can have a lower revenue on a higher utilization.

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare