Inference (@inference_net)'s Twitter Profile
Inference

@inference_net

AI inference APIs at 90% lower cost. Switch in two minutes. Sign up today and start saving.

ID: 1849902577808297984

Link: https://inference.net?join=inference&utm_source=inferencexbio · Joined: 25-10-2024 19:55:59

20 Tweets

782 Followers

4 Following

Sam Hogan 🇺🇸 (@0xsamhogan)


The cost of LLM tokens has dropped more than 50% over the last 12 months.

<a href="/GroqInc/">Groq Inc</a> was leading the pack for a while, but just isn't competitive anymore.

Llama 3.1 70B tokens on inference.net are now 63% cheaper than <a href="/GroqInc/">Groq Inc</a> and almost 90% cheaper than <a href="/replicate/">Replicate</a>.
Sam Hogan 🇺🇸 (@0xsamhogan)


This log-scale graph from <a href="/OpenRouterAI/">OpenRouter</a> clearly shows the cost of LLM inference has been decreasing exponentially since the release of GPT-3 in late 2022.

New inference providers like <a href="/inference_net/">Inference</a> will continue this trend into the foreseeable future 👇
Sam Hogan 🇺🇸 (@0xsamhogan) 's Twitter Profile Photo

If you're working on synthetic data generation and need cheap batch inference, hit my DMs. We're cooking up something specifically for this on inference.net

Sam Hogan 🇺🇸 (@0xsamhogan)


The inference.net grants program is live!

Apply and get $10k for AI projects that need LLM inference from top models:

- R1 and V3 from <a href="/deepseek_ai/">DeepSeek</a>
- Llama 70B and 405B from <a href="/AIatMeta/">AI at Meta</a>
- High rate limits, batch support, and more

If you're building open source, read below 👇
Ibrahim Ahmed (@atbeme)


We're excited to be one of the first inference APIs to host Gemma 3 on <a href="/inference_net/">Inference</a>, starting with the 27B variant!

It has incredible vision capabilities, rivaling models 2-6x its size in quality.

Give it a try and let us know what you think!
Keywords AI (YC W24) (@keywordsai)

We worked with OpenAI to build a native integration for the OpenAI Agents SDK. Today, we are very excited to launch as a tracing processor. With just a few lines of code, you can trace all your agent workflows and debug them much faster. Check out this quick demo to get…

Sam Hogan 🇺🇸 (@0xsamhogan)

Inference.net is hiring a Developer Relations Lead in San Francisco!

If you:

- Love working with AI to build amazing products
- Have experience creating world-class educational content for developers
- Enjoy solving developer problems and fostering community growth

We want to…

Ibrahim Ahmed (@atbeme)

So Groq Inc is cool, but it's no longer magic.

With basic out-of-the-box optimizations from SGLang, we're achieving over 250 tokens per second on Llama 3.1 8B at full precision on a 4090 (a consumer gaming card).

And we think we can go even higher (1k).

Keywords AI (YC W24) (@keywordsai)


Excited to announce our integration with <a href="/inference_net/">Inference</a> - a new LLM provider hosting faster and more cost-effective open-source models!

inference.net can reduce your LLM costs by up to 90%. Models like Llama 3.3 70B, DeepSeek V3, and DeepSeek R1 are significantly faster than…
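The "switch in two minutes" pitch in the bio and the integration above both amount to the same move: pointing an existing OpenAI-style chat-completions client at a different base URL. A minimal sketch of what that switch looks like, assuming inference.net exposes an OpenAI-compatible endpoint — the base URL, model identifier, and environment-variable name below are illustrative assumptions, not confirmed API details:

```python
import json
import os
import urllib.request

# Hypothetical OpenAI-compatible base URL; switching providers usually
# means changing only this value -- the request shape stays the same.
BASE_URL = "https://api.inference.net/v1"

def build_chat_request(model: str, prompt: str) -> urllib.request.Request:
    """Build an OpenAI-style chat-completions request (not sent here)."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # API key read from the environment; the variable name is illustrative.
            "Authorization": f"Bearer {os.environ.get('INFERENCE_API_KEY', '')}",
        },
        method="POST",
    )

# Model identifier is a placeholder; check the provider's model list.
req = build_chat_request("meta-llama/llama-3.1-70b-instruct", "Hello!")
print(req.full_url)
```

Because the wire format follows the OpenAI chat-completions convention, the same payload works against any compatible host; only `BASE_URL` and the API key change.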