Vaibhav (VB) Srivastav (@reach_vb)'s Twitter Profile
Vaibhav (VB) Srivastav

@reach_vb

GPU poor @Huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my own

ID: 874987512850128897

https://huggingface.co · Joined 14-06-2017 13:50:54

5.5K Tweets

15.15K Followers

220 Following

Lewis Tunstall (@_lewtun)'s Twitter Profile Photo

Anybody can now post-train Llama 3.2 Vision on their own dataset in just a few lines of code with TRL 🚀!

We've just added support for the 11B and 90B models to the SFTTrainer, so you can fine-tune your models to both see 👀 and follow your instructions ✍️

Training script
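For reference, a minimal sketch of what such a run can look like, loosely modeled on TRL's VLM SFT example: the checkpoint name, dataset, collator details, and hyperparameters below are assumptions rather than the linked training script, and argument names may differ across TRL versions.

```python
# A hedged sketch of post-training Llama 3.2 Vision with TRL's SFTTrainer.
# Checkpoint, dataset, collator, and hyperparameters are illustrative assumptions.
import torch
from datasets import load_dataset
from transformers import AutoModelForVision2Seq, AutoProcessor
from trl import SFTConfig, SFTTrainer

model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"  # assumed checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# An image + chat dataset; this one is used in TRL's VLM examples (assumed here).
dataset = load_dataset("HuggingFaceH4/llava-instruct-mix-vsft", split="train")

def collate_fn(examples):
    # Render each conversation with the chat template and batch text + images together.
    texts = [processor.apply_chat_template(ex["messages"], tokenize=False) for ex in examples]
    images = [ex["images"] for ex in examples]
    batch = processor(text=texts, images=images, return_tensors="pt", padding=True)
    labels = batch["input_ids"].clone()
    labels[labels == processor.tokenizer.pad_token_id] = -100  # ignore padding in the loss
    batch["labels"] = labels
    return batch

training_args = SFTConfig(
    output_dir="llama-3.2-vision-sft",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    bf16=True,
    remove_unused_columns=False,                    # keep image columns for the collator
    dataset_kwargs={"skip_prepare_dataset": True},  # preprocessing happens in collate_fn
)

trainer = SFTTrainer(
    model=model,
    args=training_args,
    train_dataset=dataset,
    data_collator=collate_fn,
    tokenizer=processor.tokenizer,
)
trainer.train()
```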
Georgi Gerganov (@ggerganov)'s Twitter Profile Photo

Let's bring llama.cpp to the clouds! You can now run llama.cpp-powered inference endpoints through Hugging Face with just a few clicks. Simply select a GGUF model, pick your cloud provider (AWS, Azure, GCP) and a suitable GPU/CPU node, and you are good to go. For more info, check
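Once such an endpoint is deployed, querying it could look roughly like the sketch below. The endpoint URL and token are placeholders, and the OpenAI-compatible /v1/chat/completions route is an assumption based on llama.cpp's built-in server API, not the exact setup described in the tweet.

```python
# Hypothetical sketch of querying a deployed llama.cpp-powered Inference Endpoint.
# ENDPOINT_URL and HF_TOKEN are placeholders; the /v1/chat/completions route is an
# assumption based on llama.cpp's OpenAI-compatible server API.
import os
import requests

ENDPOINT_URL = "https://<your-endpoint>.endpoints.huggingface.cloud"  # placeholder
headers = {"Authorization": f"Bearer {os.environ['HF_TOKEN']}"}

response = requests.post(
    f"{ENDPOINT_URL}/v1/chat/completions",
    headers=headers,
    json={
        "messages": [{"role": "user", "content": "Hello from llama.cpp in the cloud!"}],
        "max_tokens": 64,
    },
    timeout=60,
)
print(response.json()["choices"][0]["message"]["content"])
```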

Vaibhav (VB) Srivastav (@reach_vb)'s Twitter Profile Photo

🚨 Nvidia NeMo is 10x faster & 4.5x more cost effective, and blows OpenAI Whisper out of the water! 🔥

Tops the Open ASR Leaderboard with a series of CTC, RNN-T, TDT and AED models!

What powers the up to 10x speed boost:
> Autocasting weights to bfloat16
> CUDA Graphs w/ conditional
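As a rough illustration of the inference path described above, here is a hedged sketch of running a NeMo ASR model with bfloat16 autocasting; the checkpoint name and audio file are assumptions, and the CUDA Graph optimizations live inside NeMo and the leaderboard harness rather than in this snippet.

```python
# A hedged sketch (not the leaderboard harness): load a NeMo ASR model and run it
# with bfloat16 autocasting, one of the optimizations listed in the tweet.
# The checkpoint name and audio path are assumptions for illustration.
import torch
import nemo.collections.asr as nemo_asr

# Assumed TDT-style checkpoint from the Hugging Face Hub.
asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/parakeet-tdt-1.1b")
asr_model = asr_model.cuda().eval()

with torch.inference_mode(), torch.autocast(device_type="cuda", dtype=torch.bfloat16):
    # transcribe() takes a list of audio file paths; "sample.wav" is a placeholder.
    transcripts = asr_model.transcribe(["sample.wav"])

print(transcripts[0])
```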
Vaibhav (VB) Srivastav (@reach_vb)'s Twitter Profile Photo

Open Source AI was off the charts last week:

Nvidia released Nemotron 51B - 220% faster and can handle 400% more workload than L3.1 70B & permissively licensed

Meta dropped Llama 3.2 - Llama Vision 90B & 11B and tiny llamas (3B & 1B) for on-device usage, multilingual & with