Pritish Mishra (@pritishllm)'s Twitter Profile
Pritish Mishra

@pritishllm

ml engineer smallest.ai | working on LLMs, fine-tuning, multimodality and real-time voice agents.

ID: 1155845384612093953

Joined: 29-07-2019 14:19:49

879 Tweets

278 Followers

957 Following

Pritish Mishra (@pritishllm)

Incredible release by NVIDIA. This model will be invaluable for latency-sensitive applications. It's small, fast, strong, and fits on a single GPU; what more could you ask for?

Pritish Mishra (@pritishllm)

Me: Okay, it’s time for Gemma-4…
Google: Introducing T5Gemma-2 🥳
Me: Alright then, Gemma-4 next…
Google: Here’s Gemma Scope-2
Me: Definitely Gemma-4 this time…
Google: FunctionGemma.

Don’t get me wrong, these are all absolute banger releases. But I’ll be patiently

Pritish Mishra (@pritishllm)

why does even the SoTA closed-source model do this?

genuinely curious to know: is this a quantization issue or some inference-level kernel bug?

Pritish Mishra (@pritishllm)

I shifted from MoE to Dense models and I've never felt better. I have more energy. My skin is clearer. My eyesight has improved.

Pritish Mishra (@pritishllm)

the exact same Hindi sentence tokenized by different models

Qwen3: 222 tokens
GLM-4.7: 212 tokens
Nemotron: 84 tokens
Gemma: 66 tokens

GLM and Qwen3 are very strong models, but their tokenizers are predominantly trained on English text. As a result, they end up tokenizing almost
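
A minimal sketch (not from the tweet) of how such a comparison could be reproduced with Hugging Face AutoTokenizer; the model IDs and the example sentence are assumptions, and gated checkpoints like Gemma require accepting their license on the Hub first.

# Illustrative sketch: count how many tokens each tokenizer needs
# for the same Hindi sentence. Model IDs below are assumptions.
from transformers import AutoTokenizer

hindi_sentence = "मुझे हिंदी में किताबें पढ़ना बहुत पसंद है।"  # any Hindi sentence

model_ids = [
    "Qwen/Qwen3-8B",      # assumed Hub ID for a Qwen3 checkpoint
    "google/gemma-2-9b",  # assumed Hub ID for a Gemma checkpoint (gated)
]

for model_id in model_ids:
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Exclude special tokens so only the sentence itself is counted.
    n_tokens = len(tokenizer.encode(hindi_sentence, add_special_tokens=False))
    print(f"{model_id}: {n_tokens} tokens")

Tokenizers whose training data skews English tend to fall back to byte- or character-level pieces on Devanagari text, which is what inflates the counts.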
Sudarshan Kamath (@kamath_sutra)

Announcing... Voice x Memory! We’re unpacking what makes agents listen, respond, and remember, or sometimes forget, and what that means for building better voice systems. We will move from a world of large LLMs remembering a lot of information towards smaller LMs with finite