Dhruv Diddi (@dhruvdiddi) 's Twitter Profile
Dhruv Diddi

@dhruvdiddi

Powering Hardware Aware Inference | SLMs | Computer Vision | XGBoost | Formerly at Google, Turo #OwnYourAI #GetSoloTech #PhysicalAI

ID: 1110234349465538560

Link: https://www.linkedin.com/in/dhruvdiddi/ | Joined: 25-03-2019 17:37:50

335 Tweets

77 Followers

220 Following

Ji-Ha (@ji_ha_kim) 's Twitter Profile Photo

I stumbled upon a highly underrated paper. It studies the dimensionality of self-attention geometry via algebraic methods, focusing on a simplified version without softmax, but it also has a short section on the softmax case at the end. They state some conjectures, if anyone can prove them.

Yohei Nishitsuji (@yoheinishitsuji) 's Twitter Profile Photo

float i,e,R,s;vec3 q,p,d=vec3(FC.xy/r-vec2(.6,.5),.7);for(q.zx--;i++<99.;){o.rgb+=hsv(.1,.2,min(e*s,.7-e)/35.);s=1.;p=q+=d*e*R*.1;p=vec3(log2(R=length(p))-t,exp(1.-p.z/R),atan(p.y,p.x)+cos(t)*.2);for(e=--p.y;s<3e2;s+=s)e+=sin(dot(sin(p.zxy*s)-.5,1.-cos(p.yxz*s)))/s;}#つぶやきGLSL

BlinkDL (@blinkdl_ai) 's Twitter Profile Photo

RWKV7-G1 "GooseOne" first release: reasoning @ 0.1b params, pure RNN (attention-free), fully multilingual. Demo & weights on RWKV.com 🪿 Larger G1 training in progress.

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

Qwen2.5-Math-7B-Instruct can scale to o1-level accuracy in only 32 rollouts. This paper's method has a 4–16x better scaling rate than deterministic search counterparts. Current inference-time scaling often relies on imperfect reward models that cause “reward hacking.”

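The tweet above describes inference-time scaling: sampling multiple rollouts and picking the one a reward model scores highest. A minimal best-of-N sketch is below; `generate` and `reward` are placeholder stand-ins (not the paper's method or any real model) to show the selection loop and why an imperfect `reward` invites reward hacking:

```python
def generate(prompt, n):
    # Placeholder: a real system would sample n completions from an LLM.
    return [f"{prompt} -> candidate {i}" for i in range(n)]

def reward(completion):
    # Placeholder score. A learned reward model is imperfect, so the
    # highest-scoring rollout is not guaranteed to be the correct one
    # ("reward hacking"): the policy can exploit flaws in this function.
    return len(completion) % 7

def best_of_n(prompt, n=32):
    # Sample n rollouts and keep the one the reward model prefers.
    rollouts = generate(prompt, n)
    return max(rollouts, key=reward)
```

The paper's contribution is a better search strategy than this naive deterministic selection; the sketch only illustrates the baseline being improved upon.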
Charles Guillemet (@p3b7_) 's Twitter Profile Photo

At Ledger, you might know that we have the Ledger Donjon, our dedicated team constantly conducting open security research. We recently worked with Trezor, revealing that their Trezor Safe 3 was susceptible to physical supply chain attacks. Here's a thread on our findings:🧵

Andi Marafioti (@andimarafioti) 's Twitter Profile Photo

🚀We just dropped SmolDocling: a 256M open-source vision LM for complete document OCR!📄✨ It's lightning fast, processing a page in 0.35 sec on a consumer GPU using < 500MB VRAM⚡ SOTA in document conversion, beating every competing model we tested, up to 27x larger🤯 But how? 🧶⬇️

Maxime Labonne (@maximelabonne) 's Twitter Profile Photo

⚡ AutoQuant: I updated AutoQuant to make the GGUF versions of Gemma 3 abliterated. It implements imatrix and can split the model into multiple files. The GGUF code is based on gguf-my-repo, maintained by Xuan-Son Nguyen and Vaibhav (VB) Srivastav. It also supports GPTQ, ExLlamaV2, AWQ, and HQQ!

Yohei Nishitsuji (@yoheinishitsuji) 's Twitter Profile Photo

float i,e,R,s;vec3 q,p,d=vec3(FC.xy/r,1);for(q.zy--;i++<229.;){e+=i/6e4;o.rgb+=hsv(q.z-.75,R,e*R*i/30.);s=2.;p=q+=d*e*R*R*.15;p=vec3(log(R=length(p-.2))-t*.25,exp(mod(-p.z,s)/R),p+sin(t)*.07);for(e=--p.y;s<4e3;s+=s)e+=-abs(dot(sin(p*s),max(p,exp(atan(.4,e))))/s*.1);}#つぶやきGLSL