Shiva (@sshivasurya) 's Twitter Profile
Shiva

@sshivasurya

Security Engineer | CS @UWaterloo | prev @Dropbox @Yelp | Building Code-PathFinder OSS codepathfinder.dev | 2x MVH @ GPSRP & Dropbox Hackerone program

ID: 1550750996

linkhttps://shivasurya.me calendar_today27-06-2013 14:25:46

3,3K Tweet

607 Followers

322 Following

Thinking Machines (@thinkymachines) 's Twitter Profile Photo

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”

We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to
Anthropic (@anthropicai) 's Twitter Profile Photo

New on the Anthropic Engineering blog: writing effective tools for LLM agents. AI agents are only as powerful as the tools we give them. So how do we make those tools more effective? We share our best tips for developers: anthropic.com/engineering/wr…

Shiva (@sshivasurya) 's Twitter Profile Photo

Secureflow + Qwen3:4b model is blazing fast 🔥 in M2 Max. I'm tired of Claude sonnet rate limits and advising to gradually increase the API usage.

Secureflow + Qwen3:4b model is blazing fast 🔥 in M2 Max. I'm tired of Claude sonnet rate limits and advising to gradually increase the API usage.
Ara (@arafatkatze) 's Twitter Profile Photo

Here's the simplest explanation of Cline's agentic algorithm. It's just a state machine that classifies every request with a tool call into 3 types: 1. Question tools (need clarification) 2. Action tools (gather context) 3. Completion tools (present results) That's it.

Here's the simplest explanation of <a href="/cline/">Cline</a>'s agentic algorithm.

It's just a state machine that classifies every request with a tool call into 3 types:
1. Question tools (need clarification)
2. Action tools (gather context)
3. Completion tools (present results)

That's it.
Shiva (@sshivasurya) 's Twitter Profile Photo

I have started adding instructions to CLAUDE.md to stop saying "You're absolutely right" to save some output tokens 🤡

Jeff Dean (@jeffdean) 's Twitter Profile Photo

Now you can automate many kinds of browser activity using the new Chrome DevTools MCP. In the other retweets of the base post, many people are already thinking of lots of great use cases for this functionality!

Shiva (@sshivasurya) 's Twitter Profile Photo

Experimenting with reinforcement learning (RL) and verifiers is showing great promise for training AI security agents. By guiding models with precise rewards and multi-turn reasoning, we can elevate vulnerability detection beyond pattern matching to adaptive, explainable exploit

Simon Willison (@simonw) 's Twitter Profile Photo

Decided to live blog this morning's OpenAI DevDay announcements, since I'm in the audience simonwillison.net/2025/Oct/6/ope…

Shiva (@sshivasurya) 's Twitter Profile Photo

TIL wandb = "weights and biases" not "wan db" spent two days thinking it was a networking database for storing training logs. 😂