Aditya Gupta (@adi1391) 's Twitter Profile
Aditya Gupta

@adi1391

$sudoing & steering AI agents ~ Chief Hacker @attifyme ~ Curious ~ IoT tinkering ~ Educator ~ Learner

ID: 332753135

calendar_today10-07-2011 10:48:18

2,2K Tweet

8,8K Takipçi

2,2K Takip Edilen

Aditya Gupta (@adi1391) 's Twitter Profile Photo

Foundational LLM models are going to be a commodity. It's a race to zero token price. As the Foundation layer saturates, the next layer will capture most of the Value. It's the apps - the clever workflows - that will create the most wealth by leveraging AI in innovative ways

Aditya Gupta (@adi1391) 's Twitter Profile Photo

ESP32 backdoor ? Important to distinguish between developer’s oversight of leftover debug code and what’s actually a backdoor. While leftover debug codes are not ideal from a security standpoint, it’s clearly not a backdoor. The level of access required to exploit this in

Aditya Gupta (@adi1391) 's Twitter Profile Photo

Important to add security in the Infosec (and other) MCPs - else - the #MCP itself opens up a huge new attack surface ready to be exploited. Like APIs w/o Auth. If your app/product allows MCP integration, begin by asking yourself - how trusted are the inputs?

Aditya Gupta (@adi1391) 's Twitter Profile Photo

[Iteration Point] • the solution to cracking LLM reasoning seems to be more like a coordinate where iterations reveal unique properties. feels like we are barely scratching the surface of LLM driven reasoning. if/when solved, intelligence explosion is inevitable.

Aditya Gupta (@adi1391) 's Twitter Profile Photo

Fascinating to see how far LLMs can go in multi-layer and multi-step Mathematical reasoning - the only true test of intelligence, without sacrificing precision. The real breakthroughs are no longer just going to be in the models, but the harnesses around it. It's just like

Aditya Gupta (@adi1391) 's Twitter Profile Photo

⌘ + "+" is a philosophy test. VSCode & forks Cursor: All panes are equal. Everything, including Editor + UI zooms by default (0/10) Zed: Active area matters, but consistency > locality. Allows granular & easy selection from Settings of UI vs Editor. (9/10) Ghostty

Aditya Gupta (@adi1391) 's Twitter Profile Photo

Just solved & submitted Q10 of #1stProof research-level math problems. If you can prove it, but can't trace what your proof depends on, or query which claims are still hypotheses, or machine-verify the core theorem - it's not really a proof. The initial bottleneck in solving

Just solved & submitted Q10 of #1stProof research-level math problems. 

If you can prove it, but can't trace what your proof depends on, or query which claims are still hypotheses, or machine-verify the core theorem - it's not really a proof.

The initial bottleneck in solving
Aditya Gupta (@adi1391) 's Twitter Profile Photo

This Opus 4.6 run went to 17h 6m before hitting weekly rate limits. But the most valuable output was not the solution. It was learning how the LLM navigates a hard problem over many hours, and then figuring out ways in which it can be steered better. long vs short llm

This Opus 4.6 run went to 17h 6m before hitting weekly rate limits. 

But the most valuable output was not the solution. 

It was learning how the LLM navigates a hard problem over many hours, and then figuring out ways in which it can be steered better. 

long vs short llm
Chaitanya (@ant4g0nist) 's Twitter Profile Photo

Been experimenting/building Morgul - an AI debugger automation framework. Control LLDB with natural language: act(), extract(), observe(). It translates intent into bridge API code, executes it, similar to Browserbase's Stagehand controls a browser... github.com/ant4g0nist/mor…

Been experimenting/building Morgul - an AI debugger automation framework. Control LLDB with natural language: act(), extract(), observe(). It translates intent into bridge API code, executes it, similar to <a href="/browserbase/">Browserbase</a>'s Stagehand controls a browser...

github.com/ant4g0nist/mor…
Aditya Gupta (@adi1391) 's Twitter Profile Photo

Great work by OpenAI. Hacking, at its core, is about Curiosity and Thinking Differently. Instead of thinking that the game is over, the game is on. It’s time to build frameworks & systems, that can go beyond what the top models are capable of. The most interesting times.

Aditya Gupta (@adi1391) 's Twitter Profile Photo

~$21B in market value disappeared in hours : CrowdStrike (CRWD) -8%, Cloudflare (NET) -8.1%, Okta (OKTA) -9.2%, Qualys (QLYS) -10.2%, Zscaler (ZS) -5.5% - lowest in the last few years. For many : Panic Mode. but Zoom out → → → This is a Huge Win for safer code and a

Aditya Gupta (@adi1391) 's Twitter Profile Photo

doing my part to make the ai ecosystem safer. starting with llama.cpp Heap Overflow Bug identified using CFSE World Modeling. Just received : CVE-2026-27940 one step at a time. thanks Georgi Gerganov for prioritizing security and fixing at such light-speed 🙏

doing my part to make the ai ecosystem safer.

starting with llama.cpp 

Heap Overflow Bug identified using CFSE World Modeling. 

Just received : CVE-2026-27940 

one step at a time. 

thanks <a href="/ggerganov/">Georgi Gerganov</a> for prioritizing security and fixing at such light-speed 🙏
Aditya Gupta (@adi1391) 's Twitter Profile Photo

congrats Axiom Carina Hong on the raise. mathematics is the right foundation for systems that reason - and essential for a safer future. the bet on verification infrastructure over benchmark scores is the right one, and AXLE proves it where - verify_proof tells you