Amardeep Singh Sidhu (@thefirehacker) 's Twitter Profile
Amardeep Singh Sidhu

@thefirehacker

Founder-AI Researcher at @bubblspace @AIEDXLearn 🚀 Building BubblSpace TimeCapsule for AI Agents.

ID: 285830591

linkhttps://bubblspace.com calendar_today21-04-2011 21:30:01

5,5K Tweet

385 Followers

1,1K Following

Amardeep Singh Sidhu (@thefirehacker) 's Twitter Profile Photo

We need more real LLM training case studies. If you’ve seen or shared actual training runs, drop them here 👇 💡 We’re collecting these and turning them into structured breakdowns that are easy to learn and apply. Reading the Curves: How real LLMs learn, spike, recover, and

We need more real LLM training case studies.
If you’ve seen or shared actual training runs, drop them here 👇
💡 We’re collecting these and turning them into structured breakdowns that are easy to learn and apply.

Reading the Curves:
How real LLMs learn, spike, recover, and
DPIIT India (@dpiitgoi) 's Twitter Profile Photo

DPIIT releases operational guidelines for ₹10,000 crore Startup India Fund of Funds 2.0—a major push to deepen India’s startup ecosystem. #StartupIndia Read more: pib.gov.in/PressReleasePa…

Anish Moonka (@anisha_moonka) 's Twitter Profile Photo

A 9-year-old in India saw two peacocks walking through dawn fog. She ran to her dad, grabbed his camera, knelt on the dirt, and took one photo. It placed at the BBC Wildlife Photographer of the Year, picked from almost 60,000 entries across 117 countries. The photographer is

Amardeep Singh Sidhu (@thefirehacker) 's Twitter Profile Photo

I found this feature in cursor today "Generate commit message" , very useful. don't have prompt CC or Cursor " generate commit msg for staged code"

I found this feature in cursor today "Generate commit message" , very useful.

don't have  prompt CC or Cursor " generate commit msg for staged code"
Grad (@grad62304977) 's Twitter Profile Photo

I'm still confused by some of the decisions done in deepseek v4 Main confusion is why the huge focus on reducing KV cache size when with something like HiSparse u can offload most of ur kv cache (making ur decode compute bound) This also is compensated with a huge 128 heads and

Erfanzar (@eraznafre) 's Twitter Profile Photo

Releasing SpectraX is a JAX-native neural-network library built around true MPMD pipeline parallelism. Each physical rank compiles and runs its own XLA program — no shared shard_map HLO, no SPMD-same-shape constraint. Heterogeneous stages (eg, embed → blocks → head), nine

Puneet Kumar (@puneetiitm) 's Twitter Profile Photo

The real sign Indian airports have arrived isn’t the marble or the lounges. It’s that the 90-minute buffer is now obsolete. Mumbai T1 today: in at 5:30, seated by 5:42. T2 last week — same. Delhi — same. When efficiency becomes predictable, the real luxury is leaving home later.

Jeremy Howard (@jeremyphoward) 's Twitter Profile Photo

This is great - DeepSeek V4 supports prefill! :D Most other providers have been dropping support for this critically important capability, so wonderful to see at least one company stepping up. api-docs.deepseek.com/guides/chat_pr…

Yacine Mahdid (@yacinelearning) 's Twitter Profile Photo

if you are interested in a great lecture on self-distillation I’ve finished editing a ~1h30min lecture with two stellar researchers in that space Jonas Hübotter and idan shenfeld lots of different article distilled into one presentation and a whole lot of questions answered!

if you are interested in a great lecture on self-distillation I’ve finished editing a ~1h30min lecture with two stellar researchers in that space <a href="/jonashubotter/">Jonas Hübotter</a> and <a href="/IdanShenfeld/">idan shenfeld</a> 

lots of different article distilled into one presentation and a whole lot of questions answered!
Ben Burtenshaw (@ben_burtenshaw) 's Twitter Profile Photo

Humanity's Last Hackathon is NOW OPEN for registration. This is not a normal hackathon. You will be judged on the context, not the code! Use Codex OpenAI Developers to build and optimize models for local inference (kernels on Max metal). Submit through GPU MODE. Climb the

Matej Sirovatka (@m_sirovatka) 's Twitter Profile Photo

We partnered with Hugging Face and OpenAI to allow you to write Metal kernels with AI only. We've seen a rise of AI generated submissions so this time you submit directly through codex to write the fastest kernels to run your AI locally 🫡

Claude (@claudeai) 's Twitter Profile Photo

Claude now connects to the tools creative professionals already use. With the new Blender connector, you can debug a scene, build new tools, or batch-apply changes across every object, directly from Claude.

Keller Jordan (@kellerjordan0) 's Twitter Profile Photo

Modded-NanoGPT Optimization Benchmark Hundreds of neural network optimizers have been proposed in the literature, recently including dozens citing Muon: MARS, SWAN, REG, ADANA, Newton-Muon, TrasMuon, AdaMuon, HTMuon, COSMOS, Conda, ASGO, SAGE, and Magma, to name a few. The

Modded-NanoGPT Optimization Benchmark

Hundreds of neural network optimizers have been proposed in the literature, recently including dozens citing Muon: MARS, SWAN, REG, ADANA, Newton-Muon, TrasMuon, AdaMuon, HTMuon, COSMOS, Conda, ASGO, SAGE, and Magma, to name a few.

The
B62 Studios (@b62studios) 's Twitter Profile Photo

It’s time for Japan to experience the Dhurandhar energy! ⚔️🔥 Arriving in theatres across Japan on 10.7.2026.

It’s time for Japan to experience the Dhurandhar energy! ⚔️🔥

Arriving in theatres across Japan on 10.7.2026.
Priyaa (@pritopian) 's Twitter Profile Photo

India doesn’t need to build its own nuclear program. But it must lead in ensuring the benefits of deterrence are widely shared. See how absurd that sounds? you can’t lead in distributing a benefit you don’t control. if you don't own any part of the core capability chain, you