Ajitesh Shukla (@ajitesh_shukla7)'s Twitter Profile
Ajitesh Shukla

@ajitesh_shukla7

Student. Love to solve the hardest math problems. LLMs, mathematical research (geometric topology, differential geometry), quantum computing. Lord Krishna is God of Math.

ID: 986296840478953472

Joined: 17-04-2018 17:34:26

46,46K Tweet

1,1K Followers

5,5K Following

Sam Power (@sp_monte_carlo)

In any case, please do take a look at the paper (arxiv.org/abs/2503.14347); it's nice and short, and could be a useful tool to keep in your toolbox.

Csaba Szepesvari (@csabaszepesvari)

Sam Power Zishun Liu Yongxin Chen I looked at the definition of the averaged moment generating function, and it reminds me of the method of mixtures, which I am a big fan of and which goes back to de la Pena et al. See, e.g., sites.ualberta.ca/~szepesva/pape… and references. What you did looks cool, though, and slightly different.

Mattes Mollenhauer (@gaussianmeasure)

Sam Power Zishun Liu Yongxin Chen Nice! Related and potentially of interest: arxiv.org/abs/2306.11404 If you express the subgaussian proxy not uniformly, but dimension-wise in terms of a psd operator, you can obtain a sharp bound that also holds for vectors in infinite-dim Hilbert spaces.

Oliver Maclaren (@omaclaren)

Teaching some matrix calculus at the moment, and mostly but not fully satisfied with my notes at the moment...found these, they look very good! Gives me some nice ideas to improve my own notes -- Matrix Calculus (for Machine Learning and Beyond) arxiv.org/abs/2501.14787

Dr. Chris Rackauckas (@chrisrackauckas)

Hot take: vibe coding isn't for those who don't know how to code, it's for the experts. A perspective on the true role of Generative AI and LLM-Based Vibe Coding for the future of development. stochasticlifestyle.com/a-guide-to-gen… #GenerativeAI #llm Claude ChatGPT #vibecoding

Piotr Pomorski (@ptrpomorski)

"Is Gold an Inflation Hedge?" -> it used to be decades ago, but now it's just for large shocks. papers.ssrn.com/sol3/papers.cf…

QuantSeeker (@quantseeker)

New paper by Daniel Bloch: Fast trading signals often look like alpha but are mostly small-sample noise. What seems like “speed” is usually just reacting faster to randomness. papers.ssrn.com/sol3/papers.cf…

Piotr Pomorski (@ptrpomorski)

"The Science and Practice of Trend-following Systems" is a great one, especially since we are currently working on updating our TF system. It's 44 pages, so probably better to run through it using Benjamin AI papers.ssrn.com/sol3/papers.cf…

Horace He (@chhillee)

Seunghyun Seo Daniel Vega-Myhre JingyuanLiu I don't quite think about it as you do. A couple notes: 1. I wouldn't say FSDP doesn't reduce activation memory. When doing these comparisons, it makes sense to keep gbsz fixed, and in that case, swapping TP=>DP lowers batch size per GPU, lowering activation memory. 2. All gather

Heming Xia (@hemingkx)

🎉Excited to share that TokenSkip has been accepted to the main conference of EMNLP 2025! Many thanks to all the coauthors for their hard work! Looking forward to seeing everyone in Suzhou😉. arxiv.org/abs/2502.12067

Chao Huang (@huang_chao4969)

🔥 DeepCode has been trending on GitHub for 2 consecutive days! 🚀 Almost hitting 2k GitHub Stars! 🌟 Fully Open Source: github.com/HKUDS/DeepCode ✨ All-in-One Agentic Coding Framework ✨ • 📄 Paper2Code - Research to Implementation • 🌐 Text2Web - Natural Language to Frontend

Qingyu (@qingyu_shi_)

The 3rd Universal Cup Semifinals is coming! Live scoreboard: qoj.ac/results/Semifi… (currently Warm-up Contest) The deadline for registering new teams is Aug 24 at 18:30 (UTC).

elie (@eliebakouch)

Didn't realize this, but it's easy to see whether open-source models are using "original" muP by looking at the config. For instance, in grok1/2 there is this "input/output multiplier_scale", which corresponds to the alpha input/output in the original muP. Looking at the transformers modeling

AIDB (@ai_database)

Los Alamos National Laboratory, famous for US nuclear weapons research, has developed a "scientist AI". It is reportedly a system that reads papers, plans experiments, and runs simulations the way a human would, with the aim of automating scientific discovery. The AI is reported to have been considerably more efficient than conventional methods at designing nuclear fusion experiments.

José A. Alonso (@jose_a_alonso)

LeanGeo: Formalizing competitional geometry problems in Lean. ~ Chendong Song et al. arxiv.org/abs/2508.14644 #ITP #LeanProver #Math #LLMs

José A. Alonso (@jose_a_alonso)

To zip through the cost analysis of probabilistic programs. ~ Matthias Hetzenberger, Georg Moser, Florian Zuleger. arxiv.org/abs/2508.14249… #Haskell #FunctionalProgramming

maspy (@maspy_stars)

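The 3rd Universal Cup Semifinals YouTube: youtube.com/@Universal_Cup Bilibili: space.bilibili.com/35466280089706… Feel free to watch it, no matter whether you are a participant or not. So there probably won't be any problem commentary. The participants' screens are being recorded, but they probably won't be streamed. I wonder if they'll fill five hours with just the scoreboard?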

Simons Institute for the Theory of Computing (@simonsinstitute)

1/2 "Fundamentally, modern AI is just a mathematical object. Math is transforming the world...especially with respect to modern AI." Mikhail Belkin of UC San Diego at the Simons Institute on the triumph and failure of mathematics as it relates to AI. Video: simons.berkeley.edu/talks/mikhail-…
