Noam Razin (@noamrazin)'s Twitter Profile
Noam Razin

@noamrazin

Postdoctoral Fellow at @PrincetonPLI | Past: Computer Science PhD @TelAvivUni & Apple Scholar in AI/ML | Interested in the foundations of deep learning

ID: 1261252348669767680

Link: https://noamrazin.github.io/ | Joined: 15-05-2020 11:09:34

126 Tweets

543 Followers

276 Following

Zixuan Wang (@zzzixuanwang)

LLMs can solve complex tasks that require combining multiple reasoning steps. But when are such capabilities learnable via gradient-based training?

In our new COLT 2025 paper, we show that easy-to-hard data is necessary and sufficient!

arxiv.org/abs/2505.23683

🧵 below (1/10)
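
To make the "easy-to-hard" distinction concrete, here is a small Python sketch of the two kinds of training data being contrasted, on a toy multi-step task. The task, the rule, and all function names are illustrative inventions, not the paper's construction:

```python
import random

def make_example(x0, steps):
    """Toy multi-step task: the target is `steps` applications of a rule."""
    x = x0
    for _ in range(steps):
        x = (3 * x + 1) % 97  # stand-in for one reasoning step
    return {"input": x0, "steps": steps, "target": x}

def easy_to_hard_dataset(n, max_steps):
    """Cover every difficulty from 1 step up to max_steps."""
    return [make_example(random.randrange(97), random.randint(1, max_steps))
            for _ in range(n)]

def hard_only_dataset(n, max_steps):
    """Only the hardest examples; per the thread's claim, this coverage
    alone is not enough for gradient-based training to succeed."""
    return [make_example(random.randrange(97), max_steps) for _ in range(n)]
```

Roughly, the thread's claim is that data of the first kind is both necessary and sufficient for a gradient-trained model to reach the hardest difficulty.
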
Yoni Slutzky (@yonislutzky)

Do neural nets really need gradient descent to generalize?🚨

We dive into matrix factorization and find a sharp split: wide nets rely on GD, while deep nets can thrive with any low-training-error weights!

arxiv.org/abs/2506.03931

🧵
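
A minimal numpy sketch of the kind of setting the abstract describes, under my reading (dimensions, initialization scale, learning rate, and step count are illustrative and may need tuning): fit the observed entries of a low-rank matrix with a depth-`depth` factorization via gradient descent, then measure error on the unobserved entries as a proxy for generalization:

```python
import numpy as np

rng = np.random.default_rng(0)
n, rank, depth, hidden = 20, 2, 3, 30
lr, steps = 0.01, 5000

# Low-rank ground truth; observe roughly half the entries.
W_star = rng.normal(size=(n, rank)) @ rng.normal(size=(rank, n))
mask = rng.random((n, n)) < 0.5

dims = [n] + [hidden] * (depth - 1) + [n]
F = [rng.normal(scale=0.1, size=(dims[i + 1], dims[i])) for i in range(depth)]

def prod(mats):
    """W = mats[-1] @ ... @ mats[0]; identity for an empty list."""
    W = np.eye(n) if not mats else mats[0]
    for M in mats[1:]:
        W = M @ W
    return W

for _ in range(steps):
    R = np.where(mask, prod(F) - W_star, 0.0)  # residual on observed entries
    # Gradient of the observed-entry loss w.r.t. each factor F_i is
    # A^T R B^T, where A and B are the products above and below F_i.
    grads = [prod(F[i + 1:]).T @ R @ prod(F[:i]).T for i in range(depth)]
    for i in range(depth):
        F[i] -= lr * grads[i]

W = prod(F)
obs = np.linalg.norm(np.where(mask, W - W_star, 0.0))
unobs = np.linalg.norm(np.where(~mask, W - W_star, 0.0))
print(f"observed-entry error {obs:.3f}, unobserved-entry error {unobs:.3f}")
```

The paper's contrast, as stated in the tweet, is between reaching low training error via GD like this versus via any other means.
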
Yong Lin (@yong18850571)

(1/4)🚨 Introducing Goedel-Prover V2 🚨
🔥🔥🔥 The strongest open-source theorem prover to date.
🥇 #1 on PutnamBench: Solves 64 problems—with far less compute.
🧠 New SOTA on MiniF2F:
* 32B model hits 90.4% at Pass@32, beating DeepSeek-Prover-V2-671B’s 82.4%.
* 8B > 671B: Our 8B
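
For readers unfamiliar with the metric: Pass@32 counts a problem as solved if any of 32 sampled proof attempts passes the proof checker. Assuming the standard unbiased pass@k estimator of Chen et al. (2021) is used when more than k samples are drawn per problem, it can be computed as:

```python
from math import comb

def pass_at_k(n, c, k):
    """Unbiased pass@k estimator (Chen et al., 2021): probability that at
    least one of k samples, drawn from n attempts of which c are correct,
    solves the problem."""
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# Hypothetical numbers: 100 sampled proofs per theorem, 40 check out.
print(pass_at_k(n=100, c=40, k=32))
```
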
Pierfrancesco Beneventano (@pierbeneventano)

New extended version of the preprint “Edge of Stochastic Stability (EoSS)” is out! w/ Arseniy Andreyev
👉 arxiv.org/pdf/2412.20553
🗓️ Tomorrow (Wed July 23, 12 PM EDT) I'll talk about it at OWML: sfu.zoom.us/j/89334355925
I've never explained what it's about, so I'll do it here:
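
Background added for context, not a claim from the preprint: the classical "edge of stability" observation is that full-batch gradient descent tends to drive the sharpness (the largest Hessian eigenvalue of the training loss) to roughly 2/(learning rate), and EoSS studies the mini-batch analogue. Sharpness is commonly tracked with power iteration on Hessian-vector products; a PyTorch sketch:

```python
import torch

def sharpness(loss, params, iters=20):
    """Estimate the top Hessian eigenvalue of `loss` w.r.t. `params`
    via power iteration on Hessian-vector products."""
    grads = torch.autograd.grad(loss, params, create_graph=True)
    flat_g = torch.cat([g.reshape(-1) for g in grads])
    v = torch.randn_like(flat_g)
    v /= v.norm()
    lam = 0.0
    for _ in range(iters):
        # d(g . v)/dtheta = H v, reusing the gradient graph each iteration.
        hv = torch.autograd.grad(flat_g @ v, params, retain_graph=True)
        hv = torch.cat([h.reshape(-1) for h in hv])
        lam = (v @ hv).item()  # Rayleigh quotient with the current v
        v = hv / hv.norm()
    return lam
```

During full-batch training at learning rate lr, sharpness hovering near 2 / lr is the edge-of-stability signature; what replaces that picture under mini-batch SGD is the question the preprint's title points at.
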

Kilian Lieret @ICLR (@klieret)

Releasing mini, a radically simple SWE-agent: 100 lines of code, 0 special tools, and gets 65% on SWE-bench verified!
Made for benchmarking, fine-tuning, RL, or just for use from your terminal.
It’s open source, simple to hack, and compatible with any LM! Link in 🧵
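
As a sketch of what a command-only agent loop like this can look like (illustrative, not mini's actual source; the stop convention and the one-command-per-reply assumption are mine): the LM proposes a shell command, the harness runs it, and the output is appended to the conversation:

```python
import subprocess

def run_agent(lm, task, max_turns=30):
    """`lm` is any callable mapping a message history to a reply string."""
    history = [{"role": "user", "content": task}]
    for _ in range(max_turns):
        reply = lm(history)
        history.append({"role": "assistant", "content": reply})
        if "DONE" in reply:  # assumed stop convention
            break
        # Assume the reply is a single shell command; run it and feed
        # stdout/stderr back to the model as the next user message.
        result = subprocess.run(reply, shell=True, capture_output=True,
                                text=True, timeout=60)
        history.append({"role": "user",
                        "content": result.stdout + result.stderr})
    return history
```

The appeal of keeping the loop this small is that the same harness works for benchmarking, fine-tuning, or RL: everything the agent does is an ordinary shell command.
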
Yong Lin (@yong18850571)

The report of Goedel-Prover-V2 is now on arXiv: arxiv.org/pdf/2508.03613. Check out the details on self-correction, the large-scale scaffolded data synthesis framework, and the magical model averaging.
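
On the last point: "model averaging" commonly refers to uniformly averaging the weights of several checkpoints of the same architecture (a "model soup"); the report has the actual recipe. A minimal PyTorch sketch of the uniform version:

```python
import torch

def average_state_dicts(paths):
    """Uniformly average the parameters of several checkpoints
    (assumes identical architectures and matching state-dict keys)."""
    avg = None
    for path in paths:
        sd = torch.load(path, map_location="cpu")
        if avg is None:
            avg = {k: v.clone().float() for k, v in sd.items()}
        else:
            for k in avg:
                avg[k] += sd[k].float()
    for k in avg:
        avg[k] /= len(paths)
    return avg
```
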