Min Jae Song (@mj_theory) 's Twitter Profile
Min Jae Song

@mj_theory

Postdoc at UW

ID: 958179987005124613

linkhttps://mjsong32.github.io/ calendar_today30-01-2018 03:28:05

35 Tweet

61 Followers

249 Following

Nathan Lambert (@natolambert) 's Twitter Profile Photo

Reasoning model reports I recommend reading: 2025-01-22 - DeepSeek R1 - arxiv.org/abs/2501.12948 2025-01-22 - Kimi 1.5 - arxiv.org/abs/2501.12599 2025-03-31 - Open-Reasoner-Zero - arxiv.org/abs/2503.24290 2025-04-10 - Seed-Thinking 1.5 - arxiv.org/abs/2504.13914 2025-04-30 - Phi-4

Daniel Litt (@littmath) 's Twitter Profile Photo

It's good for academics to publicly experiment with new AI tools, but important to report both successes and failures when doing so. Audience capture incentivizes only doing one or the other, which is part of the reason the information environment around capabilities is so bad.

Dan Roy (@roydanroy) 's Twitter Profile Photo

Francesco Orabona Let me explain why "given access to the same data, prefer the tighter bound" is not a path to "explaining generalization". I'll tie thi back to the methodology "In Search of Robust Measures" paper. If your goal is estimating risk for the classifier you've learned, then you