Songrun He (@hesongrun)'s Twitter Profile
Songrun He

@hesongrun

Ph.D. in Finance @ WashU

ID: 1185968911268880390

Website: https://www.songrunhe.com/

Joined: 20-10-2019 17:20:06

153 Tweets

220 Followers

700 Following

Ambrogio Cesa-Bianchi (@ambrogiocb)

By popular demand, here is my Beamer theme that allows annotation of text, figures, and tables using arrows and handwritten-like text: github.com/ambropo/Jambro… A 🤓 thread on what the theme lets you do 👇🏼

Andrej Karpathy (@karpathy)

"Move 37" is the word-of-day - it's when an AI, trained via the trial-and-error process of reinforcement learning, discovers actions that are new, surprising, and secretly brilliant even to expert humans. It is a magical, just slightly unnerving, emergent phenomenon only

Asaf Manela (@asafmanela)

Thrilled to announce our new paper, The Natural Language of Finance, forthcoming in Foundations and Trends! We explore how #AI, #NLP, and large language models are transforming research in corporate finance and asset pricing. #econtwitter

Yasmine (@cyousakura)

🎉 Introducing Open Reasoner Zero

🚀 Performance: Matches DeepSeek R1-Zero (32B) in just 1/30 steps!

📚 Full training strategies & technical paper

💻 100% open-source: Code + Data + Model

⚖️ MIT licensed - Use it your way!

🌊 Let the Reasoner-Zero tide rise!

🚢 1/n

DeepSeek (@deepseek_ai)

🚀 Day 0: Warming up for #OpenSourceWeek! We're a tiny team at DeepSeek exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency. These humble building blocks in our online service have been documented,

Kimi.ai (@kimi_moonshot)

🚀 Introducing our new tech report: Muon is Scalable for LLM Training

We found that the Muon optimizer can be scaled up using the following techniques:
• Adding weight decay
• Carefully adjusting the per-parameter update scale

✨ Highlights:
• ~2x computational efficiency vs AdamW

Dacheng Xiu (@dachxiu)

📢 SoFiE Summer School 2025: Financial Machine Learning 📍 Yale SOM | July 28 – Aug 1 Deep dive into ML/AI in finance: factor models, NLP, deep learning, AI-driven asset pricing. 🔗 Apply by May 16: yalesurvey.ca1.qualtrics.com/jfe/form/SV_6D… #Finance #MachineLearning #SoFiE #AI #Econometrics

Barack Obama (@barackobama)

At a time when people are understandably focused on the daily chaos in Washington, these articles describe the rapidly accelerating impact that AI is going to have on jobs, the economy, and how we live. axios.com/2025/05/28/ai-…

Yuchen Jin (@yuchenj_uw)

Ilya Sutskever, in his speech at UToronto 2 days ago: "The day will come when AI will do all the things we can do." "The reason is the brain is a biological computer, so why can't the digital computer do the same things?" It's funny that we are debating if AI can "truly think"

Asaf Manela (@asafmanela)

🚨 Call for Papers! 🚨 Submit to the 21st Annual Olin Finance Conference @WashUOlin, Oct 16–17, 2025. Topics: asset pricing, corp finance, behavioral, more. Deadline: July 13 Details: event.olin.wustl.edu/washu-finance-… #econtwitter

Denny Zhou (@denny_zhou)

Slides for my lecture “LLM Reasoning” at Stanford CS 25: dennyzhou.github.io/LLM-Reasoning-… Key points: 1. Reasoning in LLMs simply means generating a sequence of intermediate tokens before producing the final answer. Whether this resembles human reasoning is irrelevant. The crucial

Zeyuan Allen-Zhu, Sc.D. (@zeyuanallenzhu)

Phase 1 of Physics of Language Models code release
✅our Part 3.1 + 4.1 = all you need to pretrain strong 8B base model in 42k GPU-hours
✅Canon layers = strong, scalable gains
✅Real open-source (data/train/weights)
✅Apache 2.0 license (commercial ok!)
🔗github.com/facebookresear…