Qihan Ren (@jsonren00) 's Twitter Profile
Qihan Ren

@jsonren00

Ph.D. candidate at SJTU @sjtu1896. Prev. Undergrad @sjtu1896 and @Umich. Interpretable machine learning.

ID: 1815996428218687489

linkhttps://nebularaid2000.github.io/ calendar_today24-07-2024 06:24:39

17 Tweet

21 Takipçi

82 Takip Edilen

Jiayi Pan (@jiayi_pirate) 's Twitter Profile Photo

We reproduced DeepSeek R1-Zero in the CountDown game, and it just works Through RL, the 3B base LM develops self-verification and search abilities all on its own You can experience the Ahah moment yourself for < $30 Code: github.com/Jiayi-Pan/Tiny… Here's what we learned 🧵

We reproduced DeepSeek R1-Zero in the CountDown game, and it just works 

Through RL, the 3B base LM develops self-verification and search abilities all on its own 

You can experience the Ahah moment yourself for &lt; $30 
Code: github.com/Jiayi-Pan/Tiny…

Here's what we learned 🧵
GREG ISENBERG (@gregisenberg) 's Twitter Profile Photo

DeepSeek just proved the 'worthless' GPT wrapper startups are actually the ones with real moats. A week ago, nothing was more LOW status than being a 'GPT wrapper' startup. But I think we're learning that's DEAD wrong. Turns out they were just early to the only game that

Jason Wei (@_jasonwei) 's Twitter Profile Photo

In today’s competitive product landscape, scientific understanding of models often lags behind speed of model deployment. If the goal is to train a deployable model (especially when bottlenecked by compute), it totally makes sense to make several changes at a time without

Qihan Ren (@jsonren00) 's Twitter Profile Photo

When agents can search for and learn new tools by themselves... Amazing paper from Jiahao. Really glad to have participated in this project, and congrats for taking top in GAIA! 🚀

Jiahao Qiu (@jiahaoqiu99) 's Twitter Profile Photo

🚀 Just released: "A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence"! We provide the first comprehensive review of agents capable of self-evolution—highlighting what, when, and how agents evolve, key benchmarks and applications, and future directions

🚀 Just released: "A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence"!
We provide the first comprehensive review of agents capable of self-evolution—highlighting what, when, and how agents evolve, key benchmarks and applications, and future directions
CoinDesk (@coindesk) 's Twitter Profile Photo

🤖 AI RISK: A new study warns that self-evolving AI agents can spontaneously "unlearn" safety. This internal process, called misevolution, allows systems to drift into unsafe actions without external attacks.

🤖 AI RISK: A new study warns that self-evolving AI agents can spontaneously "unlearn" safety. 

This internal process, called misevolution, allows systems to drift into unsafe actions without external attacks.
Huaxiu Yao✈️ICLR 2025🇸🇬 (@huaxiuyaoml) 's Twitter Profile Photo

❗️Self-evolution is quietly pushing LLM agents off the rails. ⚠️ Even perfect alignment at deployment can gradually forget human alignment and shift toward self-serving strategies. Over time, LLM agents stop following values, imitate bad strategies, and even spread misaligned

Jiahao Qiu (@jiahaoqiu99) 's Twitter Profile Photo

Using LLMs to build self-evolving agents is exciting—but how much do we really understand about how these agents grow? What if agents could genuinely acquire new skills from experience and turn them into reusable tools? We explore this question in our new paper, ALITA-G 👇 The

Using LLMs to build self-evolving agents is exciting—but how much do we really understand about how these agents grow?

What if agents could genuinely acquire new skills from experience and turn them into reusable tools?

We explore this question in our new paper, ALITA-G 👇
The