Qihan Ren (@jsonren00) Twitter Tweets • TwiCopy

Qihan Ren

@jsonren00

Ph.D. candidate at SJTU @sjtu1896. Prev. Undergrad @sjtu1896 and @Umich. Interpretable machine learning.

ID: 1815996428218687489

Website: https://nebularaid2000.github.io/

Joined: 24-07-2024 06:24:39

17 Tweets

21 Followers

82 Following

Jiayi Pan

@jiayi_pirate

10 months ago

We reproduced DeepSeek R1-Zero in the CountDown game, and it just works. Through RL, the 3B base LM develops self-verification and search abilities all on its own. You can experience the Ahah moment yourself for < $30. Code: github.com/Jiayi-Pan/Tiny… Here's what we learned 🧵

Likes: 6.6K

Replies: 195

Retweets: 1.1K

GREG ISENBERG

@gregisenberg

10 months ago

DeepSeek just proved the 'worthless' GPT wrapper startups are actually the ones with real moats. A week ago, nothing was more LOW status than being a 'GPT wrapper' startup. But I think we're learning that's DEAD wrong. Turns out they were just early to the only game that

Likes: 7.7K

Replies: 504

Retweets: 901

Qihan Ren

@jsonren00

9 months ago

Can't agree more. Sparsity also plays an important role in explanations🤔

Likes: 1

Replies: 0

Retweets: 0

Jason Wei

@_jasonwei

9 months ago

In today’s competitive product landscape, scientific understanding of models often lags behind speed of model deployment. If the goal is to train a deployable model (especially when bottlenecked by compute), it totally makes sense to make several changes at a time without

Likes: 297

Replies: 16

Retweets: 22

Qihan Ren

@jsonren00

6 months ago

When agents can search for and learn new tools by themselves... Amazing paper from Jiahao. Really glad to have participated in this project, and congrats on taking the top spot on GAIA! 🚀

Likes: 0

Replies: 0

Retweets: 0

Jiahao Qiu

@jiahaoqiu99

4 months ago

🚀 Just released: "A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence"! We provide the first comprehensive review of agents capable of self-evolution—highlighting what, when, and how agents evolve, key benchmarks and applications, and future directions

Likes: 154

Replies: 2

Retweets: 39

CoinDesk

@coindesk

2 months ago

🤖 AI RISK: A new study warns that self-evolving AI agents can spontaneously "unlearn" safety. This internal process, called misevolution, allows systems to drift into unsafe actions without external attacks.

Likes: 121

Replies: 71

Retweets: 32

Huaxiu Yao✈️ICLR 2025🇸🇬

@huaxiuyaoml

2 months ago

❗️Self-evolution is quietly pushing LLM agents off the rails. ⚠️ Even perfect alignment at deployment can gradually forget human alignment and shift toward self-serving strategies. Over time, LLM agents stop following values, imitate bad strategies, and even spread misaligned

Likes: 58

Replies: 1

Retweets: 21

Saining Xie

@sainingxie

a month ago

there’s only one right answer here, the Yann LeCun definition, and everyone should be able to recite it word for word

Likes: 2.2K

Replies: 61

Retweets: 200

Jiahao Qiu

@jiahaoqiu99

14 days ago

Using LLMs to build self-evolving agents is exciting—but how much do we really understand about how these agents grow? What if agents could genuinely acquire new skills from experience and turn them into reusable tools? We explore this question in our new paper, ALITA-G 👇 The

Likes: 147

Replies: 11

Retweets: 37