Yiqi Zhu (@stephenzhu0218) Twitter Tweets • TwiCopy

Yiqi Zhu

@stephenzhu0218

+ Follow

Undergraduate, Tsinghua University. Now working on Large Language Models, Agents, Embodied AI, and believing in World Models.

ID: 1699988356648841216

linkhttps://zhu-yiqi.github.io calendar_today08-09-2023 03:30:02

0 Tweet

4 Followers

52 Following

Yuxiang Wei

@yuxiangwei9

4 months ago

Software agents can self-improve via self-play RL Introducing Self-play SWE-RL (SSR): training a single LLM agent to self-play between bug-injection and bug-repair, grounded in real-world repositories, no human-labeled issues or tests. 🧵

thumb_up_off_alt365

chat_bubble_outline9

repeat55

shareShare

All Hands AI

@allhands_ai

3 months ago

What is the best LLM for agentic software engineering? Today, we're releasing The OpenHands Index to answer this question. It's the first broad-coverage benchmark for AI coding agents, comparing them on accuracy, cost, and runtime across 5 task domains.

thumb_up_off_alt149

chat_bubble_outline9

repeat35

shareShare