Yiqi Zhu (@stephenzhu0218)'s Twitter Profile
Yiqi Zhu

@stephenzhu0218

Undergraduate, Tsinghua University. Now working on Large Language Models, Agents, Embodied AI, and believing in World Models.

ID: 1699988356648841216

Link: https://zhu-yiqi.github.io

Joined: 08-09-2023 03:30:02

0 Tweets

4 Followers

52 Following

Yuxiang Wei (@yuxiangwei9)

Software agents can self-improve via self-play RL

Introducing Self-play SWE-RL (SSR): training a single LLM agent to self-play between bug-injection and bug-repair, grounded in real-world repositories, no human-labeled issues or tests. 🧵

All Hands AI (@allhands_ai)

What is the best LLM for agentic software engineering? Today, we're releasing The OpenHands Index to answer this question. It's the first broad-coverage benchmark for AI coding agents, comparing them on accuracy, cost, and runtime across 5 task domains.