Anton Shevtsov (@shevan05) Twitter Tweets • TwiCopy

Anton Shevtsov

@shevan05

+ Follow

ID: 536128489

calendar_today25-03-2012 08:50:48

5 Tweet

4 Takipçi

16 Takip Edilen

Nebius

@nebiusai

a year ago

We release world’s first datasets for training software engineering agents🔥 More specifically, our AI R&D team uploaded two datasets to @HuggingFace: one with 6,411 Issue-PR pairs, and the other with 80,036 agent trajectories. Learn more on our blog: eu1.hubs.ly/H0fwMB00

thumb_up_off_alt46

chat_bubble_outline5

repeat14

shareShare

hr0nix @ ICLR

@hr0nix

a year ago

As a follow up to our work on applying search to software engineering agents, today we are releasing datasets of problem instances and agent trajectories. This is the training data we previously used to achieve 40.6% on SWE-bench Verified using open-weight models only! 🧵⬇️

thumb_up_off_alt36

chat_bubble_outline2

repeat20

shareShare

hr0nix @ ICLR

@hr0nix

a year ago

Spirit of open-source is in the air thanks to DeepSeek! And today we are happy to release kvax, our implementation of flash attention for jax! It is very fast and has some advanced features such as context parallelism support that might not be easy to come by. Details ⬇️

thumb_up_off_alt66

chat_bubble_outline4

repeat19

shareShare

hr0nix @ ICLR

@hr0nix

a year ago

An extended writeup of our earlier research blogpost on training critics for SWE agents has been accepted to ICML! Some details below ⬇️

thumb_up_off_alt14

chat_bubble_outline1

repeat5

shareShare

Nebius

@nebiusai

8 months ago

Our own SWE-rebench just became the #1 most downloaded dataset on @HuggingFace 🥇 SWE-rebench is a dataset and benchmark for code agents based on LLMs, developed by our AI R&D team. It has been downloaded more than 3.9M times — 3.1M in the last month. 1/4

thumb_up_off_alt84

chat_bubble_outline2

repeat11

shareShare