Code Monkey home page Code Monkey logo

coder's Introduction

Logo of CodeR

📰 News

  • [Jun. 4, 2024]: 🎉 We release CodeR, which can solve $28.33$% of issues on SWE-bench lite in the case of submitting only once per issue, Read more in our paper.

🌏 Abstract

GitHub issue resolving recently has attracted significant attention from academia and industry. SWE-bench is proposed to measure the performance in resolving issues. In this paper, we propose CodeR, which adopts a multi-agent framework and pre-defined task graphs to Repair & Resolve reported bugs and add new features within code Repository. On SWE-bench lite, CodeR is able to solve $28.33$% of issues, in the case of submitting only once for each issue. We examine the performance impact of each design of CodeR and offer insights to advance this research direction.

CodeR

🧪 Results on SWE-agent lite

CodeR

📗 Citation

@misc{chen2024coder,
      title={CodeR: Issue Resolving with Multi-Agent and Task Graphs}, 
      author={Dong Chen and Shaoxin Lin and Muhan Zeng and Daoguang Zan and Jian-Gang Wang and Anton Cheshkov and Jun Sun and Hao Yu and Guoliang Dong and Artem Aliev and Jie Wang and Xiao Cheng and Guangtai Liang and Yuchi Ma and Pan Bian and Tao Xie and Qianxiang Wang},
      year={2024},
      eprint={2406.01304},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

❣️ Question

Please feel free to arise issues If you have any questions.

coder's People

Contributors

nl2code avatar eltociear avatar

Stargazers

st01cs avatar Matthew avatar  avatar Han Zhou avatar 潘浩淼 avatar wangwhh avatar Richie Liu avatar Nikolaus Schlemm avatar Freax Ruby avatar  avatar Elifarley C. avatar id-2 avatar  avatar  avatar  avatar Symbolk avatar  avatar tsmile avatar 清水 avatar  avatar Vladislav Sorokin avatar  avatar  avatar Zeyu Zhang avatar Harsh Gupta avatar  avatar  avatar  avatar 水木 avatar Yosu Cadilla avatar jchl avatar Dean Rie avatar niliy01 avatar  avatar Hsuching avatar liyongjin avatar  avatar  avatar  avatar  avatar WenjieLi avatar  avatar  avatar  avatar  avatar Jun Liu avatar onemore118 avatar  avatar CannedFish Liang avatar evan.chris.ho avatar aguang avatar Whiten avatar kyle avatar Farrukh Nauman avatar Koolen Dasheppi avatar hjvogel avatar Alexey Golyshev avatar Jose Cohenca avatar Nur Arifin Akbar avatar Chang Gao avatar Org Contrib avatar  avatar Jens Roland avatar Brian Mulder avatar  avatar Daniil Pokrovsky avatar  avatar  avatar Jian Liao avatar yangchao avatar Allen avatar  avatar  avatar Greg Meldrum avatar  avatar elucida avatar mannaandpoem avatar Gurumurthi V Ramanan avatar Emanuele Sabetta avatar sam avatar  avatar KyrieMing avatar Liu Chenyang avatar jie w. murphy avatar  avatar Tom pei avatar Vincent avatar Zhangtao Cheng avatar lebronran avatar Bin avatar Billy Ma avatar Kye Gomez avatar Eugene Klimov avatar  avatar  avatar  avatar Phat Tan Pham avatar yakami avatar clearwater avatar Adam Ferla avatar

Watchers

Robert avatar Dong Chen avatar Yosu Cadilla avatar  avatar  avatar  avatar  avatar

coder's Issues

Code to reproduce experiments?

Wow, congratulations on the great results on SWE-Bench Lite! We would love to reproduce the results, are there plans to release the code?

Question about manager plans and related-issue-retrieval action

Congratulations on the release and thank you for citing the AutoCodeRover paper!

The paper was a great read. After going through it, I have a couple of clarification questions:

  1. The Plan D part above Figure 3 mentions that "Plan D takes a test-driven approach with a ground truth test for issues (such as 'fail-to-pass' and 'pass-to-pass' tests in SWE-bench)."
    Does this mean the developer-written tests for the issue (i.e. the test_patch field in SWE-bench instances) were provided to CodeR?

  2. Section 2 mentions that "Action 18 retrieves the top-1 similar issue and its corresponding patch by description." (action 18 is related issue retrieval from Table 1).
    This is an interesting approach! I'm curious how you defined "similarity" between issues - was this using a RAG-based approach on the issue descriptions? Besides, how did you construct the corpus of issues to retrieve from?

Thank you very much in advance for your time and assistance!

Does PlanD use ground truth tests?

Hi, does the plan D mentioned in your paper use "fail-to-pass" tests that are actually used to evaluate the patches?

If so, this would be kind of unfair because most of the other methods do not use those.

Could you maybe specify how many instances in the paper are solved by Plan D?

Set of instances for ablation study

Congratulations on the great results on swebench! We would like to reproduce this work. Do you have any plans to release the code and data (instance id of the small set for the ablation study)?

Plan to open-source

Hi, thanks for the nice work! Do you have a plan to release the source code of CodeR, so that people can learn more about it?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.