Skip to content

Commit 4c713f5

Browse files
authored
Update recent released papers
1 parent 9fd6163 commit 4c713f5

6 files changed

+54
-3
lines changed

_posts/2000-01-01-Related Surveys.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,8 @@ author: coder
44
date: 2000-01-01 00:00:00 +0800
55
categories: [other]
66
tags: [surveys]
7-
pin: false
7+
pin: true
88
---
99
## [Deep Learning Based Code Generation Methods | A Literature Review](https://arxiv.org/ftp/arxiv/papers/2303/2303.01056.pdf)
1010
## [A Survey on Language Models for Code](https://arxiv.org/pdf/2311.07989.pdf)
11+
## [A Survey on Large Language Models for Code Generation](https://arxiv.org/abs/2406.00515)

_posts/2023-10-10-SWE-bench.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,10 +5,11 @@ date: 2023-10-10 00:00:00 +0800
55
categories: [arxiv]
66
tags: [benchmarks]
77
math: true
8+
pin: true
89
---
910

1011
- 🗂️Benchmark Name: [SWE-bench](https://arxiv.org/pdf/2310.06770.pdf)
1112
- 📚Publisher: `Arxiv`
1213
- 🏠Author Affiliation: `Princeton University`; `Princeton Language and Intelligence`; `University of Chicago`
13-
- 🔗URL: [https://www.swebench.com/](https://www.swebench.com/)
14+
- 🔗Leaderboard: [https://www.swebench.com/](https://www.swebench.com/)
1415
- Scenario: `Real-World GitHub Issues`

_posts/2024-04-05-SWE-Agent.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ math: true
88
pin: false
99
---
1010

11-
- 📙Paper: [SWE-agent](https://github.com/princeton-nlp/SWE-agent)
11+
- 📙Paper: 🔥🔥🔥[SWE-agent](https://github.com/princeton-nlp/SWE-agent)
1212
- 📚Publisher: `arxiv`
1313
- 🏠Author Affiliation: `Princeton University`
1414
- Contribution: SWE-agent turns LMs (e.g. GPT-4) into software engineering agents that can fix bugs and issues in real GitHub repositories. On SWE-bench, SWE-agent resolves 12.29% of issues, achieving the state-of-the-art performance on the full test set.

_posts/2024-04-08-AutoCodeRover.md

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
---
2+
title: AutoCodeRover
3+
author: coder
4+
date: 2024-04-08 00:00:00 +0800
5+
categories: [arxiv]
6+
tags: [models]
7+
math: true
8+
---
9+
10+
- 📙Paper: [AutoCodeRover: Autonomous Program Improvement](https://arxiv.org/pdf/2404.05427)
11+
- 📚Publisher: `arxiv`
12+
- 🏠Author Affiliation: `National University of Singapore, Singapore`
13+
- Leaderboard: [https://www.swebench.com](https://www.swebench.com)
14+
- Abstract: Researchers have made significant progress in automating the software development process in the past decades. Recent progress in Large Language Models (LLMs) has significantly impacted the development process, where developers can use LLM-based programming assistants to achieve automated coding. Nevertheless software engineering involves the process of program improvement apart from coding, specifically to enable software maintenance (e.g. bug fixing) and software evolution (e.g. feature additions). In this paper, we propose an automated approach for solving GitHub issues to autonomously achieve program improvement. In our approach called AutoCodeRover, LLMs are combined with sophisticated code search capabilities, ultimately leading to a program modification or patch. In contrast to recent LLM agent approaches from AI researchers and practitioners, our outlook is more software engineering oriented. We work on a program representation (abstract syntax tree) as opposed to viewing a software project as a mere collection of files. Our code search exploits the program structure in the form of classes/methods to enhance LLM's understanding of the issue's root cause, and effectively retrieve a context via iterative search. The use of spectrum based fault localization using tests, further sharpens the context, as long as a test-suite is available. Experiments on SWE-bench-lite which consists of 300 real-life GitHub issues show increased efficacy in solving GitHub issues (22-23% on SWE-bench-lite). On the full SWE-bench consisting of 2294 GitHub issues, AutoCodeRover solved around 16% of issues, which is higher than the efficacy of the recently reported AI software engineer Devin from Cognition Labs, while taking time comparable to Devin. We posit that our workflow enables autonomous software engineering, where, in future, auto-generated code from LLMs can be autonomously improved.

_posts/2024-06-04-CodeR.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
---
2+
title: CodeR
3+
author: coder
4+
date: 2024-06-04 00:00:00 +0800
5+
categories: [arxiv]
6+
tags: [models]
7+
math: true
8+
pin: true
9+
---
10+
11+
- 📙Paper: [CodeR: Issue Resolving with Multi-Agent and Task Graphs](https://arxiv.org/pdf/2406.01304)
12+
- 📚Publisher: `arxiv`
13+
- 🏠Author Affiliation: `Huawei`, `Chinese Academy of Science`, `Singapore Management University`, `Peking University`
14+
- Leaderboard: SOTA on [https://www.swebench.com](https://www.swebench.com) (24.06.04)
15+
- Contribution:
16+
+ We propose CODER, a multi-agent framework with task graphs for issue resolving. Inspired by the issue resolving process by humans in the real world, we design the roles and the actions. For plans, we design a graph data structure that can be parsed and strictly executed. It can ensure the exact execution of the plan and at the same time provide an easy-to-plug interface for plan injection from humans.
17+
+ We leverage LLM-generated code for reproducing the issue and the tests in the repository (excluding the verification tests) to get code coverage information. Coverage information improves contextual retrieval based on the keywords in the issue text and does fault localization together with BM25.
18+
+ We renew the state-of-the-art of SWE-bench lite to 28.33% (85/300) with only one submission per issue.

_posts/2024-06-11-GraphCoder.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
---
2+
title: GraphCoder
3+
author: coder
4+
date: 2024-06-11 00:00:00 +0800
5+
categories: [arxiv]
6+
tags: [models]
7+
math: true
8+
---
9+
10+
- 📙Paper: [GraphCoder: Enhancing Repository-Level Code Completion via Code Context Graph-based Retrieval and Language Model](https://arxiv.org/abs/2406.07003)
11+
- 📚Publisher: `arxiv`
12+
- 🏠Author Affiliation: `Peking University`, `Chinese Academy of Sciences`, `Huawei`
13+
- GitHub: [https://github.com/oceaneLIU/GraphCoder](https://github.com/oceaneLIU/GraphCoder)
14+
- Contribution:
15+
+ An approach GraphCoder to enhance the effectiveness of retrieval by a coarse-to-fine process, which considers both structural and lexical context, as well as the dependence distance between the completion target and the context;
16+
+ A graph-based representation CCG (code context graph) of source code to capture relevant long-distance context for predicting the semantics of code completion target instead of the widely adopted sequence-based one;
17+
+ Extensive experiments upon 5 LLMs and across 8000 code completion tasks from 20 repositories demonstrate that GraphCoder achieves higher exact match values with reduced retrieval time and overhead in database storage space.

0 commit comments

Comments
 (0)