🤖 Awesome LLM for APR

📖 Contents

👏 Citation
💡 Repair Scenarios
🙆 Human Study
🙅 Patch Correctness Assessment
📊 Benchmark
🤔 Related APR Surveys

👏 Citation

@article{zhang2024survey,
  title={A Systematic Literature Review on Large Language Models for Automated Program Repair},
  author={Zhang, Quanjun and Fang, Chunrong and Xie, Yang and Ma, Yuxiang and Sun, Weisong and Yang, Yun and Chen, Zhenyu},
  journal={arXiv preprint arXiv:2405.01466}
  year={2024}
}

🚗Todo List

add SE agent-based studies for GitHub Issues
add ISSTA 2024 Papers
release a new version of this paper on arXiv

🔥🔥 New Papers

🔥CORE: Resolving Code Quality Issues using LLMs [2024-FSE]
🔥Prompt Fix: Vulnerability Automatic Repair Technology Based on Prompt Engineering [2024-ICNC]
🔥Evaluating Large Language Models for Real-World Vulnerability Repair in C/C++ Code[2024-IWSPA]
🔥Investigating large language models capabilities for automatic code repair in Python[2024-Cluster Computing]
🔥LPR: Large Language Models-Aided Program Reduction[2024-ISSTA]
🔥A Case Study of LLM for Automated Vulnerability Repair: Assessing Impact of Reasoning and Patch Validation Feedback (2024年7月) AIware 2024
🔥When Large Language Models Confront Repository-Level Automatic Program Repair: How Well They Done? [2024-ICSE]
🔥Exploring Parameter-Efficient Fine-Tuning of Large Language Model on Automated Program Repair[2024-ASE]
Exploring the Potential of Conversational Test Suite Based Program Repair on SWE-bench [2024-arXiv]
Exploring and Lifting the Robustness of LLM-powered Automated Program Repair with Metamorphic Testing[2024-arXiv] [paper]
Divide-and-Conquer: Automating Code Revisions via Localization-and-Revision [2024-TOSEM]
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging [2024-arXiv] [paper] [repo]
Automated Program Repair for Introductory Programming Assignments [2024-TLT] [paper]
Automated Repair of AI Code with Large Language Models and Formal Verification [2024-arXiv] [paper]
CraftRTL: High-quality Synthetic Data Generation for Verilog Code Models with Correct-by-Construction Non-Textual Representations and Targeted Code Repair [2024-arXiv-NVIDIA] [paper]
Benchmarking Automated Program Repair: An Extensive Study on Both Real-World and Artificial Bugs [2024-ISSTA] [paper]
Automated program repair via conversation: Fixing 162 out of 337 bugs for $0.42 each using chatgpt[2024-ISSTA] [paper]
Leveraging Large Language Model for Automatic Patch Correctness Assessment[2024-TSE] [paper]
Automated program repair for variability bugs in software product line systems[2024-JSS] [paper]
PyBugHive: A Comprehensive Database of Manually Validated, Reproducible Python Bugs[2024-IEEE Access] [paper]
How to Understand Whole Software Repository? [2024-arXiv] [paper]

💡 Repair Scenarios

Semantic Bug

🔥Automated program repair for variability bugs in software product line systems[2024-JSS] [paper]
🔥A Unified Debugging Approach via LLM-Based Multi-Agent Synergy [2024-arxiv] [paper] [repo]
🔥How Far Can We Go with Practical Function-Level Program Repair? [2024-arxiv] [paper] [repo]
🔥Automated program repair via conversation: Fixing 162 out of 337 bugs for $0.42 each using chatgpt[2024-ISSTA] [paper]
Old Version: Keep the Conversation Going: Fixing 162 out of 337 bugs for $0.42 each using ChatGPT [2023-arxiv] [paper]
A Novel Approach for Automatic Program Repair using Round-Trip Translation with Large Language Models [2024-arxiv] [paper] [repo]
Out of Context: How important is Local Context in Neural Program Repair? [2024-ICSE] [paper] [repo]
Multi-Objective Fine-Tuning for Enhanced Program Repair with LLMs [2024-arxiv] [paper]
Aligning LLMs for FL-free Program Repair [2024-arxiv] [paper]
ContrastRepair: Enhancing Conversation-Based Automated Program Repair via Contrastive Test Case Pairs [2024-arxiv] [paper]
Exploring the Potential of Pre-Trained Language Models of Code for Automated Program Repair [2024-Electronics] [paper]
CigaR: Cost-efficient Program Repair with LLMs [2024-arxiv] [paper] [repo]
The Fact Selection Problem in LLM-Based Program Repair [2024-arxiv] [paper] [repo]
A Novel Approach for Automated Program Repair using Round-Trip Translation with Large Language Models [2024-arxiv] [paper] [repo]
RepairAgent: An Autonomous, LLM-Based Agent for Program Repair [2024-arxiv] [paper]
A Deep Dive into Large Language Models for Automated Bug Localization and Repair [2024-FSE/ESEC] [paper]
Automated Program Repair in the Era of Large Pre-trained Language Models [2023-ICSE] [paper] [repo]
Repair Is Nearly Generation: Multilingual Program Repair with LLMs [2023-AAAI] [paper]
Retrieval-based prompt selection for code-related few-shot learning [2023-ICSE] [paper] [repo]
What makes good in-context demonstrations for code intelligence tasks with llms? [2023-ASE] [paper] [repo]
Fully Autonomous Programming with Large Language Models [2023-GECCO] [paper] [repo]
Automated Program Repair Using Generative Models for Code Infilling [2023-AIED] [paper] [repo]
STEAM: Simulating the InTeractive BEhavior of ProgrAMmers for Automatic Bug Fixing [2023-arxiv] [paper]
Conversational automated program repair [2023-arxiv] [paper]
Is ChatGPT the Ultimate Programming Assistant--How far is it? [2023-arxiv] [paper] [repo]
Using Large Language Models for Bug Localization and Fixing [2023-iCAST] [paper]
An Empirical Study on Fine-Tuning Large Language Models of Code for Automated Program Repair [2023-ASE] [paper] [repo]
An Evaluation of the Effectiveness of OpenAI's ChatGPT for Automated Python Program Bug Fixing using QuixBugs [2023-iSEMANTIC] [paper]
Explainable Automated Debugging via Large Language Model-driven Scientific Debugging [2023-arxiv] [paper]
The Right Prompts for the Job: Repair Code-Review Defects with Large Language Model [2023-arxiv] [paper]
Impact of Code Language Models on Automated Program Repair [2023-ICSE] [paper] [repo]
Towards Generating Functionally Correct Code Edits from Natural Language Issue Descriptions [2023-arxiv] [paper]
The Plastic Surgery Hypothesis in the Era of Large Language Models [2023-ASE] [paper] [repo]
Exploring the Limits of ChatGPT in Software Security Applications [2023-arxiv] [paper]
CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation [2023-arxiv] [paper] [repo]
Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering [2023-arxiv] [paper] [repo]
Training Language Models for Programming Feedback Using Automated Repair Tools [2023-AIED] [paper] [repo]
RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair [2023-arxiv] [paper] [repo]
Automated Code Editing with Search-Generate-Modify [2023-arxiv] [paper] [repo]
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair [2023-FSE/ESEC] [paper] [repo]
Neural Program Repair with Program Dependence Analysis and Effective Filter Mechanism [2023-arxiv] [paper]
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback [2023-arxiv] [paper] [repo]
A study on Prompt Design, Advantages and Limitations of ChatGPT for Deep Learning Program Repair [2023-arxiv] [paper]
Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair [2023-FSE/ESEC] [paper] [repo]
Gamma: Revisiting Template-Based Automated Program Repair Via Mask Prediction [2023-ASE] [paper] [repo]
An Extensive Study on Model Architecture and Program Representation in the Domain of Learning-based Automated Program Repair [2023-APR] [paper] [repo]
Improving Automated Program Repair with Domain Adaptation [2023-TOSEM] [paper] [repo]
Enhancing Code Language Models for Program Repair by Curricular Fine-tuning Framework [2023-ICSME] [paper]
The potential use of ChatGPT for debugging and bug fixing [2023-] [paper]
CIRCLE: Continual Repair across Programming Languages [2022-ISSTA] [paper] [repo]
Towards JavaScript program repair with Generative Pre-trained Transformer (GPT-2) [2022-APR] [paper] [repo]
Fix Bugs with Transformer through a Neural-Symbolic Edit Grammar [2022-ICLR] [paper]
Patch Generation with Language Models: Feasibility and Scaling Behavior [2022-ICLR] [paper]
Can OpenAI's codex fix bugs?: an evaluation on QuixBugs [2022-APR] [paper]
An Analysis of the Automatic Bug Fixing Performance of ChatGPT [2022-APR] [paper] [repo]
Less training, more repairing please: revisiting automated program repair via zero-shot learning [2022-FSE/ESEC] [paer] [repo]
Framing Program Repair as Code Completion [2022-APR] [paper] [repo]
DEAR A Novel Deep Learning-based Approach for Automated Program Repair [2022-ICSE] [paper] [repo]
Generating Bug-Fixes Using Pretrained Transformers [2021-PLDI] [paper]
Applying CodeBERT for Automated Program Repair of Java Simple Bugs [2021-MSR] [paper] [repo]
CURE Code-Aware Neural Machine Translation for Automatic Program Repair [2021-ICSE] [paper] [repo]
How to Understand Whole Software Repository? [2024-arXiv] [paper]

Security Vulnerability

🔥Automated Repair of AI Code with Large Language Models and Formal Verification [2024-arXiv] [paper]
🔥NAVRepair: Node-type Aware C/C++ Code Vulnerability Repair [2024-arxiv] [paper]
Enhanced Automated Code Vulnerability Repair using Large Language Models [2024-arxiv] [paper]
Out of Sight, Out of Mind: Better Automatic Vulnerability Repair by Broadening Input Ranges and Sources [2024-ICSE] [paper] [repo]
A Study of Vulnerability Repair in JavaScript Programs with Large Language Models [2024-arxiv] [paper] [repo]
Chain-of-Thought Prompting of Large Language Models for Discovering and Fixing Software Vulnerabilities [2024-arxiv] [paper]
Pre-trained Model-based Automated Software Vulnerability Repair: How Far are We? [2023-TDSC] [paper] [repo]
Examining zero-shot vulnerability repair with large language models [2023-S&P] [paper] [repo]
An Empirical Study on Fine-Tuning Large Language Models of Code for Automated Program Repair [2023-ASE] [paper] [repo]
A New Era in Software Security: Towards Self-Healing Software via Large Language Models and Formal Verification [2023-arxiv] [paper]
Exploring the Limits of ChatGPT in Software Security Applications [2023-arxiv] [paper]
ZeroLeak: Using LLMs for Scalable and Cost Effective Side-Channel Patching [2023-arxiv] [paper]
How ChatGPT is Solving Vulnerability Management Problem [2023-arxiv] [paper] [repo]
How Effective Are Neural Networks for Fixing Security Vulnerabilities [2023-ISSTA] [paper] [repo]
Vision Transformer-Inspired Automated Vulnerability Repair [2023-TOSEM] [paper] [repo]
Can large language models find and fix vulnerable software? [2023-arxiv] [paper]
VulRepair: A T5-Based Automated Software Vulnerability Repair [2022-FSE/ESEC] [paper] [repo]

Syntax Error

A Novel Approach for Automated Program Repair using Round-Trip Translation with Large Language Models [2024-arxiv] [paper] [repo]
Repair Is Nearly Generation: Multilingual Program Repair with LLMs [2023-AAAI] [paper]
Fixing Rust Compilation Errors using LLMs [2023-arxiv] [paper]
An Empirical Study on Fine-Tuning Large Language Models of Code for Automated Program Repair [2023-ASE] [paper] [repo]
A Chain of AI-based Solutions for Resolving FQNs and Fixing Syntax Errors in Partial Code [2023-arxiv] [paper] [repo]
The Right Prompts for the Job: Repair Code-Review Defects with Large Language Model [2023-arxiv] [paper]
SYNSHINE: improved fixing of Syntax Errors [2022-TSE] [paper] [repo]

Programming Problem

🔥CraftRTL: High-quality Synthetic Data Generation for Verilog Code Models with Correct-by-Construction Non-Textual Representations and Targeted Code Repair [2024-arXiv-NVIDIA] [paper]
A Unified Debugging Approach via LLM-Based Multi-Agent Synergy [2024-arXiv] [paper] [repo]
PyDex: Repairing Bugs in Introductory Python Assignments using LLMs [2024-OOPSLA] [paper] [repo]
DebugBench: Evaluating Debugging Capability of Large Language Models [2024-arxiv] [paper] [repo]
ContrastRepair: Enhancing Conversation-Based Automated Program Repair via Contrastive Test Case Pairs [2024-arxiv] [paper]
ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair [2024-arxiv] [paper] [repo]
Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments [2024-arxiv] [paper]
Improved Program Repair Methods using Refactoring with GPT Models [2024-SIGCSE TS] [paper] [repo]
A critical review of large language model on software engineering: An example from chatgpt and automated program repair [2023-arxiv] [paper] [repo]
Automated Repair of Programs from Large Language Models [2023-ICSE] [paper] [repo]
FixEval: Execution-based Evaluation of Program Fixes for Programming Problems [2023-APR] [paper] [repo]
Refining ChatGPT-Generated Code: Characterizing and Mitigating Code Quality Issues [2023-TOSEM] [paper] [repo]
Repairing bugs in python assignments using large language models [2022-arixv] [paper]

Static Warning

Frustrated with Code Quality Issues? LLMs can Help! [2024-FSE/ESEC] [paper] [repo]
SkipAnalyzer: An Embodied Agent for Code Analysis with Large Language Models [2023-arxiv] [paper] [repo]
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair [2023-FSE/ESEC] [paper] [repo]
InferFix: End-to-End Program Repair with LLMs over Retrieval-Augmented Prompts [2023-FSE/ESEC] [paper] [repo]
Can LLMs Patch Security Issues [2023-arxiv] [paper] [repo]
Improving Automated Program Repair with Domain Adaptation [2023-TOSEM] [paper] [repo]
An empirical study of deep transfer learning-based program repair for Kotlin projects [2022-FSE/ESEC] [paper]
TFix-Learning to Fix Coding Errors with a Text-to-Text Transformer [2021-PMLR] [paper] [repo]

Self-Debug

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging [2024-arXiv] [paper] [repo]
Teaching Large Language Models to Self-Debug [2024-ICLR] [paper]
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement [2024-arxiv] [paper] [repo]
CYCLE: Learning to Self-Refine the Code Generation [2024-OOPSLA] [paper] [repo]
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step [2024-arxiv] [paper] [repo]
Leveraging Print Debugging to Improve Code Generation in Large Language Models [2024-arxiv] [paper]
SelfEvolve: A Code Evolution Framework via Large Language Models [2023-arxiv] [paper]
Self-Refine: Iterative Refinement with Self-Feedback [2023-NeurIPS] [paper] [repo]
AgentCoder: Multi Agent-Code Generation with Iterative Testing and Optimisation [2023-arxiv] [paper]
Self-Edit: Fault-Aware Code Editor for Code Generation [2023-ACL] [paper] [repo]
Is Self-Repair a Silver Bullet for Code Generation? [2023-ICLR] [paper] [repo]

Type Error

Domain Knowledge Matters: Improving Prompts with Fix Templates for Repairing Python Type Errors [2024-ICSE] [paper] [repo]
PyTy: Repairing Static Type Errors in Python [2024-ICSE] [paper] [repo]
GPT-3-Powered Type Error Debugging: Investigating the Use of Large Language Models for Code Repair [2023-SLE] [paper] [repo]

Web UI Test

Guiding ChatGPT to Fix Web UI Tests via Explanation-Consistency Checking [2023-arxiv] [paper]

Smart Contract

ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware Repair of Access Control Vulnerabilities in Smart Contracts [2024-arxiv] [paper]
Evaluating ChatGPT for Smart Contracts Vulnerability Correction [2023-COMPSAC] [paper] [repo]

Hardware Bug

On Hardware Security Bug Code Fixes By Prompting Large Language Models [2024-TIFS] [paper] [repo]
Its pre-print: Fixing Hardware Security Bugs with Large Language Models [2022-arXiv] [paper]
HDLdebugger: Streamlining HDL debugging with Large Language Models [2024-arxiv] [paper]
RTLFixer: Automatically Fixing RTL Syntax Errors with Large Language Models [2023-arxiv] [paper]
LLM4SecHW: Leveraging domain-specific large language model for hardware debugging [2023-AsianHOST] [paper]

Performance Bug

RAPGen: An Approach for Fixing Code Inefficiencies in Zero-Shot [2023-arxiv] [paper]
DeepDev-PERF: A Deep Learning-Based Approach for Improving Software Performance [2022-FSE/ESEC] [paper] [repo]

API Misuse

Evaluating Pre-trained Language Models for Repairing API Misuses [2023-arxiv] [paper] [repo]

Crash Bug

Resolving Crash Bugs via Large Language Models: An Empirical Study [2023-arxiv] [paper] [repo]

Test Case

Automated Test Case Repair Using Language Models [2024-arxiv] [paper]
Identify and Update Test Cases when Production Code Changes: A Transformer-based Approach [2023-ASE]

Formal Proof

Baldur: Whole-Proof Generation and Repair with Large Language Models [2023-FSE/ESEC] [paper]

Translation Bug

Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code [2024-ICSE] [paper] [repo]

GitHub Issue

SWE-bench: Can Language Models Resolve Real-World GitHub Issues? [2024-ICLR] [paper] [repo]

Code Review

Exploring the Potential of ChatGPT in Automated Code Refinement: An Empirical Study [2024-ICSE] [paper] [repo]

Motion Planner

DrPlanner: Diagnosis and Repair of Motion Planners Using Large Language Models [2024-arxiv] [paper] [repo]

🙆 Human Study

Exploring Experiences with Automated Program Repair in Practice [2024-ICSE] [paper]
Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models [2024-arxiv] [papper] [repo]
An Empirical Study of Adoption of ChatGPT for Bug Fixing among Professional Developers [2023-ITA] [paper]

🙅 Patch Correctness Assessment

🔥Leveraging Large Language Model for Automatic Patch Correctness Assessment[2024-TSE] [paper]
APPT Boosting Automated Patch Correctness Prediction via Pre-trained Language Model [2024-TSE] [paper] [repo]
The Best of Both Worlds: Combining Learned Embeddings with Engineered Features for Accurate Prediction of Correct Patches [2023-TOSME] [paper] [repo]
Invalidator: Automated Patch Correctness Assessment via Semantic and Syntactic Reasoning [2023-TSE] [paper] [repo]
PatchZero: Zero-Shot Automatic Patch Correctness Assessment [2023-arxiv] [paper]
Is this Change the Answer to that Problem? Correlating Descriptions of Bug and Code Changes for Evaluating Patch Correctness [2021-ASE] [paper] [repo]
Evaluating representation learning of code changes for predicting patch correctness in program repair [2020-ASE] [paper] [repo]

📊 Benchmark

🔥Exploring Parameter-Efficient Fine-Tuning of Large Language Model on Automated Program Repair[2024-ASE] [paper]
🔥MuBench: Benchmarking Automated Program Repair: An Extensive Study on Both Real-World and Artificial Bugs [2024-ISSTA] [paper]
CodeEditorBench: Evaluating Code Editing Capability of Large Language Models [2024-arxiv] [paper] [repo]
GitBug-Java: A Reproducible Benchmark of Recent Java Bugs [2024-arxiv] [paper] [repo]
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? [2024-ICLR] [paper] [repo]
DebugBench: Evaluating Debugging Capability of Large Language Models [2024-arxiv] [paper] [repo]
ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair [2024-arxiv] [paper] [repo]
A critical review of large language model on software engineering: An example from chatgpt and automated program repair [2023-arxiv] [paper] [repo]
CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation [2023-arxiv] [paper] [repo]
FixEval: Execution-based Evaluation of Program Fixes for Programming Problems [2023-APR] [paper] [repo]

🤔 Related APR Surveys

A Survey of Learning-based Automated Program Repair [2023-TOSEM] [paper] [repo]
Automatic Software Repair: A Bibliography [2018-CSUR] paper]
Automatic Software Repair: A Survey [2017-TSE] paper]

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 Awesome LLM for APR

📖 Contents

👏 Citation

🚗Todo List

🔥🔥 New Papers

💡 Repair Scenarios

Semantic Bug

Security Vulnerability

Syntax Error

Programming Problem

Static Warning

Self-Debug

Type Error

Web UI Test

Smart Contract

Hardware Bug

Performance Bug

API Misuse

Crash Bug

Test Case

Formal Proof

Translation Bug

GitHub Issue

Code Review

Motion Planner

🙆 Human Study

🙅 Patch Correctness Assessment

📊 Benchmark

🤔 Related APR Surveys

Star History

About

Releases

Packages

Contributors 4

iSEngLab/AwesomeLLM4APR

Folders and files

Latest commit

History

Repository files navigation

🤖 Awesome LLM for APR

📖 Contents

👏 Citation

🚗Todo List

🔥🔥 New Papers

💡 Repair Scenarios

Semantic Bug

Security Vulnerability

Syntax Error

Programming Problem

Static Warning

Self-Debug

Type Error

Web UI Test

Smart Contract

Hardware Bug

Performance Bug

API Misuse

Crash Bug

Test Case

Formal Proof

Translation Bug

GitHub Issue

Code Review

Motion Planner

🙆 Human Study

🙅 Patch Correctness Assessment

📊 Benchmark

🤔 Related APR Surveys

Star History

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Packages