- Evaluating Large Language Models Trained on Code
- CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Evaluations on HumanEval-X
- CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis
- CodeGen2: Lessons for Training LLMs on Programming and Natural Languages
- StarCoder: may the source be with you!
- PanGu-Coder: Program Synthesis with Function-Level Language Modeling
- InCoder: A Generative Model for Code Infilling and Synthesis
- CodeT5+: Open Code Large Language Models for Code Understanding and Generation
- AceCoder: Utilizing Existing Code to Enhance Code Generation
- Self-Edit: Fault-Aware Code Editor for Code Generation
- Structured Chain-of-Thought Prompting for Code Generation
- ToolCoder: Teach Code Generation Models to use APIs with search tools
- Self-Collaboration Code Generation via ChatGPT
- Self-Planning Code Generation with Large Language Model
- CodeT: Code Generation with Generated Tests
- Automated Program Repair in the Era of Large Pre-trained Language Models
- Automated Repair of Programs from Large Language Models
- Large Language Models are Few-Shot Summarizers: Multi-Intent Comment Generation via In-Context Learning
- No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation