Code Review

Data

The code review dataset is located in data/code_review_data.jsonl. The fields of the data are explained below:

| Field | Description |
| --- | --- |
| id | the local id of the item in the dataset |
| lang_cluster | the programming language of the source code |
| source_code | the old version of the code |
| diff_hunk | the code changes between the old and new versions of the code |
| diff_tag | 0: the quality of the code changes is good and no review comment is required; 1: the quality of the code changes is poor and a review comment is required |
| review_comment | the review comment for the code changes |
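
For reference, here is a minimal sketch (not part of the repository) of loading and inspecting the dataset with these fields, assuming it is run from the code_review directory:

```python
# Illustrative only: load the dataset and print a few fields of the first item.
import json

with open("data/code_review_data.jsonl", encoding="utf-8") as f:
    items = [json.loads(line) for line in f]

example = items[0]
print(example["lang_cluster"], example["diff_tag"])
print(example["diff_hunk"])
if example["diff_tag"] == 1:  # poor-quality change: a review comment is provided
    print(example["review_comment"])
```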

Dependencies

  1. cd code_review
  2. install python>=3.9 (we use python==3.9)
  3. install torch (we suggest torch==2.1.1) based on your CUDA version
  4. pip install -r requirements.txt (a quick environment check is sketched below)
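
After these steps, you can optionally verify the setup; this is only an illustrative sketch, not a script from the repository:

```python
# Optional environment check: confirm the Python, torch, and CUDA setup.
import sys

import torch

print("python:", sys.version.split()[0])   # expected >= 3.9
print("torch:", torch.__version__)         # 2.1.1 is the suggested version
print("cuda available:", torch.cuda.is_available())
```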

Inference

Run the inference scripts to get the inference results of the targeted LLMs. The inference results, code_review_result_{model_name}.jsonl, will be saved under the inference/results folder, and the inference logs, code_review_log_{model_name}.log, will be saved under the inference/logs folder.

Closed-source LLMs

We provide inference scripts for the following closed-source LLMs:

| Model Name | Model Version | Script Name |
| --- | --- | --- |
| PaLM 2 | text-bison-001 | run_palm2.py |
| GPT-4 | gpt-4-0613 | run_gpt.py |
| GPT-3.5 | gpt-3.5-turbo-0613 | run_gpt.py |

For PaLM 2, you can run the following command by replacing google_api_key with your own Google API key.

python inference/run_palm2.py --api_key google_api_key

For GPT-4 and GPT-3.5, you can run the following command by replacing openai_api_key with your own OpenAI API key and model_version with the specific model version.

python inference/run_gpt.py --api_key openai_api_key --model model_version
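
The exact prompting and request logic live in inference/run_gpt.py; the following is only a rough, illustrative sketch of the flow (assuming the legacy openai<1.0 SDK and a hypothetical prompt), not the actual script:

```python
# Illustrative sketch of a GPT inference loop over the dataset.
import argparse
import json

import openai

parser = argparse.ArgumentParser()
parser.add_argument("--api_key", required=True)
parser.add_argument("--model", default="gpt-3.5-turbo-0613")
args = parser.parse_args()
openai.api_key = args.api_key

results = []
with open("data/code_review_data.jsonl", encoding="utf-8") as f:
    for line in f:
        item = json.loads(line)
        # Hypothetical prompt: ask the model to judge the diff and draft a review comment.
        prompt = f"Review the following code change:\n{item['diff_hunk']}"
        response = openai.ChatCompletion.create(
            model=args.model,
            messages=[{"role": "user", "content": prompt}],
        )
        item["prediction"] = response["choices"][0]["message"]["content"]
        results.append(item)

with open(f"inference/results/code_review_result_{args.model}.jsonl", "w", encoding="utf-8") as f:
    for item in results:
        f.write(json.dumps(item) + "\n")
```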

Open-source LLMs

We provide inference scripts for the following open-source LLMs:

| Model Name | Model Checkpoint | Script Name |
| --- | --- | --- |
| Code LLaMA | codellama/CodeLlama-34b-Instruct-hf | run_codellama.py |
| LLaMA 2 | meta-llama/Llama-2-70b-chat-hf | run_llama2.py |
| StarCoder | HuggingFaceH4/starchat-beta | run_starcoder.py |
| Vicuna | lmsys/vicuna-13b-v1.5-16k | run_vicuna.py |
| WizardCoder | WizardLM/WizardCoder-15B-V1.0 | run_wizardcoder.py |

For the HuggingFace models, you can run the following command, replacing huggingface_access_token with your own HuggingFace access token, cache_dir with the path to a directory in which the downloaded pretrained model and tokenizer should be cached, and model_checkpoint with the specific model checkpoint.

python inference/run_{model_name}.py --access_token huggingface_access_token --cache_dir cache_dir --checkpoint model_checkpoint
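
The loading details differ per script, but the general HuggingFace flow looks roughly like the sketch below (assuming transformers>=4.32, which accepts the token argument); the prompt shown is hypothetical and the real templates live in the inference scripts:

```python
# Illustrative sketch of loading an open-source checkpoint and generating a review.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

access_token = "huggingface_access_token"           # your HuggingFace access token
cache_dir = "cache_dir"                             # where to cache the model and tokenizer
checkpoint = "codellama/CodeLlama-34b-Instruct-hf"  # one of the checkpoints from the table above

tokenizer = AutoTokenizer.from_pretrained(checkpoint, cache_dir=cache_dir, token=access_token)
model = AutoModelForCausalLM.from_pretrained(
    checkpoint,
    cache_dir=cache_dir,
    token=access_token,
    torch_dtype=torch.float16,  # half precision to fit large checkpoints on GPU
    device_map="auto",          # spread layers across available GPUs
)

# Hypothetical prompt; the real prompt templates live in the inference scripts.
prompt = "Review the following code change:\n..."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```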

Evaluation

Run python evaluator/score.py to get the scores of the targeted LLMs' inference results. The score files code_review_score_1.json and code_review_score_2.json will be saved under the evaluator/scores folder.
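
The actual metrics are defined in evaluator/score.py; purely as an illustration, diff_tag predictions could be scored with standard classification metrics as below (the result file name and the predicted_diff_tag field are hypothetical):

```python
# Illustrative sketch of scoring diff_tag predictions; not the repo's evaluator.
import json

from sklearn.metrics import accuracy_score, precision_recall_fscore_support

references, predictions = [], []
with open("inference/results/code_review_result_gpt-3.5-turbo-0613.jsonl", encoding="utf-8") as f:
    for line in f:
        item = json.loads(line)
        references.append(item["diff_tag"])
        predictions.append(item["predicted_diff_tag"])  # hypothetical field name

accuracy = accuracy_score(references, predictions)
precision, recall, f1, _ = precision_recall_fscore_support(references, predictions, average="binary")
print(json.dumps({"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}, indent=2))
```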