Skip to content

Repository of the code for the paper "Benchmarking Student Program Repair"

Notifications You must be signed in to change notification settings

KoutchemeCharles/gaied_nips23

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Benchmarking Educational Program Repair

Repository for the paper Benchmarking Educational Program Repair published at the Generative AI for EDucation (GAIED) workshop at NeurIPS. In this work, we propose to benchmark multiple language models for fixing students incorrect programs, and report preliminary results using finetuned LLMs.

Installation

The requirements.txt file can be used with pip to get the packages needed to run the codes.

pip install -r requirements.txt

You should complete the configuration files to specify:

  • the path towards your version of the FalconCode dataset
  • the path towards the singapore dataset
  • the path towards your installation of the Refactory automated repair tool
  • paths towards a local directory where to save the results of the experiments

Experiments

Training one of the model on the FalconCode dataset

python scripts/run.py --config  $config_path --experiment FinetunedRepair --train

where config_path is one of the configurations in the configs/neurips folder. Alternatively, you could create your own configuration following the same style.

Evaluating the trained model on the Singapore dataset

python scripts/run.py --config  $config_path --experiment FinetunedRepair --evaluate --evaluate_config PATH_TO_REPO/configs/repair/dataset/falconcode_skill.json
python scripts/run.py --config  $config_path --experiment FinetunedRepair --evaluate --evaluate_config PATH_TO_REPO/configs/repair/dataset/falconcode_lab.json
python scripts/run.py --config  $config_path --experiment FinetunedRepair --evaluate --evaluate_config PATH_TO_REPO/configs/repair/dataset/singapore.json

The final results can be obtained by running the results.ipynb jupyter notebooks

About

Repository of the code for the paper "Benchmarking Student Program Repair"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published