Toma
|-- Tokenization_and_FeatureExtraction.py // implement the first two phases: Tokenization phase and Feature Extraction phase
|-- Classification.py // implement the Classification phase
- Input: dataset with source codes
- Output: feature vectors of code pairs
python Tokenization_and_FeatureExtraction.py
- Input: feature vectors of dataset
- Output: recall, precision, and F1 scores of machine learning algorithm
python classification.py