__pycache__
baselines
bash_files
figs
plotting_files
running_logs
.DS_Store
ALG.py
ALG_MLP.py
README.md
cluster.py
data_util.py
models.py
requirements.txt
run_GCN_datautil.py
utils.py

README.md

Active Learning Framework for Graph Convolutional Networks and their application to virus classification

Requirements

To install requirements:

pip install -r requirements.txt

Datasets

Before training any of the models below, place your dataset in the "../data" directory.

Training

To train the model(s) in the paper:

Training ALG models

Run the following commands to train the ALG GCN and MLP models:

python3 ALG.py --dataset=[hostg.phylum, hostg.genus, hostg.class, hostg.family, hostg.order] --seeds $((seeds))
python3 ALG_MLP.py --dataset=[hostg.phylum, hostg.genus, hostg.class, hostg.family, hostg.order] --seeds $((seeds))
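The bracketed list in the commands above names the available dataset levels; each run takes one of them. To sweep several levels and seeds, the invocations can be generated programmatically. The following is a minimal sketch, assuming that `--seeds` accepts a single integer per run (the README does not say so explicitly). The helper `build_commands` is hypothetical and not part of this repository; it only builds the argument lists and does not launch anything.

```python
import itertools
import shlex

# Dataset levels listed in the README's --dataset option.
DATASETS = [
    "hostg.phylum",
    "hostg.class",
    "hostg.order",
    "hostg.family",
    "hostg.genus",
]


def build_commands(script, seeds, datasets=DATASETS):
    """Return one argv list per (dataset, seed) pair for the given script.

    Assumption: --seeds takes one integer value per invocation.
    """
    return [
        ["python3", script, f"--dataset={dataset}", "--seeds", str(seed)]
        for dataset, seed in itertools.product(datasets, seeds)
    ]


if __name__ == "__main__":
    # Print the command lines so they can be inspected or pasted into a shell.
    for argv in build_commands("ALG.py", seeds=[0, 1, 2]):
        print(shlex.join(argv))
```

Each returned list can be passed to `subprocess.run` as-is if you prefer to launch the runs from Python instead of a shell script.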

Training baseline active learning methods

We implemented two baseline active learning strategies, random selection and entropy-based selection:

python3 baseline_unified.py --dataset=[hostg.phylum, hostg.genus, hostg.class, hostg.family, hostg.order] --method=[random, entropy] --seeds $((seeds))
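To illustrate how the two baseline strategies differ, here is a minimal, self-contained sketch. It is not taken from `baseline_unified.py`; the function names, the dictionary of predicted class probabilities, and the toy numbers are all assumptions made for the example. Entropy-based selection queries the nodes whose predicted class distribution is most uncertain, while random selection ignores the model's predictions.

```python
import math
import random


def entropy(probs):
    """Shannon entropy (in nats) of one predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_entropy(probabilities, unlabeled, budget):
    """Pick the `budget` unlabeled nodes with the highest prediction entropy."""
    ranked = sorted(
        unlabeled,
        key=lambda node: entropy(probabilities[node]),
        reverse=True,
    )
    return ranked[:budget]


def select_random(unlabeled, budget, seed=0):
    """Pick `budget` unlabeled nodes uniformly at random."""
    rng = random.Random(seed)
    return rng.sample(list(unlabeled), budget)


# Toy predictions for four nodes over three classes (made-up numbers).
probabilities = {
    0: [0.98, 0.01, 0.01],  # confident prediction, low entropy
    1: [0.34, 0.33, 0.33],  # nearly uniform, highest entropy
    2: [0.70, 0.20, 0.10],
    3: [0.50, 0.50, 0.00],
}

chosen = select_entropy(probabilities, unlabeled=[0, 1, 2, 3], budget=2)
print(chosen)  # [1, 2]
```

In an active learning loop, the selected nodes would be labeled, added to the training set, and the model retrained before the next round of selection.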

Training using all data

As an alternative to selecting nodes with active learning, we also provide models trained on all the data:

python3 baseline_train_all.py --dataset=[hostg.phylum, hostg.genus, hostg.class, hostg.family, hostg.order] --seeds $((seeds))

Parameters:

dataset: dataset used in training
hidden_size: batch size
no_hosts: whether to include host nodes in the initially labeled datasets
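For orientation, the parameters above could be declared with `argparse` roughly as follows. This is a hypothetical sketch, not the parser used in `ALG.py`: the default values, the integer type of `--seeds`, and the treatment of `--no_hosts` as a boolean switch are assumptions. The help strings repeat the descriptions given in this README.

```python
import argparse

# Dataset levels listed in the README's --dataset option.
DATASETS = [
    "hostg.phylum",
    "hostg.class",
    "hostg.order",
    "hostg.family",
    "hostg.genus",
]


def build_parser():
    """Build a parser mirroring the documented options (defaults are placeholders)."""
    parser = argparse.ArgumentParser(
        description="Hypothetical sketch of the training options"
    )
    parser.add_argument("--dataset", choices=DATASETS, required=True,
                        help="dataset used in training")
    # Assumption: an integer option; 64 is a placeholder default.
    parser.add_argument("--hidden_size", type=int, default=64,
                        help="described in the README as the batch size")
    # Assumption: a boolean switch that is off unless passed.
    parser.add_argument("--no_hosts", action="store_true",
                        help="whether to include host nodes in the initially "
                             "labeled datasets")
    # Assumption: a single integer seed per run.
    parser.add_argument("--seeds", type=int, default=0)
    return parser


args = build_parser().parse_args(["--dataset=hostg.genus", "--seeds", "3"])
print(args.dataset, args.seeds, args.no_hosts)  # hostg.genus 3 False
```

Restricting `--dataset` with `choices` makes a mistyped dataset name fail immediately with a usage message rather than later during data loading.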
