GAT-GO:Accurate Protein Function Prediction via Graph Attention Networks with Predicted Structure Information

Citation

This is the official code repository for the paper "Accurate Protein Function Prediction via Graph Attention Networks with Predicted Structure Information".

Dependencies

pytorch >=1.7.1
pytorch-geometric >= 1.7.0

Predict sequence function with GAT-GO

To use GAT-GO with pre-processed data, we provided a data_loader which parses the pre-processed sequence features ussed in GAT-GO.
GAT-GO.py can be used to make prediction with pre-processed seuqnces. Examples can be found in this Notebook

python GAT-GO.py --ModelPath <PATH> --Device <CUDA device> --BatchSize <BatchSize> --SeqIDs <SeqIDs> --OutDir <OutDir>

**** Intput ****

--ModelPath : Path to trained model weights
--OutDir    : Output directory, where the result will be saved
--BatchSize : Batch size  
--Device    : CUDA device to be used for inferece

**** Output ****
GAT-GO_Results.pt will be saved at <OutDir> which is a serialized dictionary indexed by sequence identifiers provided in <SeqIDs>

To extract the GO-terms from the result, please see the Notebook example.

Data Format

For each sequence in PDB/PDBmmseq dataset, a serialized dictionary stores the processed features used in GAT-GO. Details can be found below

data.seq: One-hot encoded primary sequence
data.pssm: Sequence profile constructed from MSA
data.x: Residue level sequence embedding generated from ESM-1b
data.edge_index: Contact map index
data.seq_embed: Sequence level embedding generated from ESMA-1b
data.label: GO term annotation
data.chain_id: Sequence identifier

Data & Pre-trained model

Pre-processed data and pre-trained model can be downloaded at this link

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
examples		examples
images		images
src		src
GAT-GO.py		GAT-GO.py
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GAT-GO:Accurate Protein Function Prediction via Graph Attention Networks with Predicted Structure Information

Citation

Dependencies

Predict sequence function with GAT-GO

Data Format

Data & Pre-trained model

About

Releases

Packages

Languages

License

khyox/GAT-GO

Folders and files

Latest commit

History

Repository files navigation

GAT-GO:Accurate Protein Function Prediction via Graph Attention Networks with Predicted Structure Information

Citation

Dependencies

Predict sequence function with GAT-GO

Data Format

Data & Pre-trained model

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages