Skip to content

SAPPHIRE: a stacking-based ensemble learning framework for accurate prediction of thermophilic proteins

Notifications You must be signed in to change notification settings

plenoi/SAPPHIRE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SAPPHIRE

SAPPHIRE: a stacking-based ensemble learning framework for accurate prediction of thermophilic proteins.

Dependency

The packages that this program depends on is scikit-learn==0.24.1 or higher. You can run following command in terminal.
pip install scikit-learn==0.24.1

How to use SAPPHIRE

  1. Copy your fasta file into ./input and change the name to seq.fasta
  2. Extract PSSM features from your fasta file using pssmpro (https://github.com/deeprob/pssmpro). Please follow their tutorial carefully.
  3. After finish feature extraction, copy the required PSSM features for SAPPHIRE into the ./input folder. These required PSSM features are listed as the following.
  • psssm_composition.csv
  • rpm_pssm.csv
  • s_fpssm.csv
  1. Run command
    python sapphire.py
  2. The result including sequence Header, label and probability will be saved in ./output/predicted_result.csv

About

SAPPHIRE: a stacking-based ensemble learning framework for accurate prediction of thermophilic proteins

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages