Skip to content

KGRACDA is a novel computational model for predicting circRNA-disease associations. It utilizes knowledge graph representation and recursion to explicitly capture intrinsic attributes and deep information between biological entities.

License

Notifications You must be signed in to change notification settings

maoyuanma/KGRACDA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KGRACDA

Introduction

KGRACDA: Knowledge Graph-based Recursion and Attention Aggregation Model for CircRNA-Disease Association Prediction KGRACDA is a CircrNa-disease association prediction model based on knowledge graph recursion and attention aggregation. It can explicitly capture deep information between entities and generate reliable CIRcrNa-disease association predictions.

KGRACDA consistently achieves remarkable performance on three datasets (dataset1---a small dataset focusing on non-cancer used in previous works; dataset2---a larger heterogeneous dataset ; dataset3---a small dataset focusing on cancer used in previous works)

We also provide a user-friendly interactive web-based platform (named HNRBase v2.0), which is publicly available at https://lab-fly.site/KGETCDA.

Requirements

The code has been tested running under Python 3.9.15. The required packages are as follows:

  • torch == 1.11.0

The expected structure of files is:

 ── KGRACDA
    ├── data
    │   ├── dataset1
    │   ├── dataset1_ind
    │   ├── dataset2
    │   ├── dataset2_ind
    │   ├── dataset3
    │   └── dataset3_ind
    ├── results
    ├── base_model.py     
    ├── utils_KGRACDA.py
    ├── train.py
    ├── load_data.py
    └── models_KGRACDA.py

Dataset

We use three datasets here, and provide 4 entities (circRNA, miRNA, lncRNA, disease) and 5 relations (circRNA-disease, miRNA-disease, lncRNA-disease, circRNA-miRNA, miRNA-lncRNA) file and all pairs. The summary information is listed as follows:

Dataset circ-dis mir-dis lnc-dis circ-mir mir-lnc total
dataset1 346 106 527 146 202 1327
dataset2 1399 10154 3280 1129 9506 25468
dataset3 647 732 1066 756 308 3509
  • entity.txt

    • All entities file.
    • Each line is an entity with its ID: (name and ID).
  • relations.txt

    • All relations file.
    • Each line is a relation with its ID: (name and ID).
  • test.txt

    • test pairs file.
    • Each line is an entity1, an entity2, and relation: (name,'relation','name2').
  • train.txt

    • train pairs file.
    • Each line is an entity1, an entity2, and relation: (name,'relation','name2').
  • valid.txt

    • valid pairs file.
    • Each line is an entity1, an entity2, and relation: (name,'relation','name2').

Usage

You can directly run the above model KGRACDA.

We also recommend users use our KGRACDA Webserver HRNBase v2.0 (https://lab-fly.site/KGETCDA), which is user-friendly and easy to use. Consisting of 4 core functions (intelligent search and browse, model prediction, information visualization, and advanced interaction), our web-based platform enables novel visualization, accessible resources and user-friendly interaction. Everyone could upload or typein the candidate circRNAs or diseases of interest in our web without further installation, our backend server will calculate and give the prediction results to the user. Users can also choose to download the predict csv file results.

Compared methods and related papers.

In this paper, we compare our model with 9 SOTAs including: KGETCDA, GMNN2CD, KGANCDA, RNMFLP, AE-RF, DMFCDA, CD-LNLP, RWR, KATZHCDA, which are compared under the same experiment settings. The parameters of the other 9 models maintain consistency with their original papers.

About

KGRACDA is a novel computational model for predicting circRNA-disease associations. It utilizes knowledge graph representation and recursion to explicitly capture intrinsic attributes and deep information between biological entities.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages