This repository quantifies term cooccurrence in MEDLINE. It's designed for computing the cooccurence of all pairs between two MeSH termsets. The repository computes MEDLINE cooccurences for the Rephetio hetnet. See the corresponding Thinklab discussion for more information.
eutility.py
defines anesearch_query
function for retreiving PubMed IDs matching a user-defined query.cooccurrence.py
computes the cooccurences bewteen two termsets, whose associated PubMed IDs have been retrieved.
diseases.ipynb
computes disease-disease cooccurrencesymptoms.ipynb
computes symptom-disease cooccurrencetissues.ipynb
computes anatomy-disease cooccurrence. This notebook depends ondata/disease-pmids.tsv.gz
, a dataset created bysymptoms.ipynb
.
This repository is released under CC0 1.0.