Skip to content

Commit

Permalink
Update raw_data.md
Browse files Browse the repository at this point in the history
  • Loading branch information
alquraishi committed May 24, 2019
1 parent 0f439fa commit 8aef525
Showing 1 changed file with 8 additions and 1 deletion.
9 changes: 8 additions & 1 deletion docs/raw_data.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,11 @@
# Raw Data
The raw data comprising all MSAs for ProteinNet12 is available for download upon request. The data is large, approximately 4TB in size, and requires a Globus client for downloading. Please [email us](mailto:[email protected]) to request access.
## MSAs
The raw MSAs for ProteinNet12 are available for download upon request. The data is large, approximately 4TB in size, and requires a Globus client for downloading. Please [email us](mailto:[email protected]) to request access.

Once we are able to provide broad access we will post a public Globus endpoint for all ProteinNets.

## Sequence databases
If you wish to generate new MSAs using the same sequence databases used to construct ProteinNet, you may download the sequence databases in FASTA format using the links below. Note that the databases range in size from ~700MB (ProteinNet7) to ~44GB (ProteinNet12).

| [ProteinNet7](https://sharehost.hms.harvard.edu/sysbio/alquraishi/proteinnet/sequence_dbs/proteinnet7.gz) | [ProteinNet8](https://sharehost.hms.harvard.edu/sysbio/alquraishi/proteinnet/sequence_dbs/proteinnet8.gz) | [ProteinNet9](https://sharehost.hms.harvard.edu/sysbio/alquraishi/proteinnet/sequence_dbs/proteinnet9.gz) | [ProteinNet10](https://sharehost.hms.harvard.edu/sysbio/alquraishi/proteinnet/sequence_dbs/proteinnet10.gz) | [ProteinNet11](https://sharehost.hms.harvard.edu/sysbio/alquraishi/proteinnet/sequence_dbs/proteinnet11.gz) | [ProteinNet12](https://sharehost.hms.harvard.edu/sysbio/alquraishi/proteinnet/sequence_dbs/proteinnet12.gz) |
| --- | --- | --- | --- | --- | --- |

0 comments on commit 8aef525

Please sign in to comment.