#####Update 5/12/2016: Construction of a new, packaged version is underway (v1)!.
#####Update 1/8/2016: This repo is under construction. I'll be pushing some usable updates soon, but they may require some adaptation to your configutations. Working on a more generalized version for the near future.
Here's a general pipeline for this code.
- Get a reference genome/assembly by chromosome (ie. chr1.fa, chr2.fa,)
- Generate a distribution of chromosome lengths (generate_chrdist.py)
- simulation.py (many parameters still must be hard-coded in this draft version.) This generates a BED file of simulated reads.
- sortBed.sh - sorts BED files by chromosome.
- getFasta.sh - requires BEDTools, generates simulated FASTA files. This is written to be easily parallelizable.
- (Optional, but used in AGBT presentation) countSimDimers.sh
This is a draft version of pblibsim. Your mileage may vary.