Update ReadMe.md
Akmazad authored Sep 25, 2020
1 parent 1c53377 commit d8604bd
Showing 1 changed file with 1 addition and 2 deletions.
@@ -54,7 +54,6 @@ sort -k 1,1 -k2,2n mergedPeakHeightMatrix_HumanFC_filtered_single_label.overlaps
# for saving non-zero binInfo
awk -F '\t' ' {for(i=5; i<=NF; i++) if ($i != -1) {print $1"\t"$2"\t"$3"\t"$4"\t"$5; break;} }' mergedPeakHeightMatrix_HumanFC_filtered_single_label.overlaps.dropped.fixed.filtered.sorted.bed > HumanFC_single_label_nonZero.binInfo.bed
```
-## fix the bins ending when it goes beyond the refWidth

## Extract DNA sequences
```r
@@ -76,7 +75,7 @@ seq <- getSeq(hg, bins$chr, start = bins$start - flankingLength, end = bins$end
seq <- as.character(as.data.frame(seq)[[1]])
bins.seq <- cbind(bins,seq)
colnames(bins.seq) <- c(colnames(bins),"dna.seq")
-fwrite(bins.seq, file="HumanFC_single_label.bin.Seq.bed", sep="\t", row.names=F, quote=F)
+fwrite(bins.seq, file="HumanFC_single_label.bin.Seq_Labels.bed", sep="\t", row.names=F, quote=F)
```

################## End of data-processing Pipeline ##############
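
The `awk` filter shown as context in the first hunk keeps a bin as soon as any column from the fifth onward differs from `-1`, and prints only the first five columns of that row. The sketch below runs the same filter on three made-up rows to show which ones survive. The sample rows, the `filter_bins` wrapper, and the reading of `-1` as "no label" are assumptions for illustration and do not come from the repository.

```shell
# Same awk program as in the ReadMe, wrapped in a hypothetical helper that reads stdin.
filter_bins() {
  awk -F '\t' '{ for (i = 5; i <= NF; i++) if ($i != -1) { print $1"\t"$2"\t"$3"\t"$4"\t"$5; break } }'
}

# Made-up input: chr, start, end, bin id, then two label columns.
# Assumption: -1 marks an empty label, as the `!= -1` test suggests.
sample=$(printf 'chr1\t0\t200\tb1\t-1\t-1\nchr1\t200\t400\tb2\t-1\t0.7\nchr2\t0\t200\tb3\t1.5\t-1\n')

result=$(printf '%s\n' "$sample" | filter_bins)
printf '%s\n' "$result"
# b1 is dropped (every label column is -1); b2 and b3 are kept.
```

Note that a kept row is truncated to its first five fields, so for `b2` the printed fifth column is still `-1` even though the sixth column is what made the row pass.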