Skip to content

Commit

Permalink
Update getVariantSeq.R
Browse files Browse the repository at this point in the history
  • Loading branch information
Akmazad committed Aug 17, 2020
1 parent 967014a commit 42f2934
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions Data Processing/Variant data processing/getVariantSeq.R
Original file line number Diff line number Diff line change
Expand Up @@ -84,6 +84,11 @@ dat.sm.comb = dplyr::inner_join(dat.sm, allele.info, by=c("SNP_id" = "PEC_id"))
dat.sm.comb = dat.sm.comb %>% dplyr::select(c("SNP_chr","SNP_start","SNP_end","SNP_id","REF","ALT", "strand"))
fwrite(dat.sm.comb, paste0(dir,allSNPFileName,".BED"), col.names=F, sep="\t") # this list has both coding and non-coding SNPs)


# for HGMD file
# first run process_HGMD_file.R file (https://github.com/Akmazad/deepPsych/blob/master/Data%20Processing/Variant%20data%20processing/process_HGMD_file.R)
# allSNPFileName = "HGMD_Search_Results_PromoterActivity_processed"

message("Intersect BED to get non-coding based SNPs:",appendLF=F)
system2('intersectBed',
paste0('-v -a ', paste0(dir, allSNPFileName,".BED "), ' -b ', paste0(dir,codingExonFileName,".BED ")),
Expand Down

0 comments on commit 42f2934

Please sign in to comment.