CpG Calling with Reference #196
Hello @SophieValk,

If you use a command such as this

$ modkit pileup ${mod_bam} ${pileup_bedmethyl} --cpg --ref ${ref}

The bedmethyl records will only be for positions that are

$ samtools view -b -f 4 ${modbam} > unaligned.bam

then look at the calls with

$ modkit extract unaligned.bam raw_probabilities.tsv --read-calls read_calls.tsv

If you want the positions that were CpG in the read but not CpG in the reference (reference CH), you can use

$ modkit pileup ${mod_bam} ${pileup_ch_bedmethyl} --motif CH 0 --ref ${ref}

In the next release I'm going to add functionality that will allow you to subset the modification calls in a set of reads to specific read sequences. For now, you'll have to use the CpG modification model.

Hope this helps, happy to answer any more questions you have.
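To make the distinction between reference CpG and read-only CpG concrete, here is a minimal Python sketch. It is not how modkit works internally; it only illustrates the idea on toy sequences. The helper names (`cpg_positions`, `classify_read_cpgs`) and the example sequences are hypothetical, and the sketch assumes the read is aligned to the reference without gaps, so both strings have the same length.

```python
def cpg_positions(seq):
    """Return the 0-based indices of every C that is immediately followed by G."""
    seq = seq.upper()
    return {i for i in range(len(seq) - 1) if seq[i] == "C" and seq[i + 1] == "G"}


def classify_read_cpgs(ref, read):
    """Split the CpG positions found in the read into two sorted lists:
    positions that are also CpG in the reference, and positions that are
    CpG only in the read.

    Assumption (illustration only): ref and read are the same length and
    aligned base-for-base, with no insertions or deletions.
    """
    if len(ref) != len(read):
        raise ValueError("this sketch assumes a gap-free alignment of equal length")
    ref_cpg = cpg_positions(ref)
    read_cpg = cpg_positions(read)
    shared = sorted(read_cpg & ref_cpg)
    read_only = sorted(read_cpg - ref_cpg)
    return shared, read_only


# Toy sequences: the read carries an A->G change at index 6, which creates
# a CpG at index 5 that the reference (CA at that position) does not have.
ref = "ACGTTCATCGA"
read = "ACGTTCGTCGA"

shared, read_only = classify_read_cpgs(ref, read)
print("CpG in both read and reference:", shared)
print("CpG in read only (reference CH):", read_only)
```

In this toy case the positions in `shared` are the kind of sites a reference-restricted CpG pileup is concerned with, while those in `read_only` are the read CpGs that sit on a reference CH.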
Thanks @ArtRand for your clear and quick reply.
Am I understanding correctly that when you call CpGs using modkit pileup with the --ref flag, only the CpGs that are present in the reference genome will get called? And that any site that is a CpG in your .bam files but not in the reference genome will not be included in the modkit pileup bedMethyl output file?
Thanks in advance,
Sophie