Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about CpGs without modification #164

Open
VasLem opened this issue Apr 16, 2024 · 1 comment
Open

Question about CpGs without modification #164

VasLem opened this issue Apr 16, 2024 · 1 comment
Labels
question Looking for clarification on inputs and/or outputs

Comments

@VasLem
Copy link

VasLem commented Apr 16, 2024

I am currently comparing different methylation calling protocols (including enzymatic technologies), but for the sake of the experiments I also need to report the coverage per detected CpG. Could you please confirm that the counts of unmodified CpGs across all occurrences do not reside in the produced modkit pileup --cpg bed file ? And if so, is it possible to somehow include them in the output? Thank you in advance.

@ArtRand
Copy link
Contributor

ArtRand commented Apr 17, 2024

Hello @VasLem,

When using modkit pileup --cpg all CpGs with at least 1 read of valid coverage will be emitted in the output. If you want all CpGs with any coverage at all, you'll have to specify --no-filtering in which case all base modification calls will "pass" and any CpG with at least a single read of coverage will be emitted. There is currently no flag to emit bedMethyl records with 0 valid coverage - but that's not a bad idea. If you have CpGs without any coverage at all, they will also be omitted. If you need these, probably the easiest thing to do is run modkit motif-bed ${fasta} CG 0 then bedtools intersect -loj. Does that answer your question or did I miss something?

@ArtRand ArtRand added the question Looking for clarification on inputs and/or outputs label Apr 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Looking for clarification on inputs and/or outputs
Projects
None yet
Development

No branches or pull requests

2 participants