Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What is exactly mod_qual? #181

Open
OceaneMion opened this issue May 12, 2024 · 3 comments
Open

What is exactly mod_qual? #181

OceaneMion opened this issue May 12, 2024 · 3 comments
Labels
question Further information is requested

Comments

@OceaneMion
Copy link

Hi I would like to know the difference between mod_qual and fraction modified ?

What do you mean exactly by mod_qual, I know that fraction modified is basically the number of modified base / total number of cpg modified or not.
But what does mod_qual represent is it the quality of prediction ? I don't really understand the difference.

Thanks in avance for your help

@ArtRand
Copy link
Contributor

ArtRand commented May 20, 2024

Hello @OceaneMion,

Sorry for the slow reply.

Are you referring to the table from modkit extract? As stated in the docs mod_qual is "probability of the base modification in the next column".

@ArtRand ArtRand added the question Further information is requested label May 20, 2024
@OceaneMion
Copy link
Author

Yes thank you but the value obtain in modkit pileup for the methylation is really different, is it because it is averaging the methylation of each reads at each genomic position ? So that for example if I have reads that overlap at a specific genomic position it will do the mean of methylation for this position right ?

@ArtRand
Copy link
Contributor

ArtRand commented May 21, 2024

Hello @OceaneMion,

The ouput of pileup is a count of how many reads called each modification (or canonical) at a given genome position. You get a "percent modification" from the pileup meaning %-of reads calling the modification, so if you have 10 reads and 8 call 5mC, you'll see 80%. This is different than mod_qual which is a metric on each specific read at a given position. So for example, take a single read with a mod_qual of 0.95 for mod_code m this means that at that position in the read, the base modification model assigned a 0.95 probability of 5mC. If this value is above the pass threshold (details here) this individual read call will tabulated in the N_mod count in the pileup table.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants