Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Methylpy in plant #76

Open
zzh4399 opened this issue Mar 21, 2022 · 7 comments
Open

Methylpy in plant #76

zzh4399 opened this issue Mar 21, 2022 · 7 comments

Comments

@zzh4399
Copy link

zzh4399 commented Mar 21, 2022

Dear yupenghe,
I would like to know whether methylpy can be used for methylation analysis of plant genome because the results I get using methylpy are quite different from those of Bismark.
Thanks.

@yupenghe
Copy link
Owner

Yes, methylpy works on plant genome. Do you mind to describe the difference you referred to?

@zzh4399
Copy link
Author

zzh4399 commented Mar 22, 2022

I am very glad to receive your reply. Methylpy calculates a lower rate of methylation for the three types than Bismark (about half as much), and I tried to modify the comparison parameters, but it didn't seem to work.

@yupenghe
Copy link
Owner

That is interesting. It would be helpful to have some cases (e.g. methylated and unmethylated counts of a few Cs from methylpy and bismark). Also, is the library typical directional bisulfite sequencing library? is it pbat?

@zzh4399
Copy link
Author

zzh4399 commented Mar 22, 2022

It is really interesting. We found that methylation rates of CpG, CHG and CHH types calculated with Bismark are 40%, 20% and 2% respectively, while those calculated with methylpy are 20%, 10% and 0.5% respectively. We randomly found a single site and found that the methylation rate and the number of reads covering the site were different between the two software.

Methylpy seems to be more rigorous in determining the methylation of individual sites, which may be the reason for its lower methylation rate. For example, we found that methylpy calculated methylation rate of 0.8 and bismark calculated methylation rate of 1 for the same site.

@yupenghe
Copy link
Owner

That is interesting. What are the parameters you used to run bismark and methylpy? I did some comparison a while back and the results from these two methods are very close. I am wondering if any specific setting is used.

@zzh4399
Copy link
Author

zzh4399 commented Mar 23, 2022

Methylpy has a significant advantage in running speed and is easy to understand. In addition to comparing our own sequencing files, we also used methylpy to analyze the documented data (all from plants: Oryza sativa). We found that when methylpy was used in plant genomes, the methylation rate was significantly reduced, about half. These are the parameters of the two software we use:


methylpy paired-end-pipeline --read1-files M1-D_FDLM220001805-1a_1.clean.fq.gz --read2-files M1-D_FDLM220001805-1a_2.clean.fq.gz --forward-ref ~/bq/methy/db/rice_f --reverse-ref ~/bq/methy/db/rice_r --ref-fasta ~/bq/methy/db/IRGSP-1.0_genome.fasta --path-to-output zzh --num-procs 40 --sample M1_D


bismark --bowtie2 -N 0 -L 20 --quiet --un --ambiguous --sam -o output ~/bq/methy/db/ -1 M1-D_FDLM220001805-1a_1.clean.fq.gz -2 M1-D_FDLM220001805-1a_2.clean.fq.gz #sequences alignment

deduplicate_bismark M1-D_FDLM220001805-1a_1.clean_bismark_bt2_pe.bam # Dropping deduplication

bismark_methylation_extractor --no_overlap --paired-end --bedGraph --comprehensive --counts --remove_spaces --cytosine_report --genome_folder ~/bq/methy/db/ --buffer_size 10G --CX ../output/M1-D_FDLM220001805-1a_1.clean_bismark_bt2_pe.deduplicated.bam # Calling methylation

@yupenghe
Copy link
Owner

Nothing looks outstanding. If you manually check a few CG/CHH/CHG sites, what are the counts of reads and methylated reads you got from methylpy and bismark? For me to understand this, I would need some example data to reproduce the difference you found.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants