-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
alignment problem with reference genome #661
Comments
Hi Silvia, something here definitely looks fishy (no pun intended). Could you provide me a with a few reads of your read library (ideally raw, untrimmed reads, and let me know which kind of library prep had been performed). Maybe 200K sequences (gzipped), as an email attachement? I will in the meantime try to get the reference sequence and have it prepared. Cheers, Felix |
Thanks a lot for your fast reply! |
I took your files, trimmed 10bp from both 5' ends, and ran alignments: (directional):
So all works well, there is a 50/50 split between top and bottom strand. I am not exactly sure what went wrong on your side, but something did... Do you have enough resources available at all? |
I apologize for my late reply but I was trying to figure out what was my problem. I think I had an issue with the original fasta file of the reference genome, now I fixed it thanks to your indications. |
That's good to hear! All the best |
Hello Felix,
I’m a PhD student and I’m currently working on epigenetics data (DNA methylation) with limpets. I work in parallel with two different species, Patella caerulea and Patella ulyssiponensis, but I have some problems with the alignment of P. ulyssiponensis sequences to its reference genome. I know that it is a problem of the reference genome because I’ve aligned the sequence of P. ulyssiponensis to the other reference genome that I have (P. caerulea) and it worked well, even if the map efficiency was low (10.8%). The genome preparation was the same for the two species. And I also know that the script for the alignment is ok because it worked well with P. caerulea.
Final Alignment report
Sequence pairs analysed in total: 13359954
Number of paired-end alignments with a unique best hit: 5061646
Mapping efficiency: 37.9%
Sequence pairs with no alignments under any condition: 6812837
Sequence pairs did not map uniquely: 1485471
Sequence pairs which were discarded because genomic sequence could not be extracted: 5061646
Number of sequence pairs with unique best (first) alignment came from the bowtie output:
CT/GA/CT: 0 ((converted) top strand)
GA/CT/CT: 0 (complementary to (converted) top strand)
GA/CT/GA: 0 (complementary to (converted) bottom strand)
CT/GA/GA: 2531542 ((converted) bottom strand)
Number of alignments to (merely theoretical) complementary strands being rejected in total: 0
Final Cytosine Methylation Report
Total number of C's analysed: 0
Total methylated C's in CpG context: 0
Total methylated C's in CHG context: 0
Total methylated C's in CHH context: 0
Total methylated C's in Unknown context: 0
Total unmethylated C's in CpG context: 0
Total unmethylated C's in CHG context: 0
Total unmethylated C's in CHH context: 0
Total unmethylated C's in Unknown context: 0
Can't determine percentage of methylated Cs in CpG context if value was 0
Can't determine percentage of methylated Cs in CHG context if value was 0
Can't determine percentage of methylated Cs in CHH context if value was 0
Can't determine percentage of methylated Cs in unknown context (CN or CHN) if value was 0
Bismark completed in 0d 3h 25m 51s
Here you can find other details:
Do you have any suggestion?
I really thank you in advance for your availability.
Silvia Signorini
The text was updated successfully, but these errors were encountered: