Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

num-reads not working #179

Open
ShokodkoMariia opened this issue May 5, 2024 · 3 comments
Open

num-reads not working #179

ShokodkoMariia opened this issue May 5, 2024 · 3 comments
Labels
troubleshooting workflow and data preparation questions

Comments

@ShokodkoMariia
Copy link

Hello!
I have memory of 197G and it's not enough for summary of whole bam and even while trying to work with chromosomes there is the same problem with summary of seven most weighting ones.
I've tried this with 15cpu altering interval-size, sampling-frac or num-reads
modkit summary
-t ${CPU}
--log-filepath ${OUT_DIR}/modkit_debug.log
--interval-size 10000
--sampling-frac 0.1
--include-bed ${OUT_DIR}/${TARGET_BED}
${BAM_DIR}/${BAMFILE} > ${OUT_DIR}/modkit_summary.txt
And as result for one of chromosomes:
image
I saw that 119665 was used and thought that this could had caused the memorage problem while working with other chromosomes and the whole bam. And then altering sampling-frac or num-reads I saw that the figure of ~120000 reads stays the same every time (only when I set it to 0 number of reads was 0). Is there a problem with my usage of num-reads/sampling-frac?

@ArtRand
Copy link
Contributor

ArtRand commented May 6, 2024

Hello @ShokodkoMariia could you tell me what version of modkit you're using?

@ShokodkoMariia
Copy link
Author

i’ve tried both v0.2.7 and v0.2.8-rc1

@ArtRand
Copy link
Contributor

ArtRand commented May 9, 2024

Hello @ShokodkoMariia,

Sorry for the delay. Could you give me the exact commands you're using and whether or not they end up consuming excessive memory? Also could you tell me roughly what the BED file you're using is like (i.e. how many regions, how big are they).

@ArtRand ArtRand added the troubleshooting workflow and data preparation questions label May 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
troubleshooting workflow and data preparation questions
Projects
None yet
Development

No branches or pull requests

2 participants