-
Notifications
You must be signed in to change notification settings - Fork 695
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow index format specification in samtools view #4326
base: master
Are you sure you want to change the base?
Allow index format specification in samtools view #4326
Conversation
adding an option so that if write_indexes is used the user can choose what index type they want generated.
tests are failing, but not sure how they passed before, because of the naming of the test files??
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Although I don't really like this solution, I can't think of a better way to handle this (this is such a weird syntax)
not sure about the |
I think we should avoid making any |
I like this solution way better! |
Mmm, I'm generally happy to define additional |
I guess the question here is, does the user need to be able to set this, or should this be fixed in the workflow? |
I also wondered about whether it should be e.g. |
I would say However I also just realized that |
could I also suggest that we should consider backwards compatibility when
making these changes? adding an extra input means everyone that wants to
update this in their workflow potentially has to change their code in
multiple places given how common this module is? vs. providing as ext it
can be set once in module.config using regex?
…On Wed, 15 May 2024 at 09:57, Mahesh Binzer-Panchal < ***@***.***> wrote:
I also wondered about whether it should be e.g.${prefix}.bam.baior just
${prefix}.bai.
I would say ${prefix}.bam.bai
However I also just realized that index_type should be based on file_type
right? It doesn't make sense to have bam.csi
—
Reply to this email directly, view it on GitHub
<#4326 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AFVBH3K6JXKWW5LOJRG72FLZCMPOFAVCNFSM6AAAAAA7JPWSUKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMJRHE2DKNJQGU>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
My last question though was do we even need to have an input for this? Isn't it determined by the |
BAM and SAM files can have either bai or csi indexes:
https://www.htslib.org/doc/samtools-index.html
…On Wed, 15 May 2024 at 10:56, Mahesh Binzer-Panchal < ***@***.***> wrote:
My last question though was do we even need to have an input for this?
Isn't it determined by the file_type?
—
Reply to this email directly, view it on GitHub
<#4326 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AFVBH3PYBLUNTLRR2N2DMC3ZCMWLBAVCNFSM6AAAAAA7JPWSUKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMJSGA3TKNZXHE>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Currently it is set to have a default, that will just generate a .csi index. You don't need to use |
I'd say the basic default should be bai, as csi are mainly used for LARGE bam file index only as far as I know |
+1 on default being bai for bam |
At the same time, the default for |
I don't know that all downstream tools can interpret it? Do you have a lot
of experience with csi?
…On Wed, 15 May 2024 at 15:15, Felix Lenner ***@***.***> wrote:
At the same time, the default for --write-index is csi. Just curious, is
there any downside using csi for bam?
—
Reply to this email directly, view it on GitHub
<#4326 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AFVBH3IWFLY5FRZEKT4WVRLZCNUX5AVCNFSM6AAAAAA7JPWSUKVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMJSGY3TAMRSGE>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
No, not at all. But I was thinking about adding |
I've never used csi, only bai for everything. |
I tried using csi a bit but just encountered a tool that expects and needs bai, and I'm sure there might be others like it. Maybe bai is the best default. |
Yeah, bai is best |
I added a
In the samtools modules it would feel a bit strange to specify the output type in the config and then the index type in the workflow, but I don't really see a way around it without using Would be great to find a solution though, so the same can be added to other samtools modules. |
I'm leaning towards the idea of just adding another value channel which determines if the index is made, and if so what type; if empty no index is generated. |
How does this sound to everyone? Using |
adding an option so that if write_indexes is used the user can choose what index type they want generated.
PR checklist
Closes #XXX
versions.yml
file.label
PROFILE=docker pytest --tag <MODULE> --symlink --keep-workflow-wd --git-aware
PROFILE=singularity pytest --tag <MODULE> --symlink --keep-workflow-wd --git-aware
PROFILE=conda pytest --tag <MODULE> --symlink --keep-workflow-wd --git-aware