Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

unexpected output for subseq on bed files with blank column 5 #195

Closed
timedreamer opened this issue Mar 1, 2021 · 1 comment
Closed

Comments

@timedreamer
Copy link

Hi Wei!

Maybe this is an enhancement. I notice some unexpected output when working with bed files using seqkit subseq. An example is at the end.

I sometimes got bed files with a blank column 5, but still has strand in column 6. Basically, if column 5 of the bed file is blank (still hold the column), then the column six strand information will not be used by seqkit subseq.

Although it can be easily adjusted by adding some text in column 5, the trick is not in the document (I think?).

Maybe worth mentioning this in the doc or allow the column 5 be blank.

Please correct me if I missed something!

Thanks for this awesome tool.
Ji


An example:

[V4] cat -A t1.bed
seq^I5^I8^IA^I^I+$
seq^I5^I8^IB^I^I-$
[V4] cat t.fa
>seq
actgACTGactgn
[V4] seqkit subseq --bed t1.bed t.fa
[INFO] read BED file ...
[INFO] 2 BED features loaded
>seq_6-8:. A
CTG
>seq_6-8:. B
CTG

@shenwei356
Copy link
Owner

Thanks for reporting. It's an old bug :(

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants