-
Notifications
You must be signed in to change notification settings - Fork 168
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Transcriptome ID mismatch in abundance files #116
Comments
"others will experience this reporting snafu"
|
Please answer this. I have the same problem. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
When Kallisto is run on a fasta reference with the following header style:
>0 ENST00000456328.2 chr1+ 11869-12227,12613-12721,13221-14409
The feature ID appears as "ENST00000456328.2" in the abundance.tsv output file, but it appears as "0" in the abundance.h5 file.
I noticed this because I was getting
when I ran Sleuth. I'm pretty sure this issue arises because the transcript IDs I provide did not match the abundance.h5 IDs, although they match perfectly with the abundance.tsv IDs.
This isn't a bug, per se, but rather a reporting issue that causes confusion. It would probably be best to have the abundance.tsv IDs match the abundance.h5 IDs, so as to reduce confusion. Note that the header style I am using is what gtf_to_fasta produces when run on GTF files from Gencode or Ensembl, so I think others will experience this reporting snafu.
The text was updated successfully, but these errors were encountered: