You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I downloaded cDNA fasta files and gtf files of the fungal database in Ensembl. I concatenated these into single fasta and gtf files and then tried to index them. However, I get the following error when I run the index command, even on a 382G high memory node:
kallisto index -i test.gtf.gz test.fa.gz
[build] loading fasta file test.fa.gz [build] k-mer length: 31 [build] warning: clipped off poly-A tail (longer than 10) from 4479 target sequences [build] warning: replaced 2455576 non-ACGUT characters in the input sequence with pseudorandom nucleotides [build] counting k-mers ... terminate called after throwing an instance of 'std::bad_alloc' what(): std::bad_alloc /var/log/slurm/spool_slurmd/job6981046/slurm_script: line 20: 136148 Aborted kallisto index -i test.fa.gz
The test data completes with the kallisto index command. Is this a memory issue? Do I need more that a 382G node with 2 CPUs and 40 cores? Here is my system information:
kallisto 0.45.1
(GNU libc) 2.17
Linux RedHatEnterpriseServer
Red Hat Enterprise Linux Server release 7.4 (Maipo)
Input files:
database size:
test.gtf.gz 1.3G
test.fa.gz 4.3G
I removed most of the header information from the fasta, here is a chunk of the file:
Hi all,
I downloaded cDNA fasta files and gtf files of the fungal database in Ensembl. I concatenated these into single fasta and gtf files and then tried to index them. However, I get the following error when I run the index command, even on a 382G high memory node:
kallisto index -i test.gtf.gz test.fa.gz
[build] loading fasta file test.fa.gz [build] k-mer length: 31 [build] warning: clipped off poly-A tail (longer than 10) from 4479 target sequences [build] warning: replaced 2455576 non-ACGUT characters in the input sequence with pseudorandom nucleotides [build] counting k-mers ... terminate called after throwing an instance of 'std::bad_alloc' what(): std::bad_alloc /var/log/slurm/spool_slurmd/job6981046/slurm_script: line 20: 136148 Aborted kallisto index -i test.fa.gz
The test data completes with the kallisto index command. Is this a memory issue? Do I need more that a 382G node with 2 CPUs and 40 cores? Here is my system information:
kallisto 0.45.1
(GNU libc) 2.17
Linux RedHatEnterpriseServer
Red Hat Enterprise Linux Server release 7.4 (Maipo)
Input files:
database size:
test.gtf.gz 1.3G
test.fa.gz 4.3G
I removed most of the header information from the fasta, here is a chunk of the file:
test.fa.gz
I removed comment lines from the gtf file which contained information about each fungal species, here is a chunk of the file:
test.gtf.gz
Thanks,
James
The text was updated successfully, but these errors were encountered: