DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
-
Updated
Mar 9, 2024 - Python
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
Quickly search, compare, and analyze genomic and metagenomic data sets.
Inference of ploidy and heterozygosity structure using whole genome sequencing data
Accurate metagenomic profiling && Fast large-scale sequence/genome searching
De novo genome assembly and multisample variant calling
A versatile toolkit for k-mers with taxonomic information
modular k-mer count matrix and Bloom filter construction for large read collections
Generate unique KMERs for every contig in a FASTA file
Fast k-mer based tool for multi locus sequence typing (MLST)
Predict plasmids from uncorrected long read data
counts k-mers, written in rust
Count kmers with a more efficient (faster) hash table
Generate kmers/minimizers/hashes/MinHash signatures, including with multiple kmer sizes.
k-mer similarity analysis pipeline
🎯 Generate DNA sequences with specified amino acid, codon, and k-mer frequencies
fast and comprehensive k-mer counting package
Fast and space-efficient taxonomic classification of long reads
Add a description, image, and links to the kmer topic page so that developers can more easily learn about it.
To associate your repository with the kmer topic, visit your repo's landing page and select "manage topics."