Skip to content
/ seq2tax Public

A deep learning-based sequence classifier applied to 16S rRNA

Notifications You must be signed in to change notification settings

robsyc/seq2tax

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

seq2tax

A deep learning-based sequence classifier applied to 16S rRNA.

This repo builds further on the findings of my bachelor's project with the goal of creating a highly accurate, easy-to-use and easy-to-interpret sequence classification tool for the 16S rRNA gene but which can be applied and quickly re-trained to any marker gene.

Things to work on:

  • additional data augmentation strategies (sequence cropping)
  • creating combined model that takes in both k-mer and one-hot encoded sequences
  • reduce training memory requirements by building a TensorFlow input pipeline (tf.data)
  • create a BERT-like multi-class & multi-label classifier
  • do something about sequence entries with partially missing labels and how to weight the model around this problem...

About

A deep learning-based sequence classifier applied to 16S rRNA

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published