Collection of notebooks used for teaching data science, developed by Magnus Haraldson Høie and Andreas Fønss Møller.
This module covers dataset I/O handling, modelling and normalisation in pandas, principal component analysis, and differential gene expression analysis. The case data quantifies relative protein levels in malaria-infected mice, taken from Tiberti, N. et al., Scientific Reports, 2016.
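As a rough illustration of the normalisation and principal component analysis steps named above, the sketch below uses a small synthetic table. The sample names, protein names and values are invented for the example and are not taken from the study. PCA is done with a plain singular value decomposition in NumPy, which is one common approach and not necessarily the one used in the notebooks.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for a protein quantification table (NOT the study's data):
# rows are samples, columns are hypothetical proteins.
rng = np.random.default_rng(0)
samples = [f"control_{i}" for i in range(3)] + [f"infected_{i}" for i in range(3)]
levels = pd.DataFrame(
    rng.lognormal(mean=2.0, sigma=0.5, size=(6, 4)),
    index=samples,
    columns=["prot_A", "prot_B", "prot_C", "prot_D"],
)

# Log-transform, then z-score each protein (column) across samples.
log_levels = np.log2(levels)
zscores = (log_levels - log_levels.mean()) / log_levels.std(ddof=0)

# PCA via singular value decomposition of the centred, scaled matrix.
u, s, vt = np.linalg.svd(zscores.to_numpy(), full_matrices=False)
scores = pd.DataFrame(
    u * s,  # sample coordinates along each principal component
    index=samples,
    columns=[f"PC{i + 1}" for i in range(len(s))],
)
explained = s**2 / np.sum(s**2)  # fraction of variance per component

print(scores.round(2))
print(explained.round(3))
```

Plotting `scores["PC1"]` against `scores["PC2"]` would then show whether the control and infected samples separate along the leading components.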
This module covers efficient data processing in NumPy, analysis and visualisation of biological sequences, and graphing in Seaborn and Matplotlib. The case data is a set of 7.7 million bacterial sequences with associated temperature data, compiled by the iGEM Potsdam team for Kaggle from the Bacterial Diversity Metadatabase and UniProt.
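To give a flavour of what efficient sequence processing in NumPy can look like, the sketch below counts residue composition for a few sequences using broadcasting instead of Python loops. The sequences are made up and assumed to be of equal length; the format and alphabet of the actual dataset are not described here, so treat every name in the example as hypothetical.

```python
import numpy as np

# Made-up toy sequences of equal length (NOT from the Kaggle dataset).
sequences = ["MKVLA", "MKKLV", "AAVLM"]

# Alphabet observed in the toy data, in sorted order.
alphabet = np.array(sorted(set("".join(sequences))))

# Encode the sequences as a 2-D array of single characters: (n_sequences, length).
chars = np.array([list(seq) for seq in sequences])

# Broadcasting: compare every residue with every alphabet letter, giving a
# boolean array of shape (n_sequences, length, n_letters), then count per sequence.
counts = (chars[:, :, None] == alphabet[None, None, :]).sum(axis=1)
frequencies = counts / counts.sum(axis=1, keepdims=True)

print(alphabet)
print(counts)
```

The resulting `frequencies` array has one row per sequence and one column per letter, which is a convenient shape for a Seaborn heatmap or for relating composition to the temperature data.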