Skip to content
View MattiaCinelli's full-sized avatar

Block or report MattiaCinelli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MattiaCinelli/README.md

LinkedIn Medium GitHub last commit

I am a Data Scientist, with experience in machine learning, computer science, statistics, mathematics, deep learning, data visualization, and communication in different STEM disciplines.

Ph.D. in Machine Learning applied to Computational Immunology in the department of Infection and Immunity at UCL, doctorate founded by Microsoft Research Cambridge.

Double background in computer science (M.Sc.) and molecular biology (B.Sc.). Extensive research experience gained in several international research laboratories across all my academic career. Motivated and interested in challenging and stimulating new goals.

Table of contents

Curriculum Vitae


Work Experience

(Back to top)

  • Team Leader Data Scientist at Department of Health and Social Care (contractor position) (Nov 2020 - Sep 2021)

    • Role: Team Leader in the Test Analytics and Modelling
    • Team: DHSC/T&T/Test Demand Modelling group Under
    • I'm the team leader for the group responsible for the creation and maintaining of the Python software that has enabled a reliable meta-analysis of the Covid-19 tests performed in England and, the creation of the weekly reporting on the prediction of testing for the government. My work in the department validated the use of custom created Python software for the pandemic effort, an approach that has been later adopted throughout the DHSC
  • Data Scientist at Sensyne Health (contractor position) (March 2020 – July 2020)

  • Data Scientist at Intellegens (Dec 2019 – March 2020)

    • Role: To build the company’s data analysis and feature engineering toolboxes
    • ETL and analyses of big data databases and presentation of the results to the business’ clients
    • Direct interaction with sales, clients to design and develop innovative solutions to customer needs
  • Computational Scientist at Inivata (Jan 2017 – Nov 2019)

    • Role: To use main computational statistical research for the analysis of cancer genomics data, biomarker development, optimisation of Inivata’s technology and analysis of clinical data in collaboration with Inivata’s partners.
    • Development of Inivata’s liquid biopsy technology, on the analysis of NGS for genomics lung cancer
    • Coding, testing, design of R packages and optimization of the production pipeline in Python and reporting in R/Latex for clinical decision support.

Education

(Back to top)

PhD

Doctor of Philosophy in Infection & Immunity

University College London (UCL), United Kingdom, 2013 – 2017

Field Of Study: Computational Immunology

  • Thesis title: Analysis of murine CDR3β repertoires using machine learning techniques. (2018)

  • Role: I analysed the role and mechanisms of the CDR3 sequence, a short protein region present on the T cell receptor, using state of the art of machine learning methods. The intent of my research was to apply and develop methods for the classification of CDR3 repertoires and, though the classification process, identify the amino acids and positions that play a major role in the mechanism of antigen recognition.

Founded by Microsoft 2013 PhD Scholarship Programme in EMEA Microsoft scholarships consist of an annual bursary up to a maximum of three years for PhD supervisors and students to do collaborative research projects with Microsoft Research Cambridge.

Publications:

Master Degree

Master’s Degree (MSc) in Bioinformatics

Alma Mater Studiorum – University of Bologna, Italy, 2011 – 2013

Field Of Study: Bioinformatics

  • Thesis title: Simulating gene co-regulatory networks in the development of B-cells.

Erasmus and thesis completion: University of Tampere, Finland. January-July 2013 Thesis in B cell genomics modelling using Random Neural Network in the Computational Biology Group, Institute of Biomedical Technology, University of Tampere (Finland)

The 10th International Workshop on Computational Systems Biology, WCSB 2013, June 10-12, Tampere, Finland

Bachelor Degree

Bachelor’s Degree (BSc) Molecular Biology

Università degli Studi di Padova, Italy, 2009 – 2011 and Università degli Studi dell’Aquila, Italy, 2007 – 2009

Field Of Study: Molecular Biology / Biotechnology


Articles

(Back to top)


Latest Projects:

(Back to top)

In this Kaggle competion a set of 5,863 chest X-ray images (anterior-posterior) were selected from retrospective cohorts of pediatric patients, between the age of one to five years old, in Guangzhou. All chest X-ray imaging was performed as part of the patient's routine clinical care.

All chest radiographs were screened by two expert physicians for quality control and removing all low quality or unreadable scans.

In the picture below are show the three type of chest X-ray present in the database:

Chest X-Ray Images

A fully published Python Packege Banner

This Python package handles user created color palettes scheme to be used in Python plotting libraries (matplotlib, seaborn, etch).

Users can use specific colors in accordance with their taste or company guidelines that are different from the defaults in matplotlib. Therefore, mypalette helps to create, store, and use color palettes.

It can use a text file from https://coolors.co or a list of hexadecimal codes and saves a JSON format, comprising of colors: name, hexadecimal and RGB codes.

Example of Data Visualization

Sankey Plot in Python

If you have been more than five seconds on r/dataisbeautiful/, you will have probably encountered a Sankey plot. Everyone uses to track their expenses, job searching and every multi step processes. Indeed, it is very suitable to visualize the progression of events and their outcome.

And in my opinion, they look great!

Mattia's GitHub stats

Popular repositories Loading

  1. fromRtoLatex fromRtoLatex Public

    Every report looks better in LaTeX!

    R 7 2

  2. AlgoRL AlgoRL Public

    A repo with the most common Reinforcement Algorithms coded in Python with the objective of better understanding and testing them.

    Python 4 1

  3. notebooks notebooks Public

    Various notebooks used as bases for different articles I have published, plus occasional interesting projects.

    Jupyter Notebook 2 6

  4. UCL_PhD_Latex_template UCL_PhD_Latex_template Public

    A template for a Thesis in Latex using UCL standards

    TeX 1 1

  5. awesome-readme awesome-readme Public

    Forked from pottekkat/awesome-readme

    A guide to writing an Awesome README. Read the full article in Towards Data Science.

    1

  6. mypalette mypalette Public

    Mypalette makes easier to access and work with beautiful color palettes for any plotting needs.

    Python