Skip to content

N950/dsa2

 
 

Repository files navigation

Check here if latest commit is working :

Testing code

Main Main, test_fast_linux Main, test_full

Multi test_fast_linux test_full test_models

Preprocessors Check test_preprocess

Looking for contributors

 Maintain and setup roadmap of this excellent Data Science / ML repo.
 Goal is to unified Data Science and Machine Learning .
 Basic idea is to have one single dictionary/json for
        model, compute, data definition,
 --> easy to define, easy to track, easy to modify.

Install

 git clone 
 cd dsa2
 pip install -r zrequirements.txt

Basic usage

python  titanic_classifier.py  preprocess    --nsample 1000
python  titanic_classifier.py  train         --nsample 2000
python  titanic_classifier.py  predict

Documentation

https://github.com/arita37/dsa2/issues?q=is%3Aissue+is%3Aopen+label%3Adocumentation

image

Tutorial

https://github.com/arita37/dsa2/issues?q=is%3Aissue+is%3Aopen+label%3ATutorial

image

How to train a new dataset ?

 https://github.com/arita37/dsa2/issues/200

Examples

 https://github.com/arita37/dsa2/tree/main/example

List of preprocessor

    #### Data Over/Under sampling 

    #### Category, Numerical
    
    #### Text        

    #### Target label encoding

    #### Time Series 


    https://github.com/arita37/dsa2/issues/194

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Jupyter Notebook 54.4%
  • Python 30.4%
  • HTML 15.1%
  • Other 0.1%