Maintenance and setup roadmap for this Data Science / ML repo.
The goal is to unify Data Science and Machine Learning.
The basic idea is to use one single dictionary/JSON for the
model, compute, and data definitions,
which makes them easy to define, easy to track, and easy to modify.
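
To make the single-dictionary idea concrete, here is a minimal sketch. The key names (`model`, `compute`, `data`, and everything nested under them) and the `update_config` helper are hypothetical, chosen only for illustration; they are not the schema this repo actually uses.

```python
import json

# Hypothetical layout: every key name below is an illustrative assumption,
# not the repo's actual configuration schema.
config = {
    "model": {"name": "LogisticRegression", "params": {"C": 1.0, "max_iter": 200}},
    "compute": {"n_jobs": 1, "seed": 42},
    "data": {
        "target": "Survived",
        "numeric_cols": ["Age", "Fare"],
        "categorical_cols": ["Sex", "Embarked"],
        "n_sample": 1000,
    },
}


def update_config(cfg, section, **changes):
    """Return a copy of cfg with one section updated; the original is untouched."""
    new_cfg = json.loads(json.dumps(cfg))  # deep copy via a JSON round-trip
    new_cfg[section].update(changes)
    return new_cfg


# Easy to modify: change one value without rewriting the whole experiment.
config_v2 = update_config(config, "data", n_sample=2000)

# Easy to track: a stable, sorted JSON string can be committed or diffed.
serialized = json.dumps(config_v2, indent=2, sort_keys=True)
```

Because the whole experiment lives in one JSON-serializable structure, two runs can be compared by diffing their serialized configs.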
git clone
cd dsa2
pip install -r zrequirements.txt
python titanic_classifier.py preprocess --nsample 1000
python titanic_classifier.py train --nsample 2000
python titanic_classifier.py predict
https://github.com/arita37/dsa2/issues?q=is%3Aissue+is%3Aopen+label%3Adocumentation
https://github.com/arita37/dsa2/issues?q=is%3Aissue+is%3Aopen+label%3ATutorial
https://github.com/arita37/dsa2/issues/200
https://github.com/arita37/dsa2/tree/main/example
#### Data Over/Under sampling
#### Category, Numerical
#### Text
#### Target label encoding
#### Time Series
https://github.com/arita37/dsa2/issues/194