ADULT dataset:
本次作業是需要從給定的個人資訊(如下表),預測此人的年收入是否大於50K。
共有32561筆訓練資料,16281筆測試資料
(8140 in private test set and 8141 in public test set)
https://inclass.kaggle.com/c/ml2017-hw2
https://drive.google.com/file/d/0B8Si647wj9ZoTE9uQzAwR0M5ZkU/view?usp=sharing
https://en.wikipedia.org/wiki/Cross_entropy
https://en.wikipedia.org/wiki/Logistic_regression
https://en.wikipedia.org/wiki/Multivariate_normal_distribution
https://en.wikipedia.org/wiki/Ensemble_learning
https://en.wikipedia.org/wiki/Gradient_boosting
https://www.cmlab.csie.ntu.edu.tw/~cyy/learning/tutorials/EnsembleLearning.pdf
https://docs.scipy.org/doc/numpy/reference/generated/numpy.exp.html
https://docs.scipy.org/doc/numpy/reference/generated/numpy.log.html
https://docs.scipy.org/doc/numpy/reference/generated/numpy.linalg.det.html
https://docs.scipy.org/doc/numpy/reference/generated/numpy.mean.html
https://docs.scipy.org/doc/numpy/reference/generated/numpy.average.html
https://docs.scipy.org/doc/numpy/reference/generated/numpy.clip.html
https://docs.google.com/presentation/d/1lGmk0U8N_qAwLW6-0EcODaPKI1EQnhjt7joal_sJ9SI/edit#slide=id.g1daca38085_0_37
https://codepad.org/RvIY9BU2
https://codepad.org/wlrT0lw4