Exploratory Data Analysis and Classification on the ILPD
This data set has been obtained from the online machine learning repository of the University of California, Irvine.
Link to data set:
https://archive.ics.uci.edu/ml/datasets/ILPD+(Indian+Liver+Patient+Dataset)
Data Set Information:
This data set contains 416 liver patient records and 167 non liver patient records.The data set was collected from north east of Andhra Pradesh, India. Selector is a class label used to divide into groups(liver patient or not). This data set contains 441 male patient records and 142 female patient records.
Any patient whose age exceeded 89 is listed as being of age "90".
Attribute Information:
- Age Age of the patient
- Gender Gender of the patient
- TB Total Bilirubin
- DB Direct Bilirubin
- Alkphos Alkaline Phosphotase
- Sgpt Alamine Aminotransferase
- Sgot Aspartate Aminotransferase
- TP Total Protiens
- ALB Albumin
- A/G Ratio Albumin and Globulin Ratio
- Selector field used to split the data into two sets (labeled by the experts)
The primary goal is to visualize the data and establish trends or important characteristics, if any. The next important objective is to create a classifier that can predict the class of the patient by the virtue of its input.