Skip to content

Commit

Permalink
final_analysis
Browse files Browse the repository at this point in the history
  • Loading branch information
vanshilshah97 committed Jan 6, 2021
1 parent 39b9377 commit d6b534d
Showing 1 changed file with 11 additions and 4 deletions.
15 changes: 11 additions & 4 deletions zf_task_code_submit.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -868,7 +868,14 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"![age](https://raw.githubusercontent.com/vanshilshah97/pred_maint_zf/main/images/pivoting_on_age2.png)"
"![failure_model_ratio](https://raw.githubusercontent.com/vanshilshah97/pred_maint_zf/main/images/failure_model_ratio.png)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"#### We can see that model2 variant of the machine is prone to more errors, and this is further highlighted in the case when I find the feature coefficient in logistic regression it has the second most and positive correlation with the probabilty of a machine being unhealthy "
]
},
{
Expand Down Expand Up @@ -3215,7 +3222,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### This is the feature importance plot generated by the random forest. It marks age, one-hot encoding and rotation as some of the important features in capturing the anomalies. I believe this information could used in our future work while working in an unsupervised domain for reducing feature dimensionality"
"### This is the feature importance plot generated by the random forest. It marks age, one-hot encoding and rotation as some of the important features in capturing the anomalies. I believe this information could used in our future work while working in an unsupervised domain for reducing feature dimensionality"
]
},
{
Expand Down Expand Up @@ -3243,8 +3250,8 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### As we can finally see that random forest gives superior performance, we can furhter imporve the recall by doing a gridsearch on various hyperparameters of random forest . Having a comparable performance are boosting techniques, I think with proper fine tuning of the parameters of gradient boosting techniques we could also improve the recall significantly.\n",
"#### Hence these methods are the way to go"
"### As we can finally see that random forest gives superior performance, we can furhter imporve the recall by doing a gridsearch on various hyperparameters of random forest . Having a comparable performance are boosting techniques, I think with proper fine tuning of the parameters of gradient boosting techniques we could also improve the recall significantly.\n",
"### Hence these methods are the way to go"
]
},
{
Expand Down

0 comments on commit d6b534d

Please sign in to comment.