
In this case study, I perform exploratory analysis and build a model that classifies whether or not a patient has coronary heart disease (CHD).


relaxedplan/Framingham-Heart-Study

 
 


Framingham-Heart-Study


Domain : Healthcare

Reason for Choosing this Dataset

  • I chose this dataset out of personal interest in exploring different domains.
  • I want to investigate how machine learning can help the healthcare industry become data-driven, for example by predicting how likely a subject is to develop a particular disease, so that treatment or preventive measures can be considered for that individual.
  • However, these statistical models are not ready for use in a production environment.

Source (Download On Request) : https://biolincc.nhlbi.nih.gov/teaching/

Details :

The Framingham Heart Study (FHS) is a long-term prospective study of the etiology of cardiovascular disease among a population of free-living subjects in the community of Framingham, Massachusetts. It was a landmark study in epidemiology: it was the first prospective study of cardiovascular disease, and it identified the concept of risk factors and their joint effects (see the FHS Longitudinal Data Document).

The dataset is a rather small subset of the full FHS data, with 4240 observations and 16 variables. The variables are as follows:

  • sex: Sex of the participant, stored as a binary variable named “male” in the dataset.
  • age: Age at the time of the medical examination, in years.
  • education: A categorical variable for the participant's education, with the levels: some high school (1), high school/GED (2), some college/vocational school (3), college (4).
  • currentSmoker: Current cigarette smoking at the time of the examination.
  • cigsPerDay: Number of cigarettes smoked each day.
  • BPmeds: Use of anti-hypertensive medication at the exam.
  • prevalentStroke: Prevalent stroke (0 = free of disease).
  • prevalentHyp: Prevalent hypertension. Subject was defined as hypertensive if treated
  • diabetes: Diabetic according to criteria of first exam treated
  • totChol: Total cholesterol (mg/dL).
  • sysBP: Systolic blood pressure (mmHg).
  • diaBP: Diastolic blood pressure (mmHg).
  • BMI: Body mass index, weight (kg) / height (m)^2.
  • heartRate: Heart rate (beats/minute).
  • glucose: Blood glucose level (mg/dL).
  • TenYearCHD: The response variable, the 10-year risk of coronary heart disease (CHD).
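A quick sanity check before any exploration is to confirm that a downloaded file actually contains the 16 variables listed above and to count missing entries per column. The sketch below is a minimal example using only the Python standard library. It assumes the data is a CSV whose header uses the names exactly as written in this README (the real file may differ, for instance in capitalization) and that missing values appear as empty strings or `NA`. The sample rows are made up for illustration and are not FHS records.

```python
import csv
import io

# Column names as written in this README; "male" is the name of the sex variable.
# Assumption: the real file's header matches these names exactly.
EXPECTED_COLUMNS = [
    "male", "age", "education", "currentSmoker", "cigsPerDay", "BPmeds",
    "prevalentStroke", "prevalentHyp", "diabetes", "totChol", "sysBP",
    "diaBP", "BMI", "heartRate", "glucose", "TenYearCHD",
]


def summarize(csv_file, missing_tokens=("", "NA")):
    """Return (row_count, missing_columns, missing_value_counts) for an open CSV file."""
    reader = csv.DictReader(csv_file)
    header = reader.fieldnames or []
    missing_columns = [c for c in EXPECTED_COLUMNS if c not in header]
    present = [c for c in EXPECTED_COLUMNS if c in header]
    missing_counts = {c: 0 for c in present}
    rows = 0
    for row in reader:
        rows += 1
        for c in present:
            value = row[c]
            # DictReader yields None for fields absent from a short row.
            if value is None or value.strip() in missing_tokens:
                missing_counts[c] += 1
    return rows, missing_columns, missing_counts


# Two made-up rows (hypothetical values); the second has a missing glucose entry.
SAMPLE_TEXT = (
    ",".join(EXPECTED_COLUMNS) + "\n"
    "1,50,2,1,10,0,0,0,0,200,120,80,25.0,70,85,0\n"
    "0,60,1,0,0,0,0,1,0,240,150,95,30.5,75,NA,1\n"
)

rows, missing_columns, missing_counts = summarize(io.StringIO(SAMPLE_TEXT))
print(rows, missing_columns, missing_counts["glucose"])
```

For a real file, the same function can be called as `summarize(open(path, newline=""))`, where `path` is wherever the downloaded CSV was saved.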

Objective :

The goal is to build some predictive models on the FHS dataset and to review some exploratory and modelling techniques.
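This README does not say which models are used, so the following is only a sketch of one common baseline for a binary target such as TenYearCHD: logistic regression fitted by batch gradient descent. It is written in plain Python so it runs without extra packages. The two features, their coefficients, and the data are synthetic stand-ins invented for illustration; nothing here is taken from the repository's notebooks or from FHS data.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_proba(w, b, x):
    """Predicted probability of the positive class for one feature vector."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def fit_logistic(X, y, lr=0.1, epochs=500):
    """Fit weights and bias by batch gradient descent on the mean log-loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(w, b, xi) - yi  # gradient of log-loss w.r.t. the logit
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


# Synthetic data (hypothetical): two standardized features standing in for
# something like age and sysBP, with an invented positive effect on the outcome.
random.seed(0)
X, y = [], []
for _ in range(200):
    age_z = random.gauss(0, 1)
    sysbp_z = random.gauss(0, 1)
    p = sigmoid(1.5 * age_z + 1.0 * sysbp_z - 1.0)
    X.append([age_z, sysbp_z])
    y.append(1 if random.random() < p else 0)

w, b = fit_logistic(X, y)
accuracy = sum(
    (predict_proba(w, b, xi) >= 0.5) == (yi == 1) for xi, yi in zip(X, y)
) / len(y)
print("weights:", w, "bias:", b, "training accuracy:", accuracy)
```

Training accuracy on synthetic data says nothing about real-world performance. On the actual dataset, a held-out test set and metrics suited to an imbalanced outcome would be needed, and an established library implementation would normally replace this hand-written loop.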
