Scripts

All of the code used in this work can be found in this directory as MATLAB (.m) files, R (.R) files, or Jupyter notebooks (.ipynb). Moreover, generalised functions have been saved in the ./functions sub-directory, and the .py scripts used to record accelerometry data at the bedside are available in the ./accel_recording_scripts sub-directory.

In this .m script, we iterate through the compiled triaxial accelerometry recordings of each patient, filter each axis with a high-pass (f_c = 0.2 Hz) 4th-order Butterworth filter, and extract 7 different motion features from non-overlapping 5-second windows. Outputs are saved as .csv feature tables. This script also plots short examples of the accelerometry processing pipeline for Figure 1.
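The filtering step itself is implemented in MATLAB; a minimal R sketch of the same idea (using the signal package, with an assumed 10 Hz sampling rate and SMA as the one illustrated feature) looks roughly like this:

```r
library(signal)

fs <- 10                                                  # assumed sampling frequency (Hz)
bf <- butter(n = 4, W = 0.2 / (fs / 2), type = "high")    # 4th-order high-pass, f_c = 0.2 Hz

# Toy triaxial recording: gravity offset on x plus sensor noise
acc  <- data.frame(x = 1 + rnorm(3000, sd = 0.05),
                   y = rnorm(3000, sd = 0.05),
                   z = rnorm(3000, sd = 0.05))
filt <- as.data.frame(lapply(acc, function(a) filtfilt(bf, a)))

# One example feature (signal magnitude area, SMA) per non-overlapping 5-second window
win <- 5 * fs
idx <- split(seq_len(nrow(filt)), ceiling(seq_len(nrow(filt)) / win))
sma <- sapply(idx, function(i) mean(abs(filt$x[i]) + abs(filt$y[i]) + abs(filt$z[i])))
```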

In this .R script, we apply our multiple missing-feature imputation algorithm. In the event of totally missing recordings (n = 10/483), we impute upper extremity recordings with linear regression from ipsilateral upper extremity sensors, lower extremity recordings with linear regression from contralateral upper extremity sensors, and bed sensor recordings by sampling with replacement from the total distribution of bed sensor values. The large majority of remaining missing values are then imputed with multiple, normalized time-series imputation using the Amelia II package. We create 9 imputations, each stored in a separate .csv file.
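A hedged sketch of the Amelia II step (the toy data frame and its column names are placeholders, not the study's actual schema):

```r
library(Amelia)

# Toy feature table: 5 patients x 20 windows, with injected missingness
feats <- data.frame(PatientID = rep(1:5, each = 20),
                    WindowIdx = rep(1:20, 5),
                    SMA = rnorm(100), FreqEntropy = rnorm(100))
feats$SMA[sample(100, 10)] <- NA

am <- amelia(feats, m = 9,                   # 9 imputations, as described above
             ts = "WindowIdx",               # placeholder time index
             cs = "PatientID")               # placeholder cross-section (patient) ID

# Save each imputation to its own .csv file
for (i in seq_len(am$m))
  write.csv(am$imputations[[i]], sprintf("imputation_%02d.csv", i), row.names = FALSE)
```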

In this .R script, we correct for gross external movements by adjusting for the motion features calculated from the sensor placed at the foot of each patient's bed. Based on a literature-sourced SMA threshold for human dynamic activity, we define distributions for each feature corresponding to static activity, and correct feature values from extremity sensors accordingly. As a result, we have 9 bed-corrected, imputed feature sets, each stored in a separate .csv file.
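A simplified illustration of the correction rule (the threshold value, function name, and arguments are all assumptions, not the study's exact parameters): when the bed sensor indicates dynamic activity, the affected extremity windows are treated as gross external movement and redrawn from the static-activity distribution.

```r
sma_threshold <- 0.135   # placeholder for the literature-sourced SMA threshold

correct_external_motion <- function(extremity_feat, bed_sma, static_pool) {
  external <- bed_sma > sma_threshold                 # bed movement implies external motion
  extremity_feat[external] <- sample(static_pool,     # redraw from static-activity values
                                     sum(external), replace = TRUE)
  extremity_feat
}
```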

In this .R file, we create repeated cross-validation partitions (5 repeats of 5-fold CV) for each tested observation window based on its available GCS observations. We principally use the createMultiFolds function from the caret package to stratify folds by outcome labels. Folds for each observation window are stored in a .csv file in a newly created directory.
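The core caret call is straightforward; a minimal sketch with placeholder labels and seed:

```r
library(caret)

set.seed(2020)                                          # assumed seed
labels <- factor(sample(c("GCSm<=4", "GCSm>4"), 100, replace = TRUE))
folds  <- createMultiFolds(labels, k = 5, times = 5)    # 5 repeats of 5-fold CV, stratified by label

str(folds[["Fold1.Rep1"]])                              # training indices of one resample
```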

In this .R file, we train Linear Optimal Low Rank Projections (LOL) on model training sets and reduce both training and validation sets to low-dimensional spaces prior to model training. Prior to LOL, we normalize feature spaces according to the distribution of each feature type and sensor combination. This enables us to use LOL coefficients to compare the significance of feature types and sensors.
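A rough sketch with the lolR package (normalization is reduced here to simple z-scoring, and the dimensions are arbitrary):

```r
library(lolR)

X <- scale(matrix(rnorm(200 * 50), nrow = 200))   # 200 windows x 50 features, z-scored
Y <- sample(0:1, 200, replace = TRUE)             # binary outcome labels

proj  <- lol.project.lol(X, Y, r = 10)            # learn a rank-10 projection on training data
X_low <- X %*% proj$A                             # embed training (and later validation) sets
```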

In this .R file, we train and validate logistic regression models (GLM) for threshold-level GCSm detection, threshold-level GOSE prediction at discharge, and threshold-level GOSE prediction at 12 months. We train and evaluate models of varying observation windows and target dimensionalities. In this script, we also calculate our feature significance scores. Each score is equivalent to the absolute LOL coefficient weighted by the trained linear coefficients of the corresponding logistic regression model.
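Continuing the previous sketch, the significance score for each original feature could be computed as the absolute LOL loadings weighted by the absolute GLM coefficients (an interpretation of the description above, not the verbatim implementation):

```r
train_df <- data.frame(y = Y, X_low)             # X_low, Y, proj from the LOL sketch
fit <- glm(y ~ ., data = train_df, family = binomial)

beta     <- coef(fit)[-1]                        # drop the intercept
feat_sig <- abs(proj$A) %*% abs(beta)            # one significance score per original feature
```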

In this .ipynb notebook, we calculate AUCs, ROC curves, and classification metrics for each observation window based on the validation set predictions returned by our models. We use bias-corrected bootstrapping for repeated cross-validation (Repeated BBC-CV) to calculate 95% confidence intervals for the metrics and the ROC curve. This script is programmed to perform bootstrapping in parallel on 10 cores.
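The original step lives in a notebook; as a rough R analogue, a plain (not bias-corrected) bootstrap of the AUC over pooled validation predictions, parallelized on 10 cores, might look like this:

```r
library(parallel)

auc <- function(y, p) {                     # rank-based (Mann-Whitney) AUC
  r <- rank(p); n1 <- sum(y == 1); n0 <- sum(y == 0)
  (sum(r[y == 1]) - n1 * (n1 + 1) / 2) / (n1 * n0)
}

set.seed(1)
y <- rbinom(500, 1, 0.4)                    # toy validation labels
p <- plogis(2 * y + rnorm(500))             # toy validation predictions

boots <- mclapply(1:1000, function(b) {     # mclapply forks; Unix-alikes only
  i <- sample(length(y), replace = TRUE)
  auc(y[i], p[i])
}, mc.cores = 10)

quantile(unlist(boots), c(0.025, 0.975))    # 95% confidence interval
```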

In this .R file, we calculate probability calibration curves and associated calibration metrics for each observation window based on the validation set predictions returned by our models. We use bias-corrected bootstrapping for repeated cross-validation (Repeated BBC-CV) to calculate 95% confidence intervals for the metrics and the calibration curve. This script is programmed to perform bootstrapping in parallel on 10 cores.
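One common way to summarize calibration from validation-set predictions, shown here only as an assumed example, is logistic recalibration, where the calibration slope is the coefficient on the logit of the predicted probability:

```r
set.seed(7)
y <- rbinom(500, 1, 0.4)                     # toy observed outcomes
p <- plogis(qlogis(0.4) + 1.5 * (y - 0.4) + rnorm(500, sd = 0.5))  # toy predictions
p <- pmin(pmax(p, 1e-6), 1 - 1e-6)           # guard the logit against 0/1

recal <- glm(y ~ qlogis(p), family = binomial)
coef(recal)[2]                               # calibration slope (1 = ideally calibrated)
```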

In this .R file, we retrospectively examine predictions of Pr(GCSm > 4) in patients (n = 6) who experienced neurological transitions across this threshold to visually assess the potential clinical utility of the accelerometry-based system. For each of the 6 patients, we train optimally discriminating detection models (one with a shorter observation window of 27 minutes and one with a longer observation window of 6 hours) on the remaining patient set and validate predictions on the held-out case-study patient over a large set of continuously overlapping observation windows. We bootstrap across imputations to produce 95% confidence intervals that account for imputation-related variation in the predictions. Then, we prepare the probability trajectories for plotting.
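A hedged sketch of the imputation-aware interval: for each time point, resample the 9 per-imputation predictions with replacement and take bootstrap quantiles of their mean ('pred_mat' is a placeholder object, not the study's actual data structure):

```r
pred_mat <- matrix(runif(240 * 9), ncol = 9)   # toy Pr(GCSm > 4): time points x 9 imputations

boot_ci <- t(apply(pred_mat, 1, function(p_t) {
  draws <- replicate(1000, mean(sample(p_t, replace = TRUE)))
  quantile(draws, c(0.025, 0.975))             # 95% CI per time point
}))
```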

In this .R file, we construct manuscript tables and perform miscellaneous statistical analyses for different parts (including figures) of the manuscript and supplementary materials. In addition to the classification metrics calculated in script no. 7, we also calculate classification accuracy with repeated BBC-CV in this script.

In this .R file, we produce the figures for the manuscript and the supplementary figures. The large majority of the quantitative figures in the manuscript are produced using the ggplot2 package.