-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'main' of github.com:fensorechase/AHRQ_pipeline
- Loading branch information
Showing
1 changed file
with
33 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,33 @@ | ||
# AHRQ_pipeline | ||
# AHRQ_pipeline | ||
|
||
## Key Steps | ||
Load Libraries: | ||
- caret | ||
- data.table | ||
- dplyr | ||
- sets | ||
- scales | ||
- tidyr | ||
- stringr | ||
|
||
## Summary of functionality: | ||
|
||
### Load Data: | ||
- Load patient data from a CSV file. (select your desired geographic level & years from AHRQ SDOHD). | ||
- Load years of AHRQ SDOH data from CSV files. | ||
- Load a CSV file containing feature names for AHRQ SDOH variables. | ||
|
||
### Preprocess AHRQ Data: | ||
- Merge AHRQ data from multiple years into a single data frame. | ||
- Pad ZIP codes and STATEFIPS codes with leading zeros for consistency. | ||
- Optionally perform imputation for missing values. | ||
|
||
### Merge Data: | ||
Merge the preprocessed AHRQ data with patient data using crosswalk variables: | ||
STATEFIPS | ||
ZIPCODE | ||
YEAR | ||
|
||
## The AHRQ SDOHD data used in this notebook can be accessed from the following link: | ||
- AHRQ SDOHD: https://www.ahrq.gov/sdoh/data-analytics/sdoh-data.html#download | ||
|