Skip to content

Used machine learning in R to assess individual variables involved in the intra-operative pathology consultation process.

Notifications You must be signed in to change notification settings

zacksaw/frozensection_MS_thesis

Repository files navigation

Using healthcare data and machine learning to enhance the frozen-section process @ NYU Langone

Gynecological oncology operations rely on intraoperative consultations to help determine the malignancy of suspicious tissue. Traditionally, these consultations are reliable, with a discrepancy from the final pathology report 3-8% of the time. Our study used statistical significance tests, such as t-tests and chi-square, to assess individual variables involved in the intra-operative consultation process. Further, machine learning predicative algorithms such as GLM, Naïve Bayes, and Random Forest were used to determine the most impactful factors. Combining all methods, our study found the most impactful factor in a discrepant diagnosis is the size of tumor tissue sent for testing, the surgeon, the type of tumor, and the frozen section pathologist’s specialty. These factors were the most significant within the study, however, they do not meet all thresholds for statistical significance. Due to a significant lack of discrepant samples, all machine learning algorithms could not be properly trained and therefore had accuracies near 50%. Further work using samples collected outside gynecological oncology or the NYU Langone facility are needed to improve the study’s outcome.

Machine learning code included

Final figures included

About

Used machine learning in R to assess individual variables involved in the intra-operative pathology consultation process.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published