Simulation Paper on Statistical Learning

Author

Gabriel Jobert

Date

2023-11-05

Description

This repository contains the R Markdown file and associated resources for a simulation study. The paper focuses on employing a Monte Carlo simulation approach to evaluate the impact of missing data on the performance and stability of various statistical methods.

Objective

The primary objective of this study is to provide insights into how missing data can affect the outcome and reliability of different statistical learning techniques.

Methods and Libraries Used

R Programming: The analysis and simulations are conducted using R.
Libraries:
- MASS: For support with statistical functions.
- caret: For data splitting, pre-processing, feature selection, etc.
- cluster & mclust: For clustering analysis.
- infotheo: For information theory-based methods.
- RANN: For approximate nearest neighbor techniques.
- ggplot2: For creating elegant data visualisations.
- dplyr: For data manipulation.

File Structure

Simulation Paper.Rmd: The main R Markdown file containing the study, code, and documentation.
HTML version : The paper in plain word from the RMD file in html version (prefer reading this version than the pdf one).
PDF version : Paper in plain word in PDF version.

How to Use

Ensure you have R and the necessary libraries installed.
Clone this repository.
Open the SImulation Paper.Rmd file in an R environment like RStudio.
Run the code chunks sequentially to reproduce the analysis and view the results.

This README is a brief overview of the 'Simulation Paper on Statistical Learning'. For more detailed information, please refer to the Rmd file itself.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
Html version - Simulation Paper		Html version - Simulation Paper
PDF version - Simulation paper		PDF version - Simulation paper
README.md		README.md
Simulation-Paper.Rmd		Simulation-Paper.Rmd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simulation Paper on Statistical Learning

Author

Date

Description

Objective

Methods and Libraries Used

File Structure

How to Use

About

Releases

Packages

Contributors 2

GabrielJobert/Simulation_paper---Effect_of_missing_data_on_K-means_performance---MATH60603A_STATISTICAL_LEARNING

Folders and files

Latest commit

History

Repository files navigation

Simulation Paper on Statistical Learning

Author

Date

Description

Objective

Methods and Libraries Used

File Structure

How to Use

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages