The R package “NeEDS4BigData” provides approaches to implement subsampling methods to analyse big data.
New Experimental Design based Subsampling methods for Big Data.
## Installing the package from GitHub
devtools::install_github("Amalan-ConStat/NeEDS4BigData")
## Installing the package from CRAN
install.packages("NeEDS4BigData")
- A- and L-optimality based subsampling for GLMs.
- A-optimality based subsampling for Gaussian Linear Models.
- Leverage sampling for GLMs.
- Local case control sampling for logistic regression.
- A-optimality based subsampling under measurement constraints for GLMs.
- Model robust subsampling method for GLMs.
- Subsampling method for GLMs when the model is potentially misspecified.
These seven methods are described in the following articles
- Introduction - explains the need for subsampling methods.
- Linear Regression - Basic sampling.
- Linear Regression - Model robust and misspecification.
- Logistic Regression - Basic sampling.
- Logistic Regression - Model robust and misspecification.
- Poisson Regression - Basic sampling.
- Poisson Regression - Model robust and misspecification.
For