- Cambridge, MA
Highlights
- Pro
Starred repositories
AsynchroNous Disk-based Representation of MassivE DAta: An R package aimed at replacing ff for storing large data objects.
A pipe friendly way to interact with an OMOP Common Data Model
Code for reproducing examples in the book by Daniels, Linero, and Roy.
R package to compute and plot predictions, slopes, marginal means, and comparisons (contrasts, risk ratios, odds, etc.) for over 100 classes of statistical and ML models. Conduct linear and non-lin…
An extension of XGBoost to probabilistic modelling
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
Migrate to PostgreSQL in a single command!
Larger-Than-Memory Data Workflows with Apache Arrow
Secure collaborative training and inference for XGBoost.
Federated gradient boosted decision tree learning
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
A topic-centric list of HQ open datasets.
This package provides functions for computing One-Sided Dynamic Principal Components, a novel multivariate time series dimension reduction technique proposed in Peña, Smucler and Yohai (2019) (http…
A fast and flexible framework for data reduction in R
OpenRefine is a free, open source power tool for working with messy data and improving it
Web Extension for saving a faithful copy of a complete web page in a single HTML file
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
Join tables based on events occurring in sequence in a funnel.
The gfoRmula package implements the parametric g-formula in R. The parametric g-formula (Robins, 1986) uses longitudinal data with time-varying treatments and confounders to estimate the risk or me…
Comprehensive bindings and command line utility for the Pushover notification service
Stan-code for Markov-switching vector autoregressive models
A Python package for building Bayesian models with TensorFlow or PyTorch
Research in investment finance with Python Notebooks
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
Bayesian Data Analysis course at Aalto
A repository of data on coronavirus cases and deaths in the U.S.