Foreshadow: Simple Machine Learning Scaffolding

Foreshadow is an automatic pipeline generation tool that makes creating, iterating, and evaluating machine learning pipelines a fast and intuitive experience allowing data scientists to spend more time on data science and less time on code.

Key Features

Scikit-Learn compatible
Automatic column intent inference
- Numerical
- Categorical
- Text
- Droppable (All values in a column are either the same or different)
Allow user override on column intent and transformation functions
Automatic feature preprocessing depending on the column intent type
- Numerical: imputation followed by scaling
- Categorical: a variety of categorical encoding
- Text: TFIDF followed by SVD
Automatic model selection
Rapid pipeline development / iteration

Features in the road map

Automatic feature engineering
Automatic parameter optimization

Foreshadow supports python 3.6+

Installing Foreshadow

$ pip install foreshadow

Read the documentation to set up the project from source.

Getting Started

To get started with foreshadow, install the package using pip install. This will also install the dependencies. Now create a simple python script that uses all the defaults with Foreshadow.

First import foreshadow

from foreshadow.foreshadow import Foreshadow
from foreshadow.estimators import AutoEstimator
from foreshadow.utils import ProblemType

Also import sklearn, pandas, and numpy for the demo

import pandas as pd

from sklearn.datasets import boston_housing
from sklearn.model_selection import train_test_split

Now load in the boston housing dataset from sklearn into pandas dataframes. This is a common dataset for testing machine learning models and comes built in to scikit-learn.

boston = load_boston()
bostonX_df = pd.DataFrame(boston.data, columns=boston.feature_names)
bostony_df = pd.DataFrame(boston.target, columns=['target'])

Next, exactly as if working with an sklearn estimator, perform a train test split on the data and pass the train data into the fit function of a new Foreshadow object

X_train, X_test, y_train, y_test = train_test_split(bostonX_df,
   bostony_df, test_size=0.2)

problem_type = ProblemType.REGRESSION

estimator = AutoEstimator(
    problem_type=problem_type,
    auto="tpot",
    estimator_kwargs={"max_time_mins": 1},
)
shadow = Foreshadow(estimator=estimator, problem_type=problem_type)
shadow.fit(X_train, y_train)

Now fs is a fit Foreshadow object for which all feature engineering has been performed and the estimator has been trained and optimized. It is now possible to utilize this exactly as a fit sklearn estimator to make predictions.

shadow.score(X_test, y_test)

Great, you now have a working Foreshaow installation! Keep reading to learn how to export, modify and construct pipelines of your own.

Tutorial

We also have a jupyter notebook tutorial to go through more details under the examples folder.

Documentation

Read the docs!

Name		Name	Last commit message	Last commit date
Latest commit History 350 Commits
.github		.github
bin		bin
ci		ci
doc		doc
examples		examples
foreshadow		foreshadow
newsfragments		newsfragments
scripts		scripts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.rst		README.rst
azure-pipelines.yml		azure-pipelines.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
readthedocs.yml		readthedocs.yml
setup.cfg		setup.cfg
setup_env.sh		setup_env.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Foreshadow: Simple Machine Learning Scaffolding

Key Features

Features in the road map

Installing Foreshadow

Getting Started

Tutorial

Documentation

About

Releases 4

Packages

Contributors 7

Languages

License

georgian-io-archive/foreshadow

Folders and files

Latest commit

History

Repository files navigation

Foreshadow: Simple Machine Learning Scaffolding

Key Features

Features in the road map

Installing Foreshadow

Getting Started

Tutorial

Documentation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 4

Packages 0

Contributors 7

Languages

Packages