ML Pipeline for Short-Term Rental Prices in NYC

This contains an ML training pipeline which can be used to train models as and when new data arrives. This uses W&B for keeping a tab on artifacts and MLFlow to orchestrate the different components of the pipeline.

Running the entire pipeline or just a selection of steps

In order to run the pipeline when you are developing, you need to be in the root of the starter kit, then you can execute as usual:

>  mlflow run .

This will run the entire pipeline.

When developing it is useful to be able to run one step at the time. Say you want to run only the download step. The main.py is written so that the steps are defined at the top of the file, in the _steps list, and can be selected by using the steps parameter on the command line:

> mlflow run . -P steps=download

If you want to run the download and the basic_cleaning steps, you can similarly do:

> mlflow run . -P steps=download,basic_cleaning

You can override any other parameter in the configuration file using the Hydra syntax, by providing it as a hydra_options parameter. For example, say that we want to set the parameter modeling -> random_forest -> n_estimators to 10 and etl->min_price to 50:

> mlflow run . \
  -P steps=download,basic_cleaning \
  -P hydra_options="modeling.random_forest.n_estimators=10 etl.min_price=50"

Run from github

mlflow run https://github.com/Gunnvant/modelling_pipeline.git \
             -v 1.0.1 \
             -P hydra_options="etl.sample='sample2.csv'"

W&B Project Link

https://wandb.ai/gunnvant/nyc_airbnb?workspace=user-gunnvant

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
components		components
src		src
.gitignore		.gitignore
MLproject		MLproject
README.md		README.md
conda.yml		conda.yml
config.yaml		config.yaml
environment.yml		environment.yml
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML Pipeline for Short-Term Rental Prices in NYC

Running the entire pipeline or just a selection of steps

Run from github

W&B Project Link

About

Releases 3

Packages

Languages

Gunnvant/modelling_pipeline

Folders and files

Latest commit

History

Repository files navigation

ML Pipeline for Short-Term Rental Prices in NYC

Running the entire pipeline or just a selection of steps

Run from github

W&B Project Link

About

Resources

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages