Python 3.5.2 or newer is necessary
First steps:
- Download archive from GitHub
- Extract archive
- Download booking data from "https://data.deutschebahn.com/dataset/data-call-a-bike/resource/0fcce4dd-7fc6-43f8-a59c-983a7945f8ba" {4,01 GB}
- Extract archive "OPENDATA_BOOKING_CALL_A_BIKE.csv", rename to route_data.csv and save in extracted bikes archive under subfolder "data"
- Download train_test_data.csv from "https://drive.google.com/open?id=1MXj7RgAU_UGhLzo8brt3dzboRSAmmdA3" {52 MB} and save in extracted bikes archive under subfolder "data" (Note: this can also be created by running the preprocess.py again)
- Open a terminal and navigate to the extracted archive with "cd PATH/TO/FOLDER"
- Install required dependencies with "pip install -r requirements.txt"
- Run preprocess.py in an adequate run-time environment
For using the interface:
- Execute first steps
- Run interface.py
- Enlarge window
- Feed values to your hearts content
For testing/training the model:
- Execute first stepts
- In the terminal, type "python model.py" for testing accuracy or "python model.py train" to train the model (if you just want to test the model you can also just run the model.py script directly)
- See console for prediction accuracy of Support Vector Machine or wait until a new training model is created
(Note: This program runs with a pretrained model, which is used for the interface.py for demonstration purposes and to get the accuracy on the test set. It is not necessary to train the model again, but feel free to do so. It may take several days to train on any regular laptop.)