Grab AI SEA: Safety Challenge

This repository is the submission for the AI challenge for S.E.A hosted by Grab. The selected challenge is Safety. The challenge can be found via this website.

Author

by: Satsawat Natakarnkitkul (Net)

Email: [email protected]

Country: Thailand

Motivation: This challenge is very interesting in so many ways, but as I use Grab nearly everyday. Hence this challenge is the most impact to the Grab users.

Repository structure

Notebook

The notebook Grab AI Challenge_Safety_Data Exploration.ipynb is mainly used as part of data understanding and EDA for sensor data provided by Grab. You may not run this notebook, but it will provide some understandings and explanation onto telemetry data of the sensor world.
The notebook Grab AI Challenge_ML model comparison.ipynb is purposely created to train and test ML techniques to produce the final model as well as try on feature engineering and other data transformation.
The notebook Grab AI Challenge_GridSearchCV and Feature Engineering.ipynb shows the grid search for XGBoost algorithm; with the current model and feature engineering, it achieved the AUC score of 0.5945.

Model

This folder contains the final model object to be used for prediction.

Code

This folder contains the final python source code for manipulating, creating new features and predicting the data set.

Img

This folder contains the image embedded onto EDA and other notebooks.

Output

This folder contains the bookingID, predicted class, and probability of the prediction, this is the outcome from running the py script.

Run Instruction

The model is used to predict the safety of the trip as such the assumption is that this is not the real time prediction (online), but rather an offline (data for each booking ID is available). The transformation is the aggregation of each booking ID onto single observations and feed into the model for prediction.

To run the prediction, please use Safety_Prediction.py in the code directory.

The script in code folder will read in the feature data file within data/safety/features folder.
- If there's any change in the data path, please adjust the DATA_DIR onto the correct folder respectively.
The script will automatically run the feature transformation and engineering.
The script will load the XGBoost model object from model directory to make a prediction.
The script will save the prediction with bookingID onto ../output/all_prediction.csv file.
If LABEL_IND = True in the script, it will attempt to run evaluation between the prediction with true label.
- The true label file should be in the data/safety/labels folder with the proper bookingID and label columns.
- If there's any change to labels folder, please adjust this to LABEL_DIR in the script respectively.
- If the evaluation is not neeeded, you can turn this off by setting LABEL_IND = False in the script.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Grab AI SEA: Safety Challenge

Author

Repository structure

Notebook

Model

Code

Img

Output

Run Instruction

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
code		code
data		data
img		img
model		model
output		output
Grab AI Challenge_GridSearchCV and Feature Engineering.ipynb		Grab AI Challenge_GridSearchCV and Feature Engineering.ipynb
Grab AI Challenge_ML model comparison.ipynb		Grab AI Challenge_ML model comparison.ipynb
Grab AI Challenge_Safety_Data Exploration.ipynb		Grab AI Challenge_Safety_Data Exploration.ipynb
README.md		README.md

netsatsawat/Grab-AI-SEA

Folders and files

Latest commit

History

Repository files navigation

Grab AI SEA: Safety Challenge

Author

Repository structure

Notebook

Model

Code

Img

Output

Run Instruction

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages