This is my first Data Science Nanodegree program project
- categories.csv
- messages.csv
- ETL Pipeline Preparation.ipynb
- Disaster_Response_Database.db
- ML Pipeline Preparation.ipynb
- workspace folder
You need to install the following libraries,
- NLTK
- Numpy
- Pandas
- sklearn
- matplotlib
In the project, we will find a data set containing real messages that were sent during disaster events. I have created a machine learning pipeline to categorize these events so that you can send the messages to an appropriate disaster relief agency. This project includes a web app where an emergency worker can input a new message and get classification results in several categories. The web app will also display visualizations of the data.
Readme.md : description of the repository
messages.csv : contains the messaged id and respective messages.
categories.csv : contains the messaged id and categories of the messages.
ETL Pipeline Preparation.ipynb : This file Extract, Transform and load the cleaned data.
Disaster_Response_Database.db : This is the database where the cleaned data is stored.
ML Pipeline Preparation.ipynb : In this file we develop and test different machine learning algorithm for classifying those messages into the multi label categories.
workspace folder : It contains the web app version of the code