
Disaster Response Message Classification for Figure Eight Dataset

Project Overview

Machine learning is critical to helping organizations understand which messages are relevant to them and which to prioritize. During a disaster, ML models can at least filter out the messages that matter, whereas basic methods such as keyword searches provide only trivial results.

In this project, I analyze thousands of real messages, provided by Figure Eight, that were sent during natural disasters either via social media or directly to disaster response organizations. An ETL pipeline processes the message and category data from .csv files and loads it into an SQLite database, from which the machine learning and NLP pipelines read to create and save a multi-output supervised learning model. The project also includes a Flask-based web app with Plotly dashboards: it extracts data from the database to provide visualizations and uses the trained model to classify new messages into 36 categories (a multi-label classification problem), so an emergency worker can input a new message and get classification results across several categories.
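As a rough sketch (not the exact configuration in train_classifier.py), the core of such a model can be expressed as a scikit-learn pipeline that turns raw messages into TF-IDF features and wraps a classifier so that it predicts all 36 labels at once:

```python
# Minimal sketch of a multi-output text-classification pipeline.
# The exact estimators and hyperparameters in train_classifier.py may differ.
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.multioutput import MultiOutputClassifier
from sklearn.ensemble import RandomForestClassifier

pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),  # raw messages -> TF-IDF features
    ("clf", MultiOutputClassifier(RandomForestClassifier())),  # one classifier per category
])

# X is an iterable of message strings; Y is an (n_samples, 36) binary label matrix.
# pipeline.fit(X, Y)
# pipeline.predict(["We need water and food"])
```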

Preparing the environment

Note: I developed this project on Linux. It can also be run on Windows and macOS with minor changes.

  1. Clone the repository, and navigate to the downloaded folder.
git clone https://github.com/iamirmasoud/disaster-response-app.git
cd disaster-response-app
  2. Create (and activate) a new environment named disaster_env with Python 3.7. If prompted to proceed with the install (Proceed [y]/n), type y.

    conda create -n disaster_env python=3.7
    source activate disaster_env

    At this point your command line should look something like: (disaster_env) <User>:disaster-response-app <user>$. The (disaster_env) indicates that your environment has been activated, and you can proceed with further package installations.

  3. Before you can experiment with the code, make sure you have all the libraries and dependencies required for this project. You will mainly need Python 3.7+, Flask, scikit-learn, and Plotly. You can install the dependencies using:

pip install -r requirements.txt
  4. Navigate back to the repo. (Your disaster_env environment should still be activated at this point.)
cd disaster-response-app
  5. Open the notebooks directory using the command below. You'll see all the project files appear in your local environment; open the first notebook and follow the instructions.
jupyter notebook
  6. Once you open any of the project notebooks, make sure you are in the correct disaster_env environment by clicking Kernel > Change Kernel > disaster_env.

Files Descriptions

The project contains the following files:

  • 1_ETL_Pipeline notebook: notebook experiments for the ETL pipeline.
  • 2_ML_Pipeline notebook: notebook experiments for creating and evaluating the machine learning and NLP pipelines.
  • process_data.py: the ETL pipeline used to process the data in preparation for model building (see the ETL sketch after this list).
  • train_classifier.py: the machine learning pipeline used to fit, tune, evaluate, and export the model to a Python pickle (the pickle is not uploaded to the repo due to GitHub's size constraints).
  • app directory: the Flask application folder. run.py starts the Python server for the web app and prepares the visualizations.
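For orientation, here is a minimal sketch of the ETL steps process_data.py performs. The table name messages and the related-1;request-0;... category string format are assumptions based on the Figure Eight data layout and may not match the script verbatim:

```python
# Minimal ETL sketch: merge the two CSVs, expand the category strings
# into 36 binary columns, and persist the result to SQLite.
import pandas as pd
from sqlalchemy import create_engine

messages = pd.read_csv("data/disaster_messages.csv")
categories = pd.read_csv("data/disaster_categories.csv")
df = messages.merge(categories, on="id")

# Category strings look like "related-1;request-0;..." in this dataset.
cats = df["categories"].str.split(";", expand=True)
cats.columns = [value.split("-")[0] for value in cats.iloc[0]]
for col in cats.columns:
    cats[col] = cats[col].str[-1].astype(int)  # keep only the trailing 0/1

df = pd.concat([df.drop(columns="categories"), cats], axis=1).drop_duplicates()

engine = create_engine("sqlite:///data/DisasterResponse.db")
df.to_sql("messages", engine, index=False, if_exists="replace")  # table name is an assumption
```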

Usage

  1. From the repository root, run the following commands to set up your database and model.
  • To run the ETL pipeline that cleans the data and stores it in an SQLite database:
python ./process_data.py data/disaster_messages.csv data/disaster_categories.csv data/DisasterResponse.db
  • To run the ML pipeline that trains the classifier and saves the model (a sketch of reusing the saved pickle follows these steps):
python ./train_classifier.py data/DisasterResponse.db models/classifier_me.pkl
  2. Run the following command in the app directory to start the web app: python run.py

  3. Go to http://127.0.0.1:3001 to check out the web app.
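If you want to reuse the exported model outside the web app, a minimal sketch along these lines should work. It assumes the pickle path from the training command above, and that any custom tokenizer defined in train_classifier.py is importable at unpickling time:

```python
# Minimal sketch of loading the exported model and classifying a message.
# If train_classifier.py defines a custom tokenize() function, it must be
# importable in this script for pickle.load to succeed.
import pickle

with open("models/classifier_me.pkl", "rb") as f:
    model = pickle.load(f)

predictions = model.predict(["We need water and food at Delma 75 Avenue"])
print(predictions)  # one row of 36 binary category predictions
```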

Webapp Screenshot

Results

The model achieved an average f1-score of 0.94 across all 36 message categories.
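Per-category precision/recall/f1 numbers can be produced with scikit-learn's classification_report; a minimal, self-contained sketch (with toy stand-ins for the real Y_test / Y_pred label matrices from the train/test split) looks like this:

```python
# Minimal evaluation sketch: precision/recall/f1 per category.
# Toy 3-category data stands in for the real (n_samples, 36) matrices.
import numpy as np
from sklearn.metrics import classification_report

category_names = ["related", "request", "offer"]  # 3 of the 36 categories
Y_test = np.array([[1, 0, 0], [1, 1, 0], [0, 0, 0]])
Y_pred = np.array([[1, 0, 0], [1, 0, 0], [0, 0, 0]])

for i, name in enumerate(category_names):
    print(f"--- {name} ---")
    print(classification_report(Y_test[:, i], Y_pred[:, i], zero_division=0))
```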

Examples

Here is an example of message classification:

Type "We have a lot of problems at Delma 75 Avenue Albert Jode, those people need water and food." into the text box and click Classify Message.


Note: This project is part of the Udacity Data Scientist Nanodegree program.
