Following a disaster, responsible agencies are flooded with messages, sent directly or via social media, at exactly the time when disaster response organizations are least equipped to sift through them and prioritize the most crucial ones. It is common for only one message in a thousand to be relevant to disaster response professionals. In such situations, different organizations typically handle specific aspects of the problem: one focuses on providing clean water, another on clearing blocked roads, and yet another on ensuring the availability of medical supplies.
The aim of this project is to build a Natural Language Processing (NLP) model that categorizes messages in real time. The dataset contains pre-labelled tweets and messages from real-life disaster events.
This project is divided into the following key sections:
- Data processing: an ETL pipeline that extracts data from the source, cleans it, and saves it to a SQLite database
- A machine learning pipeline that trains a model to classify text messages into various categories
- A web app that shows model results in real time
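At the heart of the classification step is turning raw message text into word tokens. A minimal sketch of such a tokenizer, assuming a simple lowercase-and-strip-punctuation normalization (the actual pipeline uses NLTK, which additionally lemmatizes tokens):

```python
import re

def tokenize(text):
    """Normalize a message into word tokens: lowercase, replace
    punctuation with spaces, and split on whitespace. A simplified
    stand-in for the NLTK-based tokenizer in train_classifier.py."""
    text = re.sub(r"[^a-z0-9]", " ", text.lower())
    return text.split()
```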
The file structure is arranged as below:
- README.md: this read-me file
- requirements.txt: list of dependencies
- workspace
  - app
    - run.py: Flask file that runs the app
    - templates
      - master.html: main page of the web application
      - go.html: results page
  - data
    - disaster_categories.csv: categories dataset
    - disaster_messages.csv: messages dataset
    - process_data.py: ETL process
  - models
    - train_classifier.py: ML & NLP pipeline code
- Python 3.6+
- Machine Learning Libraries: NumPy, SciPy, Pandas, scikit-learn
- Natural Language Processing Libraries: NLTK
- SQLite Database Libraries: SQLAlchemy
- Web App and Data Visualization: Flask, Plotly
- Clone the git repository:
git clone https://github.com/eljandoubi/DisasterResponsePipeline.git
- Change directory:
cd DisasterResponsePipeline
- Create a conda environment:
conda create -n DisasterResponsePipeline python=3.6
- Activate the environment:
conda activate DisasterResponsePipeline
- Install the dependencies:
pip install -r requirements.txt
You can run the following commands in the project's directory to set up the database, then train and save the model.
- To run the ETL pipeline, which cleans the data and stores it in the database:
python data/process_data.py data/disaster_messages.csv data/disaster_categories.csv data/disaster_response_db.db
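The core of this ETL step can be sketched as follows. This is a minimal, hedged illustration, not the exact contents of process_data.py: the category string format (`"water-1;roads-0"`), the `clean_categories`/`run_etl` helper names, and the `messages` table name are assumptions for the example.

```python
import pandas as pd
from sqlalchemy import create_engine

def clean_categories(categories):
    """Split a semicolon-separated categories column (assumed format
    'water-1;roads-0;...') into one binary integer column per category."""
    split = categories.str.split(";", expand=True)
    # Column names come from the text before the trailing dash.
    split.columns = split.iloc[0].str.rsplit("-", n=1).str[0]
    # Values are the digit after the dash, cast to int and clipped to {0, 1}.
    return split.apply(lambda col: col.str.rsplit("-", n=1).str[1].astype(int).clip(0, 1))

def run_etl(messages, categories, db_path):
    """Merge messages with categories, clean, de-duplicate, and save to SQLite."""
    df = messages.merge(categories, on="id")
    df = pd.concat([df.drop(columns="categories"), clean_categories(df["categories"])], axis=1)
    df = df.drop_duplicates()
    engine = create_engine("sqlite:///" + db_path)
    df.to_sql("messages", engine, index=False, if_exists="replace")
    return df
```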
- To run the ML pipeline, which loads the data from the database, trains the classifier, and saves it as a pickle file:
python models/train_classifier.py data/disaster_response_db.db models/classifier.pkl
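A minimal sketch of the kind of scikit-learn pipeline this step builds: TF-IDF features feeding a multi-output classifier, one binary output per disaster category, then pickled to disk. The estimator choice (random forest) and the `build_model`/`train_and_save` names are assumptions for the example; the actual train_classifier.py may use a different estimator and a custom NLTK tokenizer.

```python
import pickle
from sklearn.pipeline import Pipeline
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.multioutput import MultiOutputClassifier
from sklearn.ensemble import RandomForestClassifier

def build_model():
    """TF-IDF features into a multi-output random forest: one binary
    prediction per disaster category."""
    return Pipeline([
        ("tfidf", TfidfVectorizer(stop_words="english")),
        ("clf", MultiOutputClassifier(RandomForestClassifier(n_estimators=50, random_state=0))),
    ])

def train_and_save(X, Y, model_path):
    """Fit the pipeline on messages X and label matrix Y, then pickle it."""
    model = build_model()
    model.fit(X, Y)
    with open(model_path, "wb") as f:
        pickle.dump(model, f)
    return model
```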
Run the following command in the app's directory to launch the web app:
python app/run.py
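The shape of the app can be sketched as a small Flask application that loads the pickled model and classifies a user's query. This is a simplified illustration: the `create_app` factory and the JSON response are assumptions for the example, while the real run.py renders master.html and go.html and adds Plotly visualizations.

```python
from flask import Flask, jsonify, request

def create_app(model):
    """Wrap a trained classifier (anything with a .predict method) in a
    minimal Flask app exposing a classification endpoint."""
    app = Flask(__name__)

    @app.route("/go")
    def go():
        # Classify the user's query and return one label per category.
        query = request.args.get("query", "")
        labels = model.predict([query])[0]
        return jsonify({"query": query, "labels": [int(x) for x in labels]})

    return app
```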