Emergency Situation Awareness

Dataset

Architecture

Classification for Impact Assessment

we built statistical classifiers that automatically identify tweets containing information about the infrastructure status, where the infrastructure includes assets such as roads, bridges, railways, hospitals, airports, commercial and residential buildings, water, electricity, gas, and sewerage supplies.

We experimented with two machine learning methods for tweet classification, naive Bayes and support vector machines (SVM). To extract useful features, we preprocessed the dataset by removing a list of stop words and tokenizing the tweets. We then constructed lexical features and Twitter-specific features for classification. These features include

word unigrams;
word bigrams;
word length;
the number of hashtags “#” contained in a tweet;
the number of user mentions, “@username”;
whether a tweet is retweeted;
whether a tweet is replied to by other users.

After feature extraction, we performed experiments using a 10-fold cross-validation over our training data.

Results

Keras

540k tweets: Crisis nlp pre-trained

Dev set: 0.98

Number of crisis tweets: 32000
Number of non-crisis tweets: 32000

Classification report : 
               precision    recall  f1-score   support

      normal       0.93      0.98      0.95     32000
      crisis       0.98      0.92      0.95     32000

    accuracy                           0.95     64000
   macro avg       0.95      0.95      0.95     64000
weighted avg       0.95      0.95      0.95     64000

1759077 tweets: Crisis nlp pre-trained

Dev set: 0.97

Number of crisis tweets: 32000
Number of non-crisis tweets: 32000

Classification report : 
               precision    recall  f1-score   support

      normal       0.92      0.97      0.95     32000
      crisis       0.97      0.92      0.94     32000

    accuracy                           0.94     64000
   macro avg       0.95      0.94      0.94     64000
weighted avg       0.95      0.94      0.94     64000

Bayes

Number of crisis tweets: 270000
Number of non-crisis tweets: 270000

Cross Validation
Accuracy: 0.97
Precision: 0.98
Recall: 0.96
F1 score: 0.97

Evaluate Bayes

Number of crisis tweets: 32000
Number of non-crisis tweets: 32000

Accuracy Score: 0.95

Classification report : 
               precision    recall  f1-score   support

      normal       0.93      0.98      0.95     32000
      crisis       0.97      0.93      0.95     32000

    accuracy                           0.95     64000
   macro avg       0.95      0.95      0.95     64000
weighted avg       0.95      0.95      0.95     64000

SVM

Number of crisis tweets: 270000
Number of non-crisis tweets: 270000

Cross Validation
Accuracy: 0.99
Precision: 1.00
Recall: 0.99
F1 score: 0.99

Evaluate SVM

Number of crisis tweets: 32000
Number of non-crisis tweets: 32000

Accuracy Score: 0.84

Classification report : 
               precision    recall  f1-score   support

      normal       0.76      1.00      0.87     32000
      crisis       1.00      0.69      0.82     32000

    accuracy                           0.85     64000
   macro avg       0.88      0.85      0.84     64000
weighted avg       0.88      0.85      0.84     64000

Number of crisis tweets: 45000
Number of non-crisis tweets: 45000
max_features: 1000

Accuracy Score: 83.409375
Classification report : 
               precision    recall  f1-score   support

      normal       0.75      1.00      0.86     32000
      crisis       1.00      0.67      0.80     32000

    accuracy                           0.83     64000
   macro avg       0.87      0.83      0.83     64000
weighted avg       0.87      0.83      0.83     64000

Loading tweets... Number of crisis tweets: 90000 Number of non-crisis tweets: 90000 Clearing tweets... Training SVM... (180000, 1000) Cross Validation... Accuracy: 0.99 Precision: 1.00 Recall: 0.99 F1 score: 0.99 Execution Time: 00:00:07.07

Plotting learning curve... Done.

Fitting... Evaluate SVM... Number of crisis tweets: 32000 Number of non-crisis tweets: 32000 Evaluating SVM ... Accuracy Score: 85.6328125 Classification report : precision recall f1-score support

  normal       0.78      1.00      0.87     32000
  crisis       1.00      0.71      0.83     32000

accuracy                           0.86     64000

macro avg 0.89 0.86 0.85 64000 weighted avg 0.89 0.86 0.85 64000

riccorl / emergency-situation-awareness Goto Github PK