Code Monkey home page Code Monkey logo

hazard-view-bird's Introduction

Hazard View Bird - On-device Disaster Scene Parsing

This project focuses on the development of an efficient model for semantic segmentation on edge devices, specifically targeting the analysis of disaster scenes from images captured by unmanned aerial vehicles (UAVs). The key challenge is to optimize the model for accuracy, speed, and power consumption, given the computational constraints of edge computing environments. Our goal is to enable rapid and accurate understanding of disaster-stricken areas through advanced computer vision techniques, facilitating quicker response times for relief operations.

Table of Contents

Dependencies

The project requires Python 3.6.9 and the following libraries:

  • PyTorch 1.11.1
  • OpenCV 4.7.0.72
  • NumPy 1.19.5
  • Pillow 8.4.0

To install the dependencies, use the provided requirements.txt file:

python3.6 -m pip install --upgrade pip
python3.6 -m pip install -r requirements.txt

Dataset

The dataset comprises 1,700 images collected by UAVs in disaster-affected areas. The images are segmented into 14 classes representing various elements and conditions found in disaster scenes. Place the dataset in the dataset/ directory following this structure:

├── dataset
│   ├── train
│   │    ├── IMG
│   │    │    └──train
│   │    └── GT
│   │         └──train
│   └── val
│       ├── IMG
│       │    └──val
│       └── GT
│       │    └──val

Results

Our evaluation focused on various architectures and backbones, emphasizing their performance in terms of the Dice Coefficient—a critical measure for segmentation tasks. Below is a summary of our findings:

Architecture Backbone Dice Coefficient
FPN EfficientNetB0 0.56
FPN MobileNetV2 0.52
UNET MobileNetV2 0.508
UNET EfficientNetB0 0.47
PSPNET MobileNetV2 0.47

Metrics

The models were evaluated based on two main metrics:

  • Accuracy: Measured by the Dice Coefficient across all 14 categories. The Dice Coefficient is a statistical tool that measures the similarity between two samples. This metric is particularly useful in segmentation tasks to quantify the model's performance in predicting the overlap between the predicted segmentation and the ground truth.

  • Speed: Assessed as the average runtime for processing one frame, expressed in seconds per frame (s/f). This metric is crucial for applications requiring real-time or near-real-time processing capabilities, especially in disaster response scenarios where timely information can significantly impact decision-making processes.

Classification Labels

The dataset includes images classified into 14 distinct categories, representing various elements commonly observed in disaster-stricken areas:

Background

  1. Avalanche
  2. Building Undamaged
  3. Building Damaged
  4. Cracks/Fissure/Subsidence
  5. Debris/Mud/Rock Flow
  6. Fire/Flare
  7. Flood/Water/River/Sea
  8. Ice Jam Flow
  9. Lava Flow
  10. Person
  11. Pyroclastic Flow
  12. Road/Railway/Bridge
  13. Vehicle

hazard-view-bird's People

Contributors

bobergot avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Forkers

varadhkaushik

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.