Code Monkey home page Code Monkey logo

vds2raw's Introduction

VFNet with ResNet-18 for Vessel Detection in S-2 Raw Imagery

This project implements a vessel detection system using VFNet with ResNet-18 as the backbone network. The framework is based on MMDet, a powerful open-source object detection toolbox. The goal of this project is to detect vessels in S-2 imagery and provide accurate bounding box predictions.

Table of Contents

  1. Introduction
  2. Installation
  3. Dataset
  4. Training
  5. Inference
  6. Results
  7. Customization
  8. Contributing
  9. License

Introduction

The goal of this project is to detect vessels in S-2 Raw imagery and provide accurate bounding box predictions. The project implements a light vessel detection system using VFNet with ResNet-18 as the backbone network. The framework is based on MMDet, a powerful open-source object detection toolbox.

VFNet with ResNet-18 architecture

The VFNet with ResNet-18 architecture combines the advantages of VFNet (an effective anchor-free object detection network) and ResNet-18 (a widely-used convolutional neural network architecture). This combination allows for accurate vessel detection in S-2 imagery, leveraging the strong feature representation capabilities of ResNet-18 and the robustness of VFNet.

The MMDet framework provides a comprehensive set of tools for training and evaluating object detection models. It supports various state-of-the-art algorithms and backbone networks, making it suitable for a wide range of computer vision tasks.

Installation

To set up the project and run the code, please follow these steps:

  1. Clone the repository:
+ git clone https://github.com/sirbastiano/VDS2Raw
# move to the dir:
+ cd VDS2Raw
  1. Set up the Python environment. It is recommended to use a virtual environment to avoid conflicts with other Python packages. The bash script provided install the environment and also the required dependencies:
+ source mmdet_install.sh # (Tested on linux and macos)

Note:
Make sure you have the necessary CUDA and CUDNN libraries installed if you plan to train the model on a GPU.

Data

To train and evaluate the vessel detection model, you will need a dataset of vessels in S-2 imagery. Ensure that your dataset is in proper format:

Place your data in the "input_data" directory.

Inference

To perform inference and detect vessels in new S-2 imagery, follow these steps:

  1. Download weights (https://drive.google.com/file/d/1EUTRbgCrp4X7Ii9p7OiLscQ5u5aMKFZi/view?usp=share_link) and place them in the "checkpoint" directory.

  2. Configure the inference settings in the project's configuration file. Specify the path to the trained model checkpoint and any other necessary parameters.

  3. Run the inference script:

+ python inference.py --img_path /path/to/image --threshold 0.5 --device cpu

The script will load the trained model, process the input imagery, and generate bounding box predictions for the vessels present in the images in the output_results folder.

  1. Evaluate the results and analyze the model's performance on the new imagery.

Results

After training and evaluating the model, you can analyze the results to assess the model's accuracy and performance. Use evaluation metrics such as precision, recall, and mean average precision (mAP) to measure the model.

VFNet with ResNet-18 architecture

vds2raw's People

Contributors

sirbastiano avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

jzvolensky

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.