Code Monkey home page Code Monkey logo

sinet-v2's Introduction

Concealed Object Detection (SINet-V2)

PyTorch implementation of our Search and Identification Network (SINet-V2).

Authors: Deng-Ping Fan, Ge-Peng Ji, Ming-Ming Cheng, Ling Shao.

1. Preface

  • Introduction. This repository contains the source code, prediction results, and evaluation toolbox of our Search and Identification Network (SINet-V2) , which are the journal extension version of our paper SINet (github/ pdf) published at CVPR-2020.

  • Highlights. Compared to our conference version, we achieve new SOTA in the field of COD via the two well-elaborated sub-modules, including neighbor connection decoder (NCD) and group-reversal attention (GRA). Please refer to our paper for more details.

If you have any questions about our paper, feel free to contact me via e-mail ([email protected]). And if you are using our our and evaluation toolbox for your research, please cite this paper (BibTeX).

2. ๐Ÿ”ฅ NEWS ๐Ÿ”ฅ

  • [2021/02/21] Upload the whole project.
  • [2021/01/16] Create repository.

3. Overview


Figure 1: Task relationship. One of the most popular directions in computer vision is generic object detection. Note that generic objects can be either salient or camouflaged; camouflaged objects can be seen as difficult cases of generic objects. Typical generic object detection tasks include semantic segmentation and panoptic segmentation (see Fig. 2 b).


Figure 2: Given an input image (a), we present the ground-truth for (b) panoptic segmentation (which detects generic objects including stuff and things), (c) salient instance/object detection (which detects objects that grasp human attention), and (d) the proposed camouflaged object detection task, where the goal is to detect objects that have a similar pattern (e.g., edge, texture, or color) to the natural habitat. In this case, the boundaries of the two butterflies are blended with the bananas, making them difficult to identify. This task is far more challenging than the traditional salient object detection or generic object detection.

References of Salient Object Detection (SOD) benchmark works
[1] Video SOD: Shifting More Attention to Video Salient Object Detection. CVPR, 2019. (Project Page)
[2] RGB SOD: Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground. ECCV, 2018. (Project Page)
[3] RGB-D SOD: Rethinking RGB-D Salient Object Detection: Models, Datasets, and Large-Scale Benchmarks. TNNLS, 2020. (Project Page)
[4] Co-SOD: Taking a Deeper Look at the Co-salient Object Detection. CVPR, 2020. (Project Page)

4. Proposed Framework

4.1. Training/Testing

The training and testing experiments are conducted using PyTorch with a single GeForce RTX TITAN GPU of 24 GB Memory.

Note that our model also supports low memory GPU, which means you should lower the batch size.

  1. Prerequisites:

    Note that SINet-V2 is only tested on Ubuntu OS with the following environments. It may work on other operating systems (i.e., Windows) as well but we do not guarantee that it will.

    • Creating a virtual environment in terminal: conda create -n SINet python=3.6.

    • Installing necessary packages: PyTorch > 1.1, opencv-python

  2. Prepare the data:

    • downloading testing dataset and move it into ./Dataset/TestDataset/, which can be found in this [download link (Coming soon ...)].

    • downloading training/validation dataset and move it into ./Dataset/TrainValDataset/, which can be found in this [download link (Coming soon ...)].

    • downloading pretrained weights and move it into ./snapshot/SINet_V2/Net_epoch_best.pth, which can be found in this download link (Google Drive).

    • downloading Res2Net weights on ImageNet dataset download link (Google Drive).

  3. Training Configuration:

    • Assigning your costumed path, like --train_save and --train_path in MyTrain_Val.py.

    • Just enjoy it via run python MyTrain_Val.py in your terminal.

  4. Testing Configuration:

    • After you download all the pre-trained model and testing dataset, just run MyTesting.py to generate the final prediction map: replace your trained model directory (--pth_path).

    • Just enjoy it!

3.2 Evaluating your trained model:

One-key evaluation is written in MATLAB code (link), please follow this the instructions in ./eval/main.m and just run it to generate the evaluation results in ./res/. The complete evaluation toolbox (including data, map, eval code, and res): link.

3.3 Pre-computed maps:

They can be found in download link.

4. Citation

Please cite our paper if you find the work useful:

@article{fan2021cancealed,
title={Concealed Object Detection},
author={Fan, Deng-Ping and Ji, Ge-Peng and Cheng, Ming-Ming and Shao, Ling},
journal={arXiv},
year={2021}
}

6. FAQ

  1. If the image cannot be loaded in the page (mostly in the domestic network situations).

    Solution Link


โฌ† back to top

sinet-v2's People

Contributors

gewelsji avatar dengpingfan avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.