Code Monkey home page Code Monkey logo

dgnet's Introduction

Deep Gradient Learning for Efficient Camouflaged Object Detection

Authors: Ge-Peng Ji, Deng-Ping Fan, Yu-Cheng Chou, Dengxin Dai, Alexander Liniger, & Luc Van Gool.

This repository contains the source code, prediction results, and evaluation toolbox of our Deep Gradient Network, also called DGNet. The technical report could be found at arXiv.

1. Features


Figure 1: We present the scatter relationship between the performance weighted F-measure and parameters of all competitors on CAMO-Test. These scatters are in various colors for better visual recognition and are also corresponding to the histogram (Right). The larger size of the coloured scatter point, the heavier the model parameter. (Right) We also report the parallel histogram comparison of model's parameters, MACs, and performance.

  • Novel supervision. We propose to excavate the texture information via learning the object level gradient rather than using boundary-supervised or uncertainty-aware modeling.

  • Simple but efficient. We decouple all the heavy designs as much as we can, yielding a simple but efficient framework. We hope this framework could be served as a baseline learning paradigm for the COD field.

  • Best trade-off. Our vision is to achieve new SOTA with the best performance-efficiency trade-off on existing cutting-edge COD benchmarks.

2. ๐Ÿ”ฅ NEWS ๐Ÿ”ฅ

  • [2022/05/30] ๐Ÿ”ฅ We release the implementation of DGNet with different AI frameworks: Pytorch-based and Jittor-based.
  • [2022/05/30] Thank @Katsuya Hyodo for adding our model into PINTO. This is a repository for storing models that have been inter-converted between various frameworks (e.g., TensorFlow, PyTorch, ONNX).
  • [2022/05/25] Releasing the codebase of DGNet (Pytorch) and whole COD benchmarking results (20 models).
  • [2022/05/23] Creating repository.

This project is still work in progress, and we invite all to contribute in making it more acessible and useful. If you have any questions about our paper, feel free to contact me via e-mail ([email protected] & [email protected] & [email protected]). And if you are using our code and evaluation toolbox for your research, please cite this paper (BibTeX).

3. Proposed Framework

3.1. Overview


Figure 2: Overall pipeline of the proposed DGNet, It consists of two connected learning branches, i.e., context encoder and texture encoder. Then, we introduce a gradient-induced transition (GIT) to collaboratively aggregate the feature that is derived from the above two encoders. Finally, a neighbor connected decoder (NCD [1]) is adopted to generate the prediction.


Figure 3: Illustration of the proposed gradient-induced transition (GIT). It use a soft grouping strategy to provide parallel nonlinear projections at multiple fine-grained sub-spaces, which enables the network to probe multi-source representations jointly.

References of neighbor connected decoder (NCD) benchmark works [1] Concealed Object Detection. TPAMI, 2022.

3.2. Usage

The training and testing experiments are conducted using PyTorch and Jittor libraries with a single GeForce RTX TITAN GPU. Note that we only report the results of the Pytorch-based DGNet in our manuscript.

  • For the Pytorch usage, please refer to our pytorch_lib.

  • For the Jittor usage, please refer to our jittor_lib.

3.3 Evaluation

One-key evaluation is written in MATLAB code ./eval/matlab/, please follow this the instructions in ./eval/matlab/main.m and just run it to generate the evaluation results in ./eval-result/.

3.4 COD Benchmark Results:

The prediction of our DGNet and DGNet-S can be found in Pytorch / Jitror. The whole benchmark results can be found at OneDrive. Here are quantitative performance comparison from three perspectives.


Figure 4: Quantitative results in terms of full metrics for cutting-edge competitors, including 8 SOD-related and 12 COD-related, on three test datasets: NC4K-Test, CAMO-Test, and COD10K-Test. @R means the ranking of the current metric, and Mean@R indicates the mean ranking of all metrics.


Figure 5: Super-classes (i.e., Amphibian, Aquatic, Flying, Terrestrial, and Other) on the COD10K-Test of the proposed methods (DGNet & DGNet-S) and other 20 competitors. Symbol \uparrow indicates the higher the score, the better, and symbol \downarrow indicates the lower, the better. The best score is marked with bold.


Figure 6: Sub-class results on COD10K-Test of 12 COD-related and 8 SOD-related baselines in terms of structure measure (\mathcal{S}_\alpha), where Am., Aq., Fl., Te., and Ot. represent Amphibian, Aquatic, Flying, Terrestrial, and Other, respectively. CDL., GP.Fish, and LS.Dragon denote Crocodile, and GhostPipeFish, LeafySeaDragon, respectively. The best score is marked with bold.

4. Citation

Please cite our paper if you find the work useful:

@article{ji2022gradient,
  title={Deep Gradient Learning for Efficient Camouflaged Object Detection},
  author={Ji, Ge-Peng and Fan, Deng-Ping and Chou, Yu-Cheng and Dai, Dengxin and Liniger, Alexander and Van Gool, Luc},
  journal={arXiv},
  year={2022}
} 

dgnet's People

Contributors

johnson111788 avatar gewelsji avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.