Code Monkey home page Code Monkey logo

dual-network-wsod's Introduction

By Xuanyi Dong, Deyu Meng, Fan Ma, Yi Yang. This paper is accepted by ACM Multimedia 2017.

Introduction

Dual-Network is a weakly supervised object detection framework leveraging deep CNN models.

This project is modified on the Matlab code of R-FCN and Fast R-CNN.

License

Dual-Network is released under the MIT License (refer to the LICENSE file for details).

Resources & Preparation

  1. ImageNet-pretrained networks: Google Drive. Please save the models into the corresponding sub-directory of models/pre_trained_models.
  2. The initial pseudo labels for PASCAL VOC 2007 by ContextLocNet : Google Drive. Please save and extract it into data.
  3. The pre-computed region proposals: Google Drive. Please save and extract it into data.
  4. Download the PASCAL VOC 2007 data into datasets, following the README in datasets.
  5. Compile Caffe located in external/caffe.
  6. Run dual_build.m to complie the nms mex functions.
  7. Run startup.m to add necessary paths.

Training & Testing

  • [TODO] re-organize the experiment codes.

Citing Dual-Network

If you find Dual-Network useful in your research, please consider citing:

@inproceedings{dong2017dual,
    title={A Dual-Network Progressive Approach to Weakly Supervised Object Detection},
    author={Dong, Xuanyi and Meng, Deyu and Ma, Fan and Yang, Yi},
    booktitle={Proceedings of the 2017 ACM on Multimedia Conference},
    pages={279--287},
    year={2017},
    organization={ACM}
}
@inproceedings{kantorov2016,
    title = {ContextLocNet: Context-aware Deep Network Models for Weakly Supervised Localization},
    author = {Kantorov, V., Oquab, M., Cho M. and Laptev, I.},
    booktitle = {Proc. European Conference on Computer Vision (ECCV), 2016},
    year = {2016}
}
@article{dai16rfcn,
    Author = {Jifeng Dai, Yi Li, Kaiming He, Jian Sun},
    Title = {{R-FCN}: Object Detection via Region-based Fully Convolutional Networks},
    Journal = {arXiv preprint arXiv:1605.06409},
    Year = {2016}
}
@inproceedings{girshick2015fast,
    title={Fast R-CNN},
    author={Girshick, Ross},
    booktitle={Proceedings of the IEEE international conference on computer vision},
    pages={1440--1448},
    year={2015}
}

dual-network-wsod's People

Contributors

d-x-y avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.