Code Monkey home page Code Monkey logo

metr's Introduction

METR

This is the official implementation of the paper "Language-aware Multiple Datasets Detection Pretraining for DETRs".

Authors: Jing Hao*, Song Chen*

Installation

We use the environment same to DINO to run METR. If you have run DN-DETR or DAB-DETR or DINO, you can skip this step. We test our models under python=3.7.3,pytorch=1.9.0,cuda=11.1. Other versions might be available as well.

  1. Clone this repo
git clone https://github.com/isbrycee/METR.git
cd METR
  1. Install Pytorch and torchvision

Follow the instruction on https://pytorch.org/get-started/locally/.

# an example:
conda install -c pytorch pytorch torchvision
  1. Install other needed packages
pip install -r requirements.txt
  1. Compiling CUDA operators
cd models/metr/ops
python setup.py build install
# unit test (should see all checking is True)
python test.py
cd ../../..

Data

Please download COCO 2017 dataset and organize them as following:

COCODIR/
  ├── train2017/
  ├── val2017/
  └── annotations/
  	├── instances_train2017.json
  	└── instances_val2017.json

Run

We use METR 4-scale model trained for 12 epochs as default experiment setting.

Training

bash scripts_METR/METR_train_dist_4scale_r50_coco.sh

Evaluation

bash scripts_METR/METR_eval_dist_4scale_r50_coco.sh

Notes:

  1. You should change the dataset path on scripts before running.
  2. This code implementation also supports for ViT backbone.

Links

Our model is based on DINO.

Bibtex

If you find our work helpful for your research, please consider citing the following BibTeX entry.

@article{hao2023language,
  title={Language-aware Multiple Datasets Detection Pretraining for DETRs},
  author={Hao, Jing and Chen, Song and Wang, Xiaodi and Han, Shumin},
  journal={arXiv preprint arXiv:2304.03580},
  year={2023}
}
@misc{zhang2022dino,
      title={DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection}, 
      author={Hao Zhang and Feng Li and Shilong Liu and Lei Zhang and Hang Su and Jun Zhu and Lionel M. Ni and Heung-Yeung Shum},
      year={2022},
      eprint={2203.03605},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

metr's People

Contributors

isbrycee avatar

Stargazers

Yangtao avatar Ellery Queen avatar  avatar  avatar CHENS avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.