Code Monkey home page Code Monkey logo

cswintt's Introduction

CSWinTT

The official implementation of the CVPR 2022 paper Transformer Tracking with Cyclic Shifting Window Attention

[Models and Raw results] (Google Driver) or [Models and Raw results] (Baidu Driver: bsa2).

CSWinTT_Framework

Highlights

Introduction

CSWinTT is a new transformer architecture with multi-scale cyclic shifting window attention for visual object tracking, elevating the attention from pixel to window level. The cross-window multi-scale attention has the advantage of aggregating attention at different scales and generates the best fine-scale match for the target object.

Performance

Tracker UAV123 (AUC) LaSOT (NP) TrackingNet (NP) GOT-10K (AO)
CSWinTT 70.5 75.2 86.7 69.4

Install the environment

conda create -n cswintt python=3.7
conda activate cswintt
bash install.sh

Data Preparation

Put the tracking datasets in ./data. It should look like:

${CSWinTT_ROOT}
 -- data
     -- lasot
         |-- airplane
         |-- basketball
         |-- bear
         ...
     -- got10k
         |-- test
         |-- train
         |-- val
     -- trackingnet
         |-- TRAIN_0
         |-- TRAIN_1
         ...
         |-- TRAIN_11
         |-- TEST

Run the following command to set paths for this project

python tracking/create_default_local_file.py --workspace_dir . --data_dir ./data --save_dir .

After running this command, you can also modify paths by editing these two files

lib/train/admin/local.py  # paths about training
lib/test/evaluation/local.py  # paths about testing

Train CSWinTT

python tracking/train.py --script cswintt --config baseline_cs --save_dir . --mode single 
python tracking/train.py --script cswintt_cls --config baseline_cs --save_dir . --mode single --script_prv cswintt --config_prv baseline_cs  

Test CSWinTT

Download the model and put it in output/checkpoints

  • UAV123
python tracking/test.py cswintt baseline_cs --dataset uav --threads 32
  • LaSOT
python tracking/test.py cswintt baseline_cs --dataset lasot --threads 32
  • GOT10K-test
python tracking/test.py cswintt baseline_got10k_only --dataset got10k_test --threads 32
  • TrackingNet
python tracking/test.py cswintt baseline_cs --dataset trackingnet --threads 32

Model Zoo and raw results

The trained models and the raw tracking results are provided in the [Models and Raw results] (Google Driver) or [Models and Raw results] (Baidu Driver: bsa2).

Contact

Zikai Song: [email protected]

Acknowledgments

  • Thanks for the PyTracking Library and STARK Library, which helps us to quickly implement our ideas.

cswintt's People

Contributors

skyesong38 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.