Code Monkey home page Code Monkey logo

sohuemily / multi-camera-vehicle-tracking-and-reidentification Goto Github PK

View Code? Open in Web Editor NEW

This project forked from alexxiao95/multi-camera-vehicle-tracking-and-reidentification

0.0 1.0 0.0 33.16 MB

This repository contains our source code of Track 3 in the NVIDIA AI City Challenge Workshop at CVPR 2018. Please find the full source code of both Track 1 and Track 3 here ->

Home Page: https://github.com/AlexXiao95/2018AICity_TeamUW

CMake 0.62% Makefile 0.38% Shell 0.61% HTML 0.03% CSS 0.17% Python 51.20% C++ 42.38% MATLAB 2.45% M 0.01% Cuda 2.15%

multi-camera-vehicle-tracking-and-reidentification's Introduction

Multi-Camera Vehicle Tracking and Re-identification

This repository contains our source code of Track 3 in the NVIDIA AI City Challenge Workshop at CVPR 2018.

[Full source code], [Slides], [Paper], [Poster], [Project Page], [2018 NVIDIA AI City Challenge]

How It Works

We achieved Multi-Camera Vehicle Tracking and Re-identification based on a fusion of histogram-based adaptive appearance models, DCNN features, detected license plates, detected car types and traveling time information.

Getting Started

Prerequisites

The code has been tested on Ubuntu 16.04.

Dataset

The track 3 dataset contains 15 videos from 4 different locations, each around 0.5-1.5 hours long, recorded at 30 fps and 1080p resolution (1920ร—1080). The task is to identify all vehicles that pass through each recorded location at least once in the given set of videos. The camera locations and linked areas are shown below:

The UA-DETRAC dataset includes 10 hours of videos captured with a Canon EOS 550D camera at 24 different locations at Beijing and Tianjin in China. The videos are recorded at 25 frames per seconds (fps), with resolution of 960ร—540 pixels. There are more than 140 thousand frames in the UA-DETRAC dataset and 8250 vehicles that are manually annotated, leading to a total of 1.21 million labeled bounding boxes of objects. The primary aim of this dataset is to train vehicle detection algorithms.

The Comprehensive Cars (CompCars) dataset contains data from two scenarios, including images from web-nature and surveillance-nature. The web-nature data contains 163 car makes with 1,716 car models. The dataset is well prepared for the following computer vision tasks:
  • Fine-grained classification
  • Attribute prediction
  • Car model verification

You can find a pre-trained model here. Please refer the dataset website and follow the author's instruction to download the dataset.

The BoxCars dataset contains 116k of images of vehicles with fine-grained labels taken from surveillance cameras under various viewpoints.

Input/Output Format

Simply run bash src/run_all.sh in the command line. The input is single camera tracking results for all 15 videos by using our method in Track 1. The format of each line is as follows:

<video_id> <frame_id> <obj_id> <xmin> <ymin> <xmax> <ymax> <speed> <confidence>

  • <video_id> is the video numeric identifier, starting with 1. It represents the position of the video in the list of all track videos, sorted in alphanumeric order.
  • <frame_id> represents the frame count for the current frame in the current video, starting with 1.
  • <obj_id> is a numeric identifier. It is integer. It can be ignored for Track 1 (set to -1).
  • The axis-aligned rectangular bounding box of the detected video will be denoted by its pixel-valued coordinates within the image canvas, <xmin> <ymin> <xmax> <ymax>, computed from the top-left corner of the image (similar to the VOC2012 challenge format). All coordinates are integers.
  • <speed> denotes the instantaneous speed of the vehicle in the given frame, measured in miles per hour (mi/h), which is a non-negative real value.
  • <confidence> denotes the confidence of the prediction. Should be between 0 and 1.

The output is all possible candidates which will be used for license plate comparison. The format of each line is as follows:

<img_path> <similarity>

  • <img_path> is the path of probe vehicle and gallery vehicle. The first line is the probe image and the follwing is gallery images which are in a descending order in terms of similarity.
  • <similarity> is the similarity between probe vehicle and gallery vehicle based on a fusion of histogram-based adaptive appearance models, DCNN features, detected car types and traveling time information.

Demo [video]

Reference

Please cite these papers in your publications if it helps your research:

@inproceedings{tang2018vehicle,
  author = {Zheng Tang and Gaoang Wang and Hao Xiao and Aotian Zheng and Jenq-Neng Hwang},
  booktitle = {CVPR Workshop (CVPRW) on the AI City Challenge},
  title = {Single-camera and Inter-camera Vehicle Tracking and 3D Speed Estimation Based on Fusion of Visual and Semantic Features},
  year = {2018},
  pages = {108--115}
}

multi-camera-vehicle-tracking-and-reidentification's People

Contributors

alexxiao95 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.