Track4:Enhancing Retail Checkout through Video Inpainting, YOLOv8 Detection, and DeepSort Tracking (Retail Counter)

This project includes our solution for Track 4 of the 2023 AI City Challenge, Welcome to the RetailCounter GitHub repository, where we present our sophisticated deep learning-based framework that can recognize, localize, track, and count products as they traverse in front of a camera. The retail industry has been experiencing an upswing in the use of cutting-edge artificial intelligence and computer vision techniques. Among the major challenges in this domain is the development of an automated checkout system that can address the multifaceted issues that arise in real-world checkout scenarios. Our proposed framework, RetailCounter, is founded on a detect-then-track paradigm, and we apply tracking on the bounding box of the detected objects. In addition, we have integrated an automatic identification of the detection region of interest (ROI) and efficient removal of unwanted objects from the ROI. Our framework's performance is competitive, as evidenced by our F1 score of 0.8177 and the fourth-place ranking we achieved in track 4 of the 2023 AI City Challenge.

Link to downlaod our paper download

Environment setup

conda create --name aic23 python=3.9.0
conda activate aic23
conda install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 cudatoolkit=11.0 -c pytorch
cd retailcounter
python -m pip install -r requirements.txt
cd ..

Note: for newer devices, such as the NVIDIA RTX 3090, you may use Pytorch 1.10 with cudatoolkit 11.3.

Dataset

Download the track 4 dataset from the 2023 AI City Challenge.

Reproduce results with pre-trained detector

Prepare testing dataset

If you do not want to train detector, You can download pretrain model from here and Dataset can be in AIC23_Track4_TestA.zip (or TestB) is sufficient.

export TRACK_4_DATA_ROOT={/path/to/track_4/root_dir}

For example: export TRACK_4_DATA_ROOT=/mnt/data/AIC22_Track4_TestA/Test_A

Download pre-trained model

cd retailcounter
python download_pretrained_models.py --detector

Alternatively, you may train detector at your own

Removing Outliers

Run: To create Video Inpainting follow the steps from the GitHub repo mentioned below

git clone https://github.com/MCG-NKU/E2FGVI
python inpainting_process.py --video_id $TRACK_4_DATA_ROOT/video_id.txt

video_id.txt file is available in AIC22_Track4_TestA and contain video IDs and video file names (in the same directory)

Automatic ROI Detection

Run:

python detect_ROI.py --video_id $TRACK_4_DATA_ROOT/video_id.txt

Arguments --roi_seed can be set (two values) - it specifies seed position for ROI detection (white tray) in format x y

Detect products and create submission

Run:

python detect_and_create_submission.py --video_id $TRACK_4_DATA_ROOT/video_id.txt

Parameters --tracker and --img_size can be set. The values are pre-set to tracker = BYTE, img_size = 640

Hint

All scripts are set as the result was reported to AI City Challenge and no arguments must be set (only --video_id).

Train object detector for store checkout

Prepare AI City Challenge dataset

Transform data structure - separate data by classes

cd retailcounter
cp split_data.sh Track4/Train_SynData/segmentation_labels/split_data.sh
cd Track4/Train_SynData/segmentation_labels
bash split_data.sh

cd retailcounter
cp split_data.sh Track4/Train_SynData/syn_image_train/split_data.sh
cd Track4/Train_SynData/syn_image_train
bash split_data.sh

Train detector (can take many hours/several days)

Download pretrained-model models without detector

python download_pretrained_models.py

Prepare AI City Challenge dataset as described above
Create dataset

python create_dataset.py --t_4_track {/path/to/AIC_Track4/Train_SynData}

Train detector In order to train the detector you need to follow this GitHub repo of YOLOV8 mentioned below

YOLOV8
git clone https://github.com/ultralytics/ultralytics
YOLOX
python train_detector.py

Arguments --batch_size and --epochs can be set. Explicit values are batch_size = 16, epochs = 75.

Acknowledgements

Instance segmentation: MMdetection
Inpainting: VideoInpainting
Detector: YOLOX, YOLOV8
Trackers: SORT, DEEPSORT

Citation

If you found our work useful, please cite our paper:

@inproceedings{vats-aic23,
   author    = {Arpita Vats and David C. Anastasiu},
   title     = {Enhancing Retail Checkout through Video Inpainting, YOLOv8 Detection, and DeepSort Tracking},
   volume    = {1},
   month     = {July},
   booktitle = {2023 IEEE Conference on Computer Vision and Pattern Recognition Workshops},
   series    = {CVPRW'23},
   year      = {2023},
   pages     = {},
   location  = {Vancouver, Canada}
}

davidanastasiu / retailcounter Goto Github PK

retailcounter's Introduction

Track4:Enhancing Retail Checkout through Video Inpainting, YOLOv8 Detection, and DeepSort Tracking (Retail Counter)

Environment setup

Dataset

Reproduce results with pre-trained detector

Prepare testing dataset

Download pre-trained model

Removing Outliers

Automatic ROI Detection

Detect products and create submission

Hint

Train object detector for store checkout

Prepare AI City Challenge dataset

Transform data structure - separate data by classes

Train detector (can take many hours/several days)

Acknowledgements

Citation

retailcounter's People

Contributors

Stargazers

Watchers

retailcounter's Issues

Recommend Projects

Recommend Topics

Recommend Org