
attention-target-detection's Introduction

CVPR 2020 - Detecting Attended Visual Targets in Video

Overview

This repo provides a PyTorch implementation of our paper: 'Detecting Attended Visual Targets in Video' [paper]

We present a state-of-the-art method for predicting attention targets from a third-person point of view. The model takes the head bounding box of a person of interest and outputs an attention heatmap for that person.

We release our new dataset, training/evaluation code, demo code, and pre-trained models for the two main experiments reported in our paper. Please refer to the paper for details.

Getting Started

The code has been verified on Python 3.5 and PyTorch 0.4. We provide a conda environment.yml file which you can use to re-create the environment we used. Instructions on how to create an environment from an environment.yml file can be found here.
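For example, assuming you have conda installed, the environment can typically be created and activated as follows (the environment name is whatever is defined inside environment.yml):

conda env create -f environment.yml
conda activate <environment-name-from-yml>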

Download our model weights using:

sh download_models.sh

Quick Demo

You can try out our demo using the sample data included in this repo by running:

python demo.py
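demo.py also exposes command-line arguments for pointing it at your own frames and head bounding boxes (argument names taken from the script's argument parser; the values below are its defaults):

python demo.py --image_dir data/demo/frames --head data/demo/person1.txt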

Experiment on the GazeFollow dataset

Dataset

We use the extended GazeFollow annotation prepared by Chong et al. (ECCV 2018), which adds an annotation to the original GazeFollow dataset indicating whether each gaze target is inside or outside the frame. You can download the extended dataset from here (images and labels) or here (labels only).

Please adjust the dataset path accordingly in config.py.
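As a rough illustration, the dataset-path section of config.py might look like the sketch below; the variable names and file names here are illustrative placeholders, not necessarily the ones used in the repository, so check config.py for the actual names. The same file also holds the paths for the VideoAttentionTarget experiment described later.

# config.py -- illustrative sketch only; actual variable and file names may differ
# Extended GazeFollow dataset
gazefollow_train_data = "/path/to/gazefollow/images"
gazefollow_train_label = "/path/to/gazefollow/train_annotations.txt"
gazefollow_val_data = "/path/to/gazefollow/images"
gazefollow_val_label = "/path/to/gazefollow/test_annotations.txt"

# VideoAttentionTarget dataset (second experiment below)
videoattentiontarget_train_data = "/path/to/videoattentiontarget/images"
videoattentiontarget_train_label = "/path/to/videoattentiontarget/annotations/train"
videoattentiontarget_val_data = "/path/to/videoattentiontarget/images"
videoattentiontarget_val_label = "/path/to/videoattentiontarget/annotations/test"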

Evaluation

Run:

python eval_on_gazefollow.py

to get the model's performance on the GazeFollow test set.

Training

Run:

python train_on_gazefollow.py

to train the model. You can expect to see learning curves similar to ours.

Experiment on the VideoAttentionTarget dataset

Dataset

We created a new dataset, VideoAttentionTarget, with fully annotated attention targets in video for this experiment. Dataset details can be found in our paper. Download the VideoAttentionTarget dataset from here.

Please adjust the dataset path accordingly in config.py.

Evaluation

Run:

python eval_on_videoatttarget.py

to get the model's performance on the VideoAttentionTarget test set.

Training

Run:

python train_on_videoatttarget.py

to do the temporal training.

Citation

If you use our dataset and/or code, please cite

@inproceedings{Chong_2020_CVPR,
  title={Detecting Attended Visual Targets in Video},
  author={Chong, Eunji and Wang, Yongxin and Ruiz, Nataniel and Rehg, James M.},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2020}
}

If you only use the extended GazeFollow annotations, please cite

@InProceedings{Chong_2018_ECCV,
  author = {Chong, Eunji and Ruiz, Nataniel and Wang, Yongxin and Zhang, Yun and Rozga, Agata and Rehg, James M.},
  title = {Connecting Gaze, Scene, and Attention: Generalized Attention Estimation via Joint Modeling of Gaze and Scene Saliency},
  booktitle = {The European Conference on Computer Vision (ECCV)},
  month = {September},
  year = {2018}
}

References

We make use of the PyTorch ConvLSTM implementation provided by https://github.com/kamo-naoyuki/pytorch_convolutional_rnn.
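For readers unfamiliar with convolutional LSTMs, the sketch below is a minimal ConvLSTM cell in plain PyTorch, included only to illustrate the idea of replacing the LSTM's matrix multiplications with 2D convolutions; it is not the referenced implementation, which wraps convolutional RNNs behind a torch.nn-style interface.

import torch
import torch.nn as nn

class ConvLSTMCell(nn.Module):
    """A single ConvLSTM cell: the four LSTM gates are computed with one 2D convolution."""
    def __init__(self, in_channels, hidden_channels, kernel_size=3):
        super().__init__()
        # One convolution produces all four gates (input, forget, cell, output).
        self.gates = nn.Conv2d(in_channels + hidden_channels, 4 * hidden_channels,
                               kernel_size, padding=kernel_size // 2)

    def forward(self, x, state):
        # x: (B, C_in, H, W); state: (h, c), each (B, C_hidden, H, W)
        h, c = state
        i, f, g, o = torch.chunk(self.gates(torch.cat([x, h], dim=1)), 4, dim=1)
        i, f, o = torch.sigmoid(i), torch.sigmoid(f), torch.sigmoid(o)
        c = f * c + i * torch.tanh(g)
        h = o * torch.tanh(c)
        return h, c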

Contact

If you have any questions, please email Eunji Chong at [email protected].


attention-target-detection's Issues

Where is the demo data?

Hello,
as shown at line 21 of demo.py, running the demo requires a data/ directory. Where can it be downloaded?

parser.add_argument('--image_dir', type=str, help='images', default='data/demo/frames')
parser.add_argument('--head', type=str, help='head bounding boxes', default='data/demo/person1.txt')

No such file or directory: 'model_demo.pt'

I downloaded the code to my machine and tried to run it from PyCharm (with conda Python, after installing torch, on Windows without a GPU), but when running demo.py I get:

No such file or directory: 'model_demo.pt'

Where can I get that file? Is it possible to run it on Windows this way, without a GPU? Will it work?

Got an error while running train_on_videoatttarget.py

python train_on_videoatttarget.py
Loading Data
Constructing model
Loading weights
/home/anaconda3/envs/myenv/lib/python3.5/site-packages/torch/nn/functional.py:52: UserWarning: size_average and reduce args will be deprecated, please use reduction='none' instead.
warnings.warn(warning.format(ret))
Training in progress ...
(myenv) ge@Z370-AORUS-Ultra-Gaming:~/Documents/attention-target-detection-master$

Draw the sample image

How do you draw the visualization shown in the example image? After computing the evaluation metrics, I would like to draw the predicted gaze direction on top of the image.

Only one annotation per test image, and different evaluation for the GazeFollow and VideoAttentionTarget datasets

Dear authors,

Thanks for sharing your code and data.

I found that:

  1. Although it is claimed that two annotations are available for each test image, the released annotations seem to contain only one annotation per image. May I ask where we can download the full annotations for your test set?
  2. In your released code, you use different methods to compute AUC for the GazeFollow and VideoAttentionTarget datasets. On GazeFollow you use the original annotations (10 points) to build the multi-hot vector. On your own dataset, you place a Gaussian on top of the single annotation, set all values greater than 0 to 1, and then use that binary map as the multi-hot vector (see the sketch after this message). However, your paper only defines AUC once. Could you please confirm whether two different versions of AUC are used in the paper?

Cheers,
Yu
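The two AUC variants described in point 2 above can be sketched as follows, assuming scikit-learn and SciPy; this is only an illustration of the described procedure, not the repository's actual evaluation code:

import numpy as np
from scipy.ndimage import gaussian_filter
from sklearn.metrics import roc_auc_score

def auc_gazefollow(pred_heatmap, gt_points):
    # GazeFollow-style: multi-hot map built from the ~10 annotated gaze points
    # (points given in the heatmap's coordinate frame; pred_heatmap is assumed
    # to already be resized to the evaluation resolution).
    multi_hot = np.zeros_like(pred_heatmap)
    for x, y in gt_points:
        multi_hot[int(y), int(x)] = 1
    return roc_auc_score(multi_hot.flatten(), pred_heatmap.flatten())

def auc_videoattentiontarget(pred_heatmap, gt_point, sigma=3):
    # VideoAttentionTarget-style: place a Gaussian on the single annotation,
    # then binarize everything greater than 0 to form the multi-hot map.
    multi_hot = np.zeros_like(pred_heatmap)
    multi_hot[int(gt_point[1]), int(gt_point[0])] = 1
    multi_hot = gaussian_filter(multi_hot, sigma=sigma)
    multi_hot = (multi_hot > 0).astype(np.float32)
    return roc_auc_score(multi_hot.flatten(), pred_heatmap.flatten())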

How are the initial weights for training obtained?

Hello,

Thanks for the great work!
From the training code for GazeFollow and VideoAttentionTarget, I see the models are initialized with initial_weights_for_spatial_training.pt / initial_weights_for_temporal_training.pt. I see in your paper that for training on VideoAttentionTarget you only trained the layers after the encoder, so I think initial_weights_for_temporal_training.pt contains the weights obtained after training on GazeFollow, is that correct? But the spatial model for training on GazeFollow is also initialized with initial_weights_for_spatial_training.pt. How did you obtain those initial weights? Do they contain the weights of a pretrained ResNet-50 for the scene/head branches?

Thank you very much.
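As general background for the question above: one common way to initialize only part of a model from ImageNet-pretrained ResNet-50 weights is a partial state-dict load, sketched below with generic PyTorch/torchvision calls. The prefix name is a hypothetical placeholder, and this is not the authors' actual initialization code.

import torchvision

def load_resnet50_into_backbone(model, prefix="scene_backbone."):
    # `prefix` is hypothetical: it should match the name of the sub-module in
    # your model whose layers mirror ResNet-50's layer names.
    resnet_weights = torchvision.models.resnet50(pretrained=True).state_dict()
    own_state = model.state_dict()
    # Copy only parameters that exist in the model with matching shapes,
    # leaving everything else (deconvolution head, ConvLSTM, etc.) untouched.
    compatible = {prefix + k: v for k, v in resnet_weights.items()
                  if prefix + k in own_state and v.shape == own_state[prefix + k].shape}
    own_state.update(compatible)
    model.load_state_dict(own_state)
    return model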
