
Comments (3)

DavideA commented on June 3, 2024

Hi @Amakri1020 and thank you for your interest.

Yes, in order to retrain the network, you should unroll all the .avi sequences (video_garmin.avi and video_saliency.avi) into frames. The code assumes the following structure:

<data_folder>/<sequence:02d>/frames/<frame_index:06d>.jpg --> for input frames
<data_folder>/<sequence:02d>/saliency_fix/<frame_index:06d>.png --> for fixation maps
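
For illustration only, the unrolling step could be sketched as below. This is not code from the repository: `frame_path` and `unroll_video` are hypothetical helper names, OpenCV (`cv2`) is assumed to be installed, and starting the frame index at 0 is a guess that should be checked against what the training code expects.

```python
from pathlib import Path


def frame_path(data_folder, sequence, frame_index, kind="frames"):
    """Build a path following the layout described above.

    kind is "frames" (input .jpg) or "saliency_fix" (fixation map .png).
    """
    extensions = {"frames": "jpg", "saliency_fix": "png"}
    if kind not in extensions:
        raise ValueError(f"unknown kind: {kind!r}")
    return (Path(data_folder) / f"{sequence:02d}" / kind
            / f"{frame_index:06d}.{extensions[kind]}")


def unroll_video(video_file, data_folder, sequence, kind="frames", first_index=0):
    """Write every frame of video_file into the layout; return the frame count.

    Assumptions: OpenCV is available, and first_index=0 matches the
    indexing used by the training code (verify before relying on it).
    """
    import cv2  # imported lazily so frame_path works without OpenCV

    capture = cv2.VideoCapture(str(video_file))
    if not capture.isOpened():
        raise IOError(f"cannot open {video_file}")
    count = 0
    try:
        while True:
            ok, frame = capture.read()
            if not ok:  # end of stream (or a decoding failure)
                break
            target = frame_path(data_folder, sequence, first_index + count, kind)
            target.parent.mkdir(parents=True, exist_ok=True)
            if not cv2.imwrite(str(target), frame):
                raise IOError(f"cannot write {target}")
            count += 1
    finally:
        capture.release()
    return count
```

Under these assumptions, a call such as `unroll_video("video_garmin.avi", data_folder, 1)` would fill the `frames` subfolder of sequence 01, and the same call with `"video_saliency.avi"` and `kind="saliency_fix"` would fill the fixation maps. Where the .avi files live, and whether the fixation maps need to be converted to single-channel images first, depends on your copy of the dataset and on the loading code.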

As mentioned in this issue, you should comment out the lines of code that try to load annotations from the saliency subfolder (saliency_fix is the one to use).

Let me know if this helps,
Best,
D

from dreyeve.

Amakri1020 commented on June 3, 2024

Thanks for the quick response, that is helpful!

I am also curious if there were instances where the driver looked at something out of the FoV of the camera and if so, how did you deal with these cases?


DavideA commented on June 3, 2024

That's an interesting question :)

It is likely that during the recording a driver took quick peeks outside the FoV (e.g., looking at side mirrors).
Anyway, the effect of rapid shifts in attention is ameliorated by the fixation map construction procedure. Indeed, as mentioned in the journal paper, such a procedure involves a temporal aggregation of fixation points to build a single fixation map.
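
To make the idea of temporal aggregation concrete, here is a toy sketch. It is not the procedure from the journal paper: the Gaussian spread, the summation, the normalisation, and the choice to skip points outside the image are all assumptions made for illustration, and `fixation_map` is a hypothetical name.

```python
import math


def fixation_map(fixations_per_frame, height, width, sigma=2.0):
    """Aggregate fixation points from a window of frames into one map.

    fixations_per_frame holds one list of (row, col) points per frame in
    the window. Each point inside the image adds a Gaussian bump; the
    summed map is then scaled so that its maximum is 1.
    """
    result = [[0.0] * width for _ in range(height)]
    for points in fixations_per_frame:
        for row, col in points:
            if not (0 <= row < height and 0 <= col < width):
                continue  # assumption: points outside the image are dropped
            for r in range(height):
                for c in range(width):
                    d2 = (r - row) ** 2 + (c - col) ** 2
                    result[r][c] += math.exp(-d2 / (2.0 * sigma ** 2))
    peak = max(max(line) for line in result)
    if peak > 0:
        result = [[value / peak for value in line] for line in result]
    return result


# Five frames: four fixations on the same spot, one brief glance elsewhere.
window = [[(5, 5)], [(5, 5)], [(1, 8)], [(5, 5)], [(5, 5)]]
aggregated = fixation_map(window, height=10, width=10)
```

In this toy window the repeated fixation at (5, 5) dominates the aggregated map, while the single glance at (1, 8) leaves only a weak response, which is the smoothing effect described above.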

Short answer: we don't deal with such cases.
I don't think these situations are encoded in fixation maps in the first place. You could still get them by looking at the ETG videos and the raw fixation recordings. But I'm not sure.

D

