Code Monkey home page Code Monkey logo

Comments (2)

GilLevi avatar GilLevi commented on May 29, 2024

Hi Raul,

Thank you for your interest in our work.

I can offer one intuition why this might happen: at the core, the problem is that distribution of the test data is different from the distribution of the train and the validation data. This happens since the distribution of each fold is different than the other folds (which is also confirmed by your test results).

I believe that this is because each folder contains different subjects (which introduces a lot of variation between the distribution of the folds, more than the variation of having one subject with different images of which are distributed across the folds). Let's take for example an "imagenet-like" problem of classifying dog breeds. If you have a lot of data samples you can easily split it into folds that have roughly the same data distribution by making sure that each image in one fold has a similar image at the other folds. In our case, do prevent overfitting we do not allow that a subject will appear in more than one fold, so by definition (and since the data is small) it's less likely that for an image in one fold you will find a corresponding image in another fold.

This might be a bit annoying when tuning the model, but it allows for better generalization.

from agegenderdeeplearning.

rauldiaz avatar rauldiaz commented on May 29, 2024

Hi @GilLevi ,

Thanks for explaining!

from agegenderdeeplearning.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.