akhti commented on August 31, 2024

For a fixed fold id, we split all tasks into train, val, and test sets. For model and hyperparameter selection, we trained on train, evaluated on val, and aggregated the results over 3 folds to select the best parameters.
The final results are reported on 10 different folds, i.e., the 3 existing ones plus 7 more. We could have trained the best config only on the 7 remaining folds, but since the dataset is relatively small, we trained 10 models from scratch on train+val using the best config and evaluated them on test. None of these models saw its test set until the final evaluation.
Note that one cannot take an ensemble of the best models from the train/val splits and apply it to test, because the test tasks of one model are training tasks of another.
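
To make the protocol concrete, here is a rough sketch of the two stages (selection on val over 3 folds, then retraining on train+val over 10 folds). It is not the actual PHYRE code: `make_fold`, `select_and_evaluate`, `train_and_score`, and the 60/20/20 split ratio are all hypothetical names and values chosen for illustration.

```python
import random
from statistics import mean


def make_fold(task_ids, fold_id):
    """Hypothetical stand-in for the real fold function.

    Returns disjoint (train, val, test) lists; the 60/20/20 ratio is an assumption.
    """
    rng = random.Random(fold_id)  # the fold id acts as the seed
    tasks = sorted(task_ids)
    rng.shuffle(tasks)
    n_train = int(0.6 * len(tasks))
    n_val = int(0.2 * len(tasks))
    return tasks[:n_train], tasks[n_train:n_train + n_val], tasks[n_train + n_val:]


def select_and_evaluate(task_ids, configs, train_and_score,
                        selection_folds=range(3), final_folds=range(10)):
    """train_and_score(config, fit_tasks, eval_tasks) -> float is supplied by the caller."""

    # Stage 1: train on train, score on val, average over the selection folds.
    def val_score(config):
        scores = []
        for fold_id in selection_folds:
            train, val, _ = make_fold(task_ids, fold_id)
            scores.append(train_and_score(config, train, val))
        return mean(scores)

    best = max(configs, key=val_score)

    # Stage 2: for every final fold, retrain from scratch on train+val
    # with the best config and touch test exactly once.
    test_scores = []
    for fold_id in final_folds:
        train, val, test = make_fold(task_ids, fold_id)
        test_scores.append(train_and_score(best, train + val, test))
    return best, test_scores
```

Because each fold reshuffles the same task pool, a task that is in test for one fold will typically be in train or val for another, which is why ensembling models across folds would leak test tasks.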

The function that builds the splits is called get_fold: https://phyre.ai/docs/evaluator.html

create_balanced_eval_set has nothing to do with splitting tasks into train and test. It takes a preselected set of task ids (e.g., a train set or a validation set) and a set of actions, and builds a subset of the Cartesian product of tasks and actions that contains an equal number of positive examples (actions that solve the task) and negative examples. Evaluating a model on a balanced subset gives a more meaningful estimate of progress, because most (task, action) pairs are negative.
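
The idea of a balanced subset can be sketched as follows. This is not the library's implementation or signature: `balanced_pairs` and the `solves` callback are hypothetical, and balancing globally (rather than per task) is an assumption.

```python
import random


def balanced_pairs(task_ids, actions, solves, seed=0):
    """Pick equally many positive and negative (task, action) pairs.

    solves(task, action) -> bool is a hypothetical oracle, e.g. a lookup
    into precomputed simulation results.
    """
    rng = random.Random(seed)
    positives, negatives = [], []
    # Walk the full Cartesian product of tasks and actions.
    for task in task_ids:
        for action in actions:
            pair = (task, action)
            if solves(task, action):
                positives.append(pair)
            else:
                negatives.append(pair)
    # Positives are usually the rare class, so they bound the subset size.
    n = min(len(positives), len(negatives))
    return rng.sample(positives, n), rng.sample(negatives, n)
```

On such a subset a trivial "always negative" predictor scores 50% accuracy instead of close to 100%, so improvements in the model are easier to see.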

from phyre.
