
irdm-collab-filtering's People

Contributors

avishkar58


irdm-collab-filtering's Issues

Split data into train/validate and test

Currently only the first 10,000 lines are used, as a single training set. All of the data needs to be split into training, validation and test sets (in appropriate proportions).
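A minimal sketch of one way to do the split, assuming each playlist is a list of track IDs held in memory. The function name `split_dataset`, the 80/10/10 proportions and the fixed seed are illustrative assumptions, not something taken from the repository.

```python
import random


def split_dataset(items, train_frac=0.8, val_frac=0.1, seed=0):
    """Shuffle items and split them into train, validation and test lists.

    The fractions are hypothetical defaults; the remainder after the
    train and validation portions goes to the test set.
    """
    if train_frac <= 0 or val_frac < 0 or train_frac + val_frac >= 1:
        raise ValueError("fractions must leave a non-empty share for the test set")

    shuffled = list(items)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)

    n_train = int(len(shuffled) * train_frac)
    n_val = int(len(shuffled) * val_frac)

    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test


# Toy data standing in for real playlists.
playlists = [[i, i + 1, i + 2] for i in range(100)]
train, val, test = split_dataset(playlists)
print(len(train), len(val), len(test))
```

Splitting at the playlist level (rather than by line) keeps every playlist entirely inside one of the three sets, which avoids leaking parts of a test playlist into training.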

Save/restore trained model

Add the ability to save and restore a trained model, so that training and evaluation can be run separately.
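A framework-agnostic sketch of the idea, assuming the trained parameters can be collected into a plain dictionary. The helper names `save_model` and `load_model` and the parameter layout are hypothetical; a TensorFlow model would more likely use the framework's own checkpointing rather than `pickle`.

```python
import os
import pickle
import tempfile


def save_model(params, path):
    """Write a dictionary of trained parameters to disk."""
    with open(path, "wb") as f:
        pickle.dump(params, f)


def load_model(path):
    """Read back a dictionary of parameters written by save_model."""
    with open(path, "rb") as f:
        return pickle.load(f)


# Hypothetical parameters standing in for real trained weights.
trained = {"hidden_size": 2, "weights": [[0.1, 0.2], [0.3, 0.4]]}

with tempfile.TemporaryDirectory() as tmp:
    checkpoint = os.path.join(tmp, "model.pkl")
    save_model(trained, checkpoint)      # end of the training script
    restored = load_model(checkpoint)    # start of the evaluation script

print(restored == trained)
```

With the two steps separated like this, the evaluation script only needs the checkpoint file and never has to repeat training.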

Rewrite deepcf.py

Currently using a slightly adapted version of the TensorFlow PTB example.

Grid search over hyperparameters

Determine the hyperparameters that give the best evaluation metrics:

  • Sequence length
  • Hidden size
  • Number of layers
  • Number of epochs
  • Data splitting methodology
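The search over the hyperparameters listed above could be sketched as an exhaustive loop over every combination. Everything here is an assumption for illustration: the function `grid_search`, the candidate values in `param_grid`, and the stand-in `dummy_evaluate`, which would be replaced by a real train-and-validate run that returns a score where higher is better.

```python
import itertools


def grid_search(param_grid, evaluate):
    """Try every combination in param_grid and return the best one.

    param_grid maps a hyperparameter name to a list of candidate values.
    evaluate takes a dict of hyperparameters and returns a score
    (higher is better).
    """
    names = sorted(param_grid)
    best_params, best_score = None, None
    for values in itertools.product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = evaluate(params)
        if best_score is None or score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


# Hypothetical candidate values; the real ranges are not given in the issue.
param_grid = {
    "sequence_length": [10, 20],
    "hidden_size": [128, 256],
    "num_layers": [1, 2],
    "num_epochs": [5, 10],
}


def dummy_evaluate(params):
    """Placeholder metric: a real version would train and validate a model."""
    return -abs(params["hidden_size"] - 256) - params["num_layers"]


best_params, best_score = grid_search(param_grid, dummy_evaluate)
print(best_params, best_score)
```

The number of runs is the product of the list lengths (16 in this toy grid), so the cost grows quickly as more values are added.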

Dependency issue

Hi,
Do you have an updated version of this? I am trying to run it on Google Colab, and it's giving lots of dependency errors.
Thanks,
Suti

Add batching to data

Currently the batch size is set to one, which results in the model learning very slowly. Add batching support, but take care to handle playlists of different lengths. See the Spotify blog post for information on how they dealt with this.
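One common way to batch variable-length sequences is to group playlists of similar length and pad each batch to its longest member. The sketch below shows that approach; it is not necessarily what the Spotify post describes, and `make_batches`, `pad_id` and the toy data are assumptions made for illustration.

```python
def make_batches(playlists, batch_size, pad_id=0):
    """Group playlists into padded batches.

    Playlists are sorted by length first so that each batch holds
    sequences of similar length and needs little padding. Each batch is
    a (padded_rows, true_lengths) pair; the true lengths let the model
    ignore the padded positions.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")

    ordered = sorted(playlists, key=len)
    batches = []
    for start in range(0, len(ordered), batch_size):
        chunk = ordered[start:start + batch_size]
        max_len = max(len(p) for p in chunk)
        lengths = [len(p) for p in chunk]
        padded = [list(p) + [pad_id] * (max_len - len(p)) for p in chunk]
        batches.append((padded, lengths))
    return batches


# Toy playlists of track IDs; 0 is assumed to be reserved for padding.
playlists = [[5, 6, 7, 8], [1], [2, 3], [9, 10, 11]]
for rows, lengths in make_batches(playlists, batch_size=2):
    print(rows, lengths)
```

The padding value must not collide with a real track ID, and the loss should be masked using the returned lengths so that padded positions do not contribute to training.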

Run on AWS/Legion

Not sure yet, but I assume we won't be able to run grid searches on our laptops, so we need to get the code running on AWS or the Legion/Emerald cluster to make this feasible.
