Code Monkey home page Code Monkey logo

Comments (4)

valentas1 avatar valentas1 commented on June 17, 2024

I worked on this in recent days. I believe that the current implementation is most suitable for this case (See Issue #16 Reduce memory of step 05 and the commit in the comments).
In that implementation, files are saved iteratively for every latent space and every repetition (it could be reduced even into smaller files if needed). Also, it loads the files with np.memmap that doesn't read the entire file into memory and instead creates a memory map to files on disk. It worked without errors on my Laptop.
However, when I tried the former version that appends everything to one big file (here without saving the files and instead saving the parameters of trained models), my PC was already giving memory errors during the processing (before reaching saving the files).
I am aware that the current implementation saves files with huge sizes and it is the most significant disadvantage of this approach. Therefore, as Ricardo said I should print the notification during the training and add this information to the description in README. However, I believe the current implementation is more scalable to larger datasets which would require even more CPU for processing these large data frames.
Let me know what you think about it.

from move.

simonrasmu avatar simonrasmu commented on June 17, 2024

Ok!

from move.

enryH avatar enryH commented on June 17, 2024

I know this opens a new front, but I want to put it up for discussion: Maybe a structured way of saving data could be a good idea: I attended a tutorial on SQLAlchemy (previous recording) yesterday evening or we have a look at what others (alphapept) do with HDF5 to handle large data and many files...

from move.

simonrasmu avatar simonrasmu commented on June 17, 2024

HDF5 could be something we can implement later, when more of the core functionality is updated. Closing for now

from move.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.