Code Monkey home page Code Monkey logo

fine-tuning-gpt2's People

Contributors

itsuncheng avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

fine-tuning-gpt2's Issues

[Request] Create all necessary files to resume the training for each checkpoint instead of doing it at the end.

Hi there, I recently found this repository and your post explaining how everything works and I have to say its pretty good but on your post you mention that we can continue training from the last checkpoint but it only works if first you run it until it reaches the first training epoch or if it finishes processing everything. I tried running the code on a huge dataset with billions of parameters and I could not stop it and resume it because it would take too long before it reached the first training epoch, once I stopped it I couldn't resume it as there were some files that were not created by the run_language_modeling.py script, I also tested the code with a smaller dataset and there I was able to resume it as the tokenizer_config.json, vocab.json and other files needed to resume the training were created after it ended. I think it would be a lot more useful if all those files needed to resume the training were created into each checkpoint folder, I know it will probably make the checkpoint use more space but that's should not be a problem as it won't be that much of extra space we will be using.

python2 or python3

This is only support of python2, right? If I want to use python3, how do it? Many thanks.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.