Dreambooth-Anything

A repository that consolidates Stable Diffusion fine-tuning scripts into a single training hub. Train inpainting, depth, v1+, v2+, image variations, image colorization, whatever. Optimizations such as 8-bit Adam and xformers make training faster and more memory-efficient.

Features

  • Train depth
  • Train inpaint
  • Train on custom image input (image latent concatenated to the noise latent) *idea from Justin Pinkney
  • Train on custom conditionings (for example, image embeddings instead of text) *idea from Justin Pinkney
  • Use filenames as prompts
  • Use bitsandbytes (bnb) 8-bit Adam for more memory-efficient training
  • Use xformers for more memory-efficient training
  • Mixed precision (fp16/bf16) training
  • Prompt shuffling (split on ',' or a custom string and shuffle based on a given probability)
  • Train depth with custom depth images
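
The "filenames as prompts" and "prompt shuffling" features above can be sketched in a few lines of plain Python. This is only an illustration of the idea, not the code used in this repo: the function names `prompt_from_filename` and `shuffle_prompt`, their parameters, and the choice to re-join shuffled parts with the separator plus a space are all assumptions made for the example.

```python
import random
from pathlib import Path


def prompt_from_filename(path: str) -> str:
    """Hypothetical helper: use the file name without its extension as the prompt."""
    return Path(path).stem


def shuffle_prompt(prompt: str, separator: str = ",", probability: float = 0.5, rng=None) -> str:
    """Hypothetical helper: with the given probability, split the prompt on
    `separator`, shuffle the pieces, and join them back together."""
    rng = rng or random
    # Leave the prompt untouched unless the random draw falls below `probability`.
    if rng.random() >= probability:
        return prompt
    # Split, trim whitespace around each piece, and drop empty pieces.
    parts = [part.strip() for part in prompt.split(separator)]
    parts = [part for part in parts if part]
    rng.shuffle(parts)
    # Assumption: re-join with the separator followed by a single space.
    return f"{separator.strip()} ".join(parts)


if __name__ == "__main__":
    prompt = prompt_from_filename("data/a photo of a dog, sitting, outdoors.png")
    print(prompt)
    print(shuffle_prompt(prompt, separator=",", probability=1.0))
```

Passing a seeded `random.Random` instance as `rng` makes the shuffling reproducible, which can help when debugging a data loader.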

Contributing

Pull requests, discussions, requests, suggestions, and critiques are all welcome! :)

Disclaimer

This repo combines code from a number of other repos with my own code and my edits to existing scripts. I will do my best to give credit where credit is due in the form of comments, licenses, a shout-out in the README, etc. If I have missed giving anyone credit or including a license, please email me at [email protected] and I will fix it!

Shout-Outs

  • Huge thanks to Hugging Face for the diffusers library that makes most of this code possible
  • Huge thanks to the creators of Stable Diffusion for building the actual diffusion model and open-sourcing it
  • Thanks to epitaque for depth training
  • Another thanks to Hugging Face for inpainting training
  • Shoutout to EveryDream for the Windows venv setup and bnb patch
  • Shoutout to Justin Pinkney/Lambda Labs for research into training with different inputs

Contact

Reach out to [email protected] with any requests, questions, comments, suggestions, or concerns.

Other Training Repos

If you're interested in training text-to-audio latent diffusion, check out https://github.com/serp-co/ai-text-to-audio-latent-diffusion

Contributors

  • francislabountyjr
  • levi
