rembg trainer

This code allows you to easily train U²-Net model in ONNX format to use with rembg tool.

This work is based on the U²-Net repo, which is under Apache licence. The derivative work is licensed under MIT; do as you please with it.

Fork notes

I have created this fork to share my training code that I used to create the models in skin_segmentation.

I have optimized the code to work for a 12 GB NVIDIA GPU (Ampere Architecture)
I have probably broken something for your platform
You will need a 20 series (Turing) or greater NVIDIA GPU to take advantage of my changes

Parameters at 12GB:

MAIN_SIZE = 1024 lets you use a batch size of 2
- The original code would only allow you to achieve a batch size of 1
MAIN_SIZE = 320 (base U²-Net resolution) lets you use a batch size of 20

Augmentation:

Dataset augmentation takes a round-robin approach to promote even model fitting
- This has currently broken saving. I do not plan on fixing this.
- ONNX models are saved every epoch (bug) and checkpoints as specified.

Performance

A couple of notes on performance:

~~Default parameters are fine-tuned for maximum performance on systems with 32gb of processing memory, like the Apple M1 Pro. Adjust accordingly.~~
~~Computations are performed in float32, because float16 support on Metal is a bit undercooked at the moment.~~
If this is your first time using CUDA on Windows, you'd have to install CUDA Toolkit.
For CUDA, this code uses half-precision calculations for increased performance. See "Fork notes" for hardware requirements.
For acceleration on AMD GPUs, please refer to the installation guide of AMD ROCm platform. No code changes will be required.

If the training is interrupted for any reason, don't worry — the program saves its state regularly, allowing you to resume from where you left off. Frequency of saving can be adjusted.

If you feel like optimizing the dataloader performance, I'm sure the upstream author would appreciate that. This training code is very CPU intensive.

Fancy a go?

Download the latest release
Install requirements.txt
Put your images into images folder
Put their masks into masks folder; or see below
Launch python3 u2net_train.py --help for more details on supported command line flags
Launch the script with your desired configuration
Go grab yourself a nice latte and wait…… and wait…
Once you've had your fill of waiting, here's how you use the resulting model with rembg:

rembg p -w input output -m u2net_custom -x '{"model_path": "/saved_models/u2net/27.onnx"}'
# input — folder with images to have their backgrounds removed
# output — folder for resulting images processed with custom model
# adjust path(s) as necessary!

You should note that this code does not normalize the training input, which rembg expects.

Mask extraction

If you already have a bunch of images with a removed background, then you can create masks off them using the provided alpha.py script. Create a directory called clean, put your pngs there, and launch the script.

But fair warning mate: the script is very CPU-heavy. Oh, and you'll need the ImageMagick tool installed and present in your PATH.

So, at the end of the day, you will end up with the following folder structure:

images — source images, will be needed for training
masks — required for training, to teach model where the background was
clean — images with removed background, to extract masks (they're not used for actual training)

Leave your mark 👉👈🥺

Buy the original author ~~a coffee~~ an alcohol-free cider here

If my modifications were particularly useful, you can send me some coin here

samhaswon / rembg-trainer-cuda Goto Github PK

rembg-trainer-cuda's Introduction

rembg trainer

Fork notes

Performance

Fancy a go?

Mask extraction

Leave your mark 👉👈🥺

rembg-trainer-cuda's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent