Code Monkey home page Code Monkey logo

diffuseit's Introduction

DiffuseIT

Official repository of "Diffusion-based Image Translation using Disentangled Style and Content Representation"

Environment

Pytorch 1.9.0, Python 3.9

$ conda create --name DiffuseIT python=3.9
$ conda activate DiffuseIT
$ pip install ftfy regex matplotlib lpips kornia opencv-python torch==1.9.0+cu111 torchvision==0.10.0+cu111 -f https://download.pytorch.org/whl/torch_stable.html
$ pip install color-matcher
$ pip install git+https://github.com/openai/CLIP.git

Model download

To generate images, please download the pre-trained diffusion model

imagenet 256x256 LINK

FFHQ 256x256 LINK

download the model into ./checkpoints folder

For face identity loss when using FFHQ pre-trained model, download pre-trained ArcFace model LINK

save the model into ./id_model

Text-guided Image translation

We provide Colab Demo for Text-guided Image translation Open In Colab

python main.py -p "Black Leopard" -s "Lion" -i "input_example/lion1.jpg" --output_path "./outputs/output_leopard" 
--use_range_restart --use_noise_aug_all --regularize_content

To to further regularize content when CLIP loss is extremely low, activate --regularize_content

To use noise augmented images for our ViT losses, activate --use_noise_aug_all

To use progressively increasing our contrastive loss, activate --use_prog_contrast

To restart the whole process with high rgb regularize loss, activate --use_range_restart

To use FFHQ pre-trained model, activate --use_ffhq

For memory saving, we can use single CLIP model with --clip_models 'ViT-B/32'

Image-guided Image translation

We provide Colab Demo for Image-guided Image translation Open In Colab

python main.py -i "input_example/reptile1.jpg"  --output_path "./outputs/output_reptile" 
-tg "input_example/reptile2.jpg" --use_range_restart --diff_iter 100 --timestep_respacing 200 --skip_timesteps 80 
--use_colormatch --use_noise_aug_all

To remove the color matching, deactivate --use_colormatch

Our source code rely on Blended-diffusion, guided-diffusion, flexit, splicing vit

diffuseit's People

Contributors

anon294384 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.