zsyoaoa / difface Goto Github PK

View Code? Open in Web Editor NEW

608.0 20.0 39.0 18.14 MB

DifFace: Blind Face Restoration with Diffused Error Contraction (TPAMI, 2024)

License: Other

Python 93.09% C++ 2.78% Cuda 4.14%

difface's Introduction

DifFace: Blind Face Restoration with Diffused Error Contraction (TPAMI, 2024)

Zongsheng Yue, Chen Change Loy

Paper

⭐ If DifFace is helpful to your images or projects, please help star this repo. Thanks! 🤗

Update

2023.12.11: Add the code for image inpainting.
2022.12.19: Add Colab demo .
2022.12.17: Add the .
2022.12.13: Create this repo.

Applications

👉 Old Photo Enhancement

👉 Face Restoration

👉 Face Inpainting

Requirements

A suitable conda environment named DifFace can be created and activated with:

conda env create -f environment.yaml
conda activate DifFace

Inference

👦 Face image restoration (cropped and aligned)

python inference_difface.py -i [image folder/image path] -o [result folder] --task restoration --eta 0.5 --aligned --use_fp16

Note that the hyper-parameter eta controls the fidelity-realness trade-off, you can freely adjust it between 0.0 and 1.0.

👮 Whole image enhancement

python inference_difface.py -i [image folder/image path] -o [result folder] --task restoration --eta 0.5 --use_fp16

👸 Face image inpainting

python inference_difface.py -i [image folder/image path] -o [result folder] --task inpainting --use_fp16

We assume that the masked area is filled with zeros in the low quality image. Based on such an assumption, the image mask is automatically deteced in our code.

Testing

To reproduce the results in our paper, please follow the following guidelines to prepare the testing data.

Download the FFHQ dataset, and resize them into size 512x512(or 256x256).

python scripts/big2small_face.py -i [Face folder(1024x1024)] -o [Saving folder(512x512)] --pch_size 512

Make the testing dataset for restoration

python scripts/prepare_testing_restoration.py -i [CelebA folder(512x512)] -o [Saving folder]

Make the testing dataset for inpainting

python scripts/prepare_testing_inpainting.py -i [CelebA folder(256x256)] -o [Saving folder]

Training

🐢 Configuration

Modify the data path in data.train and data.val according to your own settings.
Adjust the batch size based on your GPU devices.
- train.batchsize: [A, B] # A denotes the batch size for training, B denotes the batch size for validation
- train.microbatch: C # C denotes the batch size on each GPU, A = C * num_gpus * num_grad_accumulation

🐬 Train diffusion model with 8 GPUS

torchrun --standalone --nproc_per_node=8 --nnodes=1 main.py --cfg_path configs/training/diffsuion_ffhq512.yaml --save_dir [Logging Folder]

🐳 Train diffused estimator for restoration (SwinIR) with 4 GPUS

torchrun --standalone --nproc_per_node=4 --nnodes=1 main.py --cfg_path configs/training/swinir_ffhq512.yaml --save_dir [Logging Folder]

🐑 Train diffused estimator for restoration (LaMa) with 4 GPUS

torchrun --standalone --nproc_per_node=4 --nnodes=1 main.py --cfg_path configs/training/estimator_lama_inpainting.yaml --save_dir [Logging Folder]

License

This project is licensed under NTU S-Lab License 1.0. Redistribution and use should follow this license.

Acknowledgement

This project is based on Improved Diffusion Model. Some codes are brought from BasicSR, YOLOv5-face, and FaceXLib. We also adopt Real-ESRGAN to support background image enhancement. Thanks for their awesome works.

Contact

If you have any questions, please feel free to contact me via [email protected].

difface's People

Contributors

Stargazers

Watchers

Forkers

monster-999 jackzhousz ip-enhancement overbestfitting geeksloth nopeanuts achirus-code hadryan lesterzoeyxu joskid eltociear ayo-faks yerang823 cedro3 espersonnel kepengxu neuralnetworklab oboje peterzs chhaviilli jaywu109 tony109060581 wyhuai rv-chittersu wen521 yanndd1 runngezhang aijike mimonasse baris-unver milleniums kangkzeng ismeyueyue dl-diffusion madoibito80 adambear buiduchanh zhangziliang04

difface's Issues

Could you release your metric calculation script please?

Your work is awesome! I have tested with your pre-trained model on CelebaTest and got amazing visual results.

However, I noticed that your paper and the VQFR's paper both provide metrics for testing VQFR on CelebaTest, and these metrics are different.

Thus, I used your model to infer on the CelebaTest dataset provided by VQFR link here and use the calculation script provided by VQFR link here, the unexpected results were obtained.

Therefore, I am very curious about the quantitative metrics mentioned in the paper. How do you calculate the metrics? Could you release your metric calculation script please?

RuntimeError: CUDA error: invalid device ordinal CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

I met this problem when I tried to run the command CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --standalone --nproc_per_node=4 --nnodes=1 main_diffusion.py --gpu_id 0123 --cfg_path configs/training/diffusion_ffhq512.yaml --save_dir myfolder. Could someone help me solve it?

About ckpts

Hi, you have done a nice work! I am interested in your paper. Could you please release your pretrain diffusion ckpts on FFHQ?

About the guidance function of estimator

Hi, I am interested in your nice work, but after reading your paper, I donot understand the way how estimator work.
Therefore, I read your code, and I suppose that the estimator(swinIR) is used to predict the x0 by low quality image y0, and add noise like diffusion forward process do?

DifFace/sampler.py

Line 247 in 35d91a1

im_hq = self.model_ir(y0)

im_hq = self.model_ir(y0)

and

DifFace/sampler.py

Line 259 in 35d91a1

yt = self.diffusion.q_sample(

yt = self.diffusion.q_sample(
             x_start=post_fun(im_hq),
             t=torch.tensor([start_timesteps,]*im_hq.shape[0], device=device),
)

finally sample by ddpm

DifFace/sampler.py

Line 279 in 35d91a1

sample = self.diffusion.p_sample_loop(

sample = self.diffusion.p_sample_loop(
                    self.model,
                    shape=yt.shape,
                    noise=yt,
                    start_timesteps=start_timesteps,
                    clip_denoised=True,
                    denoised_fn=None,
                    model_kwargs=None,
                    device=None,
                    progress=False,
 )

The script I run is as followings. Did I miss any details? ^_^

python inference_difface.py --aligned --in_path testdata/cropped_faces --out_path result/testdata --gpu_id 2

adjust_lr() missing 1 required argument

adjust_lr() missing 1 required positional argument: 'ii'adjust_lr() missing 1 required positional argument: 'ii'.

In trainer_py trainer.py", line 255, in train self.adjust_lr()self.adjust_lr()

Please correct me if I am wrong? What to put there? I am trying to train model with batch size [8, 2]

使用现有模型，测试，返回错误，如何解决？

python.exe D:\develop\DifFace\inference_difface.py --in_path D:\develop\DifFace\testdata\whole_imgs --out_path D:\data --gpu_id 0
Setting random seed 20000
Loading from ./weights/diffusion/iddpm_ffhq512_ema500000.pth...
Loaded Done
C:\Users***\AppData\Local\Programs\Python\Python39\lib\site-packages\torch\functional.py:504: UserWarning: torch.meshgrid: in an upcoming release, it will be required to pass the indexing argument. (Triggered internally at C:\actions-runner_work\pytorch\pytorch\builder\windows\pytorch\aten\src\ATen\native\TensorShape.cpp:3191.)
return _VF.meshgrid(tensors, kwargs) # type: ignore[attr-defined]
Loading from ./weights/SwinIR/General_Face_ffhq512.pth...
Loaded Done
C:\Users*****\AppData\Local\Programs\Python\Python39\lib\site-packages\torchvision\models_utils.py:208: UserWarning: The parameter 'pretrained' is deprecated since 0.13 and may be removed in the future, please use 'weights' instead.
warnings.warn(
C:\Users*\AppData\Local\Programs\Python\Python39\lib\site-packages\torchvision\models_utils.py:223: UserWarning: Arguments other than a weight enum or None for 'weights' are deprecated since 0.13 and may be removed in the future. The current behavior is equivalent to passing weights=None.
warnings.warn(msg)
Traceback (most recent call last):
File "D:\develop\DifFace\inference_difface.py", line 160, in
main()
File "D:\develop\DifFace\inference_difface.py", line 137, in main
image_restored, face_restored, face_cropped = sampler_dist.sample_func_bfr_unaligned(
File "D:\develop\DifFace\sampler.py", line 368, in sample_func_bfr_unaligned
restored_faces = _process_batch(self.face_helper.cropped_faces)
File "D:\develop\DifFace\sampler.py", line 332, in _process_batch
restored_faces = self.sample_func_ir_aligned(
File "D:\develop\DifFace\sampler.py", line 279, in sample_func_ir_aligned
sample = self.diffusion.p_sample_loop(
File "D:\develop\DifFace\models\gaussian_diffusion.py", line 428, in p_sample_loop
for sample in self.p_sample_loop_progressive(
File "D:\develop\DifFace\models\gaussian_diffusion.py", line 484, in p_sample_loop_progressive
out = self.p_sample(
File "D:\develop\DifFace\models\gaussian_diffusion.py", line 383, in p_sample
out = self.p_mean_variance(
File "D:\develop\DifFace\models\respace.py", line 88, in p_mean_variance
return super().p_mean_variance(self._wrap_model(model), *args, **kwargs)
File "D:\develop\DifFace\models\gaussian_diffusion.py", line 278, in p_mean_variance
min_log = _extract_into_tensor(
File "D:\develop\DifFace\models\gaussian_diffusion.py", line 105, in _extract_into_tensor
res = th.from_numpy(arr).to(device=timesteps.device)[timesteps].float()
KeyboardInterrupt

SwinIR path error

Thank you for your great Job!!

You updated this repo in 2023/12/12 maybe, so there is no links for SwinIR model
https://github.com/zsyOAOA/DifFace/releases/download/V1.0/General_Face_ffhq512.pth

and now, I downloaded it(below link) from your release page instead of before link
https://github.com/zsyOAOA/DifFace/releases/download/V1.0/swinir_restoration512_L1.pth
but, I cant load the model for running inference.py

Please provide the./configs/inpainting_debug.yaml file

This file is not available for model training

Can't load diffused estimator (SwinIR) trained model

I trained SwinIR and saved the ckpts .
i'm trying to inference the new model after i changed the ckpts paths in "iddpm_ffhq512_swinir.yaml"
i have issue in loading new model state_dict

" RuntimeError: Error(s) in loading state_dict for SwinIR:
Missing key(s) in state_dict: "conv_first.1.weight", "conv_first .... "

note : " inference code is worked with pretrained "model /weights/SwinIR/General_Face_ffhq512.pth" .

training time

How long it takes to train the entire network?

self.loss_fun in trainer.py is not defined.

Thank for sharing your great work!! : )

We have tried to training DifFace with SwinIR diffused estimator and got an error that self.loss_fun(line 492, 495 in trainer.py) is not defined.

        if last_batch or self.num_gpus <= 1:
            loss = self.loss_fun(hq_pred, micro_data['gt']) / hq_pred.shape[0]
        else:
            with self.model.no_sync():
                loss = self.loss_fun(hq_pred, micro_data['gt']) / hq_pred.shape[0]

=======================
Please check these lines.

cv2.error: OpenCV(4.8.0) :-1: error: (-5:Bad argument) in function 'imencode' > Overload resolution failed: > - Can't parse 'params'. Sequence item with index 1 has a wrong type > - Can't parse 'params'. Sequence item with index 1 has a wrong type

I met this problem when I tried to run the command CUDA_VISIBLE_DEVICES=0 python main_sr.py --cfg_path configs/training/swinir_ffhq512.yaml --save_dir save_dir_1 Could someone help me solve it?

Does adjustment of lr work?

Have you ever tried to decrease the lr(the default value is 1e-4) for the training procedure of diffusion model?
Since the loss converges very quick(less than 1W iters), but the lr keeps constant during the whole training time.

Train swinIR face restoration using FFHQ on 1024x1024 resolution

@zsyOAOA how to train FFHQ on 1024x1024 resolution. I have checked the swinIR_ffhq512 the img_size : 64 (at line8).

It's not convenient to change the cuda device when running inferrence script.

Could you add a device number to the command line parameters?

module diffusers has no attribute DifFacePipeline

followed the example code from the huggingface model card but got the error

then tried the below with the same error

from diffusers import DiffusionPipeline
model_id = "OAOA/DifFace"

# load model and scheduler
difface = DiffusionPipeline.from_pretrained(model_id)

i'm using diffusers==0.25.0

retrain swinIR based on pretrained model, but met ERROR [KeyError: 'state_dict']

Number of parameters: 15.79M
=> Loaded checkpoint /home/zhuchao/code/diface/code/diface/weights/SwinIR/General_Face_ffhq512.pth
Traceback (most recent call last):
File "main_sr.py", line 32, in
trainer = Trainer(configs)
File "/home/zhuchao/code/diface/code/diface/trainer.py", line 285, in init
super().init(configs)
File "/home/zhuchao/code/diface/code/diface/trainer.py", line 67, in init
self.resume_from_ckpt()
File "/home/zhuchao/code/diface/code/diface/trainer.py", line 145, in resume_from_ckpt
util_net.reload_model(self.model, ckpt['state_dict'])
KeyError: 'state_dict'
I want to finetune swinIR on a small face datasets, but I met this error.
It seems the General_Face_ffhq512.pth not save state_dict and iters_start，I don't know how to fix it.
Hope you can help me, THANKS!

Model weight preservation in the Train diffused estimator (SwinIR) stage.

I retrained the Train diffused estimator (SwinIR) stage and found that the saved weights are dict_keys(['iter_start','log_step','log_step_img','state_dict']) and not SwinIR's weights, so the inference reports a load model error. Would like to ask if the code in main_sr.py about saving model weights is wrong? I'm looking forward to hearing from the author!

Can't run inference_difface.py

By running this command python inference_difface.py --in_path ~/Images/blur/blurred_1.png --out_path ~/Images/blur/unblurred I got error

Here the python command line stack Trace

Traceback (most recent call last):
  File "inference_difface.py", line 160, in <module>
    main()
  File "inference_difface.py", line 137, in main
    image_restored, face_restored, face_cropped = sampler_dist.sample_func_bfr_unaligned(
  File "~/programmation/DifFace/sampler.py", line 368, in sample_func_bfr_unaligned
    restored_faces = _process_batch(self.face_helper.cropped_faces)
  File "~/programmation/DifFace/sampler.py", line 328, in _process_batch
    cropped_face_t = np.stack(
  File "<__array_function__ internals>", line 180, in stack
  File "~/anaconda3/envs/DifFace/lib/python3.8/site-packages/numpy/core/shape_base.py", line 422, in stack
    raise ValueError('need at least one array to stack')
ValueError: need at least one array to stack

I have CUDA installed, here the output of nvidia-smi command

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.105.17   Driver Version: 525.105.17   CUDA Version: 12.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ...  Off  | 00000000:01:00.0 Off |                  N/A |
| N/A   42C    P8    N/A /  N/A |      4MiB /  4096MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A       952      G   /usr/lib/xorg/Xorg                  4MiB |
+-----------------------------------------------------------------------------+

training time

Hi, thanks for sharing your code! Could you please tell me what and how many GPUs you used and how long you trained for?Thanks!

zsyoaoa / difface Goto Github PK

difface's Introduction

DifFace: Blind Face Restoration with Diffused Error Contraction (TPAMI, 2024)

Update

Applications

👉 Old Photo Enhancement

👉 Face Restoration

👉 Face Inpainting

Requirements

Inference

👦 Face image restoration (cropped and aligned)

👮 Whole image enhancement

👸 Face image inpainting

Testing

Training

🐢 Configuration

🐬 Train diffusion model with 8 GPUS

🐳 Train diffused estimator for restoration (SwinIR) with 4 GPUS

🐑 Train diffused estimator for restoration (LaMa) with 4 GPUS

License

Acknowledgement

Contact

difface's People

Contributors

Stargazers

Watchers

Forkers

difface's Issues

We have tried to training DifFace with SwinIR diffused estimator and got an error that self.loss_fun(line 492, 495 in trainer.py) is not defined.

Recommend Projects

Recommend Topics

Recommend Org