
WSR

Towards Lighter and Faster: Learning Wavelets Progressively for Image Super-Resolution (accepted by ACM MM 2020) [PAPER]

This repository is the official PyTorch implementation of our proposed WSR. The code was developed by supercaoO (Huanrong Zhang) based on SRFBN_CVPR19. Future updates will be released in supercaoO/WSR first.

Introduction

We propose a lightweight and fast network that learns wavelet coefficients progressively for single image super-resolution (WSR). More specifically, the network comprises two main branches. One predicts the second-level low-frequency wavelet coefficients, and the other is designed in a recurrent way to predict the remaining wavelet coefficients at the first and second levels. Finally, an inverse wavelet transform is applied to reconstruct the SR image from these coefficients. In addition, we propose a deformable convolution kernel (side window) to construct the side-information multi-distillation block (S-IMDB), which is the basic unit of the recurrent blocks (RBs). Moreover, we train WSR with loss constraints in both the wavelet and spatial domains.

The RNN-based framework of our proposed 4× WSR. Notice that the two recurrent blocks (RBs) share the same set of weights. Details of our proposed S-IMDB can be found in the main paper.
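The reconstruction step above can be illustrated with a minimal sketch (not the authors' code): a one-level 2D Haar transform decomposes an image into low-frequency (LL) and high-frequency (LH, HL, HH) coefficients, and the inverse transform rebuilds the image from them. WSR predicts such coefficient sub-bands with a network and applies the inverse transform to obtain the SR image; plain Python lists stand in for tensors here.

```python
# Minimal one-level 2D Haar DWT and its inverse, for illustration only.
# The real implementation operates on multi-level coefficients predicted
# by the network, not on coefficients computed from the image itself.

def haar_dwt2(img):
    """One-level 2D Haar DWT of a 2D list with even height and width."""
    h, w = len(img), len(img[0])
    LL = [[0.0] * (w // 2) for _ in range(h // 2)]  # low-frequency band
    LH = [[0.0] * (w // 2) for _ in range(h // 2)]  # horizontal detail
    HL = [[0.0] * (w // 2) for _ in range(h // 2)]  # vertical detail
    HH = [[0.0] * (w // 2) for _ in range(h // 2)]  # diagonal detail
    for i in range(0, h, 2):
        for j in range(0, w, 2):
            a, b = img[i][j], img[i][j + 1]
            c, d = img[i + 1][j], img[i + 1][j + 1]
            LL[i // 2][j // 2] = (a + b + c + d) / 2.0
            LH[i // 2][j // 2] = (a - b + c - d) / 2.0
            HL[i // 2][j // 2] = (a + b - c - d) / 2.0
            HH[i // 2][j // 2] = (a - b - c + d) / 2.0
    return LL, LH, HL, HH

def haar_idwt2(LL, LH, HL, HH):
    """Inverse of haar_dwt2: rebuild the image from its coefficients."""
    h2, w2 = len(LL), len(LL[0])
    img = [[0.0] * (2 * w2) for _ in range(2 * h2)]
    for i in range(h2):
        for j in range(w2):
            ll, lh, hl, hh = LL[i][j], LH[i][j], HL[i][j], HH[i][j]
            img[2 * i][2 * j] = (ll + lh + hl + hh) / 2.0
            img[2 * i][2 * j + 1] = (ll - lh + hl - hh) / 2.0
            img[2 * i + 1][2 * j] = (ll + lh - hl - hh) / 2.0
            img[2 * i + 1][2 * j + 1] = (ll - lh - hl + hh) / 2.0
    return img
```

Because the Haar transform is orthogonal, `haar_idwt2(*haar_dwt2(img))` recovers `img` exactly; in WSR the inverse transform instead consumes coefficients predicted by the two branches.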

If you find our work useful in your research or publications, please consider citing:

@inproceedings{zhang2020wsr,
    author = {Zhang, Huanrong and Jin, Zhi and Tan, Xiaojun and Li, Xiying},
    title = {Towards Lighter and Faster: Learning Wavelets Progressively for Image Super-Resolution},
    booktitle = {Proceedings of the 28th ACM International Conference on Multimedia},
    year = {2020}
}

Contents

  1. Requirements
  2. Test
  3. Train
  4. Results
  5. Acknowledgements

Requirements

Test

Quick start

  1. Clone this repository and cd to WSR:

    git clone https://github.com/FVL2020/WSR.git
    cd WSR
  2. Check if the pre-trained model WSR_x4_BI.pth exists in ./models.

  3. Then, run the following command for evaluation on Set5:

    CUDA_VISIBLE_DEVICES=0 python test.py -opt options/test/test_WSR_Set5.json
  4. Finally, PSNR/SSIM values for Set5 are shown in your terminal, and the reconstructed images can be found in ./results/SR/BI.

Test on standard SR benchmark

  1. If you have cloned this repository, you can first download the SR benchmark datasets (Set5, Set14, B100, Urban100, and Manga109) from GoogleDrive (provided by SRFBN_CVPR19) or BaiduYun (code: p9pf).

  2. Run ./results/Prepare_TestData_HR_LR.m in Matlab to generate HR/LR images with BI degradation model.

  3. Edit ./options/test/test_WSR_x4_BI.json for your needs according to ./options/test/README.md.

  4. Then, run command:

    cd WSR
    CUDA_VISIBLE_DEVICES=0 python test.py -opt options/test/test_WSR_x4_BI.json
  5. Finally, PSNR/SSIM values are shown in your terminal, and the reconstructed images can be found in ./results/SR/BI. You can further evaluate the SR results using ./results/Evaluate_PSNR_SSIM.m.
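For a quick sanity check outside Matlab, PSNR can be computed in plain Python as below. This is a simplified sketch, not a replacement for the provided Evaluate_PSNR_SSIM.m script, which remains the reference for the reported numbers (the Matlab evaluation conventionally also crops borders and measures on the Y channel, which this sketch omits).

```python
import math

def psnr(hr, sr, peak=255.0):
    """Peak signal-to-noise ratio in dB between two same-sized 2D images,
    given as nested lists of intensity values in [0, peak]."""
    mse, n = 0.0, 0
    for row_hr, row_sr in zip(hr, sr):
        for p, q in zip(row_hr, row_sr):
            mse += (p - q) ** 2
            n += 1
    mse /= n
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * math.log10(peak * peak / mse)
```

For example, two identical images give infinite PSNR, while a maximally wrong 8-bit pixel (0 vs. 255) gives 0 dB.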

Test on your own images

  1. If you have cloned this repository, you can first place your own images in ./results/LR/MyImage.

  2. Edit ./options/test/test_WSR_own.json for your needs according to ./options/test/README.md.

  3. Then, run command:

    cd WSR
    CUDA_VISIBLE_DEVICES=0 python test.py -opt options/test/test_WSR_own.json
  4. Finally, you can find the reconstructed images in ./results/SR/MyImage.

Train

  1. Download the training set DIV2K from the official link or BaiduYun (code: m84q).

  2. Run ./scripts/Prepare_TrainData_HR_LR.m in Matlab to generate HR/LR training pairs with BI degradation model and corresponding scale factor.

  3. Run ./results/Prepare_TestData_HR_LR.m in Matlab to generate HR/LR test images with the BI degradation model and the corresponding scale factor, and choose one of the SR benchmarks for evaluation during training.

  4. Edit ./options/train/train_WSR.json for your needs according to ./options/train/README.md.

  5. Then, run command:

    cd WSR
    CUDA_VISIBLE_DEVICES=0 python train.py -opt options/train/train_WSR.json
  6. You can monitor the training process in ./experiments.

  7. Finally, you can follow the Test Instructions to evaluate your model.

Results

The inference time is measured on B100 dataset (100 images) using Intel(R) Xeon(R) Silver 4210 CPU @ 2.20GHz (CPU time) and NVIDIA TITAN RTX GPU (GPU time).
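The benchmarking script itself is not part of this repository, but the measurement can be sketched as follows: average wall-clock time per image over the whole dataset. Here `model` is a stand-in for any callable mapping an LR image to an SR image.

```python
import time

def average_inference_time(model, images):
    """Average wall-clock seconds per image for `model` over `images`."""
    start = time.perf_counter()
    for img in images:
        model(img)  # discard outputs; only timing matters here
    return (time.perf_counter() - start) / len(images)
```

Note that when timing on a GPU with PyTorch, CUDA kernels launch asynchronously, so `torch.cuda.synchronize()` should be called before reading the clock to get a meaningful GPU time.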

Quantitative Results

Comparisons on the number of network parameters, inference time, and PSNR/SSIM of different 4× SR methods. Best and second best PSNR/SSIM results are marked in red and blue, respectively.

Comparisons on the number of network parameters and inference time of different 4× SR methods. Best results are highlighted. Notice that the compared methods achieve better PSNR/SSIM results than our WSR does.

Qualitative Results

Visual comparisons with different 4× SR advances on “img018” and “img024” from Urban100 dataset. The inference time is CPU time.

Trade-off Results

Relationship between the number of network parameters, inference time, and reconstruction performance of different 4× SR advances. The color represents PSNR achieved by different 4× networks on the B100 dataset. The inference time in the left figure is CPU time and that in the right figure is GPU time.

TODO

  • Option files for more scales (e.g., 2×, 8×, 16×).

Acknowledgements
