TTST (IEEE TIP 2024)

📖Paper | 🖼️PDF

PyTorch codes for "TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution", IEEE Transactions on Image Processing (TIP), 2024.

Authors: Yi Xiao, Qiangqiang Yuan*, Kui Jiang, Jiang He, Chia-Wen Lin, and Liangpei Zhang
Wuhan University, Harbin Institute of Technology, and National Tsinghua University

🎉🎉 News 🎉🎉

The pre-trained TTST (×4) was released for a quick test on remote sensing images! [Download Pre-trained Model]

Abstract

Transformer-based method has demonstrated promising performance in image super-resolution tasks, due to its long-range and global aggregation capability. However, the existing Transformer brings two critical challenges for applying it in large-area earth observation scenes: (1) redundant token representation due to most irrelevant tokens; (2) single-scale representation which ignores scale correlation modeling of similar ground observation targets. To this end, this paper proposes to adaptively eliminate the interference of irreverent tokens for a more compact self-attention calculation. Specifically, we devise a Residual Token Selective Group (RTSG) to grasp the most crucial token by dynamically selecting the top-k keys in terms of score ranking for each query. For better feature aggregation, a Multi-scale Feed-forward Layer (MFL) is developed to generate an enriched representation of multi-scale feature mixtures during feed-forward process. Moreover, we also proposed a Global Context Attention (GCA) to fully explore the most informative components, thus introducing more inductive bias to the RTSG for an accurate reconstruction. In particular, multiple cascaded RTSGs form our final Top-k Token Selective Transformer (TTST) to achieve progressive representation. Extensive experiments on simulated and real-world remote sensing datasets demonstrate our TTST could perform favorably against state-of-the-art CNN-based and Transformer-based methods, both qualitatively and quantitatively. In brief, TTST outperforms the state-of-the-art approach (HAT-L) in terms of PSNR by 0.14 dB on average, but only accounts for 47.26% and 46.97% of its computational cost and parameters.

Network

🧩 Install

git clone https://github.com/XY-boy/TTST.git

Environment

CUDA 11.1

Python 3.9.13

PyTorch 1.9.1

Torchvision 0.10.1

basicsr 1.4.2

🎁 Dataset

Please download the following remote sensing benchmarks:

Data Type	AID	DOTA-v1.0	DIOR	NWPU-RESISC45
Training	Download	None	None	None
Testing	Download	Download	Download	Download

🧩 Usage

Quick Test

Download Pre-trained Model

Step I. Use the structure below to prepare your dataset.

/xxxx/xxx/ (your data path)

/GT/ 
   /000.png  
   /···.png  
   /099.png  
/LR/ 
   /000.png  
   /···.png  
   /099.png

Step II. Change the --data_dir to your data path.
Step III. Run the eval_4x.py

python eval_4x.py

Train

python train_4x.py

🖼️ Results

Quantitative

Visual

Acknowledgments

Our TTST mainly borrows from DRSFormer (https://github.com/cschenxiang/DRSformer) and SKNet.
Thanks for these excellent open-source works!

Contact

If you have any questions or suggestions, feel free to contact me.
Email: [email protected]; [email protected]

Citation

If you find our work helpful in your research, please consider citing it. We appreciate your support！😊

@ARTICLE{xiao2024ttst,
  author={Xiao, Yi and Yuan, Qiangqiang and Jiang, Kui and He, Jiang and Lin, Chia-Wen and Zhang, Liangpei},
  journal={IEEE Transactions on Image Processing}, 
  title={TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution}, 
  year={2024},
  volume={33},
  number={},
  pages={738-752},
  doi={10.1109/TIP.2023.3349004}
}

xy-boy / ttst Goto Github PK

ttst's Introduction

TTST (IEEE TIP 2024)

📖Paper | 🖼️PDF

🎉🎉 News 🎉🎉

Abstract

Network

🧩 Install

Environment

🎁 Dataset

🧩 Usage

Quick Test

Train

🖼️ Results

Quantitative

Visual

Acknowledgments

Contact

Citation

ttst's People

Contributors

Stargazers

Watchers

Forkers

ttst's Issues

Recommend Projects

Recommend Topics

Recommend Org