
realsh's Introduction

[ICCV 2023]. Supervised Homography Learning with Realistic Dataset Generation. [Paper].

Hai Jiang (1,2), Haipeng Li (3,2), Haoqiang Fan (2), Bing Zeng (3), Songchen Han (1), Shuaicheng Liu (3,2)

(1) Sichuan University, (2) Megvii Technology, (3) University of Electronic Science and Technology of China

Presentation video:

[Youtube] and [Bilibili]

Pipeline

Dependencies

pip install -r requirements.txt

Download the raw CA-unsup dataset

Please refer to Content-Aware Unsupervised Deep Homography Estimation (CAHomo).

- Original video data download links: [GoogleDriver], [BaiduYun] (key:gvor)

- Unzip the data and run "video2img.py" to save the images to the directory "./Homo_data/img".

Be sure to resize each image to (640, 360), since the point coordinates are defined at that resolution,
e.g. img = cv2.resize(img, (640, 360))

- Use the images listed in "train.txt" and "test.txt" for training and evaluation, respectively.

- The manually labeled evaluation files can be downloaded from [GoogleDriver], [BaiduYun] (key:hhnc)
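
The resizing note above matters because the labeled points are expressed in a 640x360 frame. If you ever need to work at a different resolution, the points would have to be rescaled by the same factors as the image. The snippet below is only a minimal sketch of that arithmetic; the function name `scale_points` and the (x, y) tuple layout are assumptions for illustration and are not part of this repository.

```python
from typing import Iterable, List, Tuple

# Resolution the point coordinates are defined in, per the note above.
REF_WIDTH, REF_HEIGHT = 640, 360


def scale_points(
    points: Iterable[Tuple[float, float]],
    target_size: Tuple[int, int],
    ref_size: Tuple[int, int] = (REF_WIDTH, REF_HEIGHT),
) -> List[Tuple[float, float]]:
    """Map (x, y) points from ref_size to target_size, both given as (width, height).

    Hypothetical helper: the repository itself expects images at 640x360.
    """
    sx = target_size[0] / ref_size[0]  # horizontal scale factor
    sy = target_size[1] / ref_size[1]  # vertical scale factor
    return [(x * sx, y * sy) for x, y in points]


if __name__ == "__main__":
    # The centre of a 640x360 frame maps to the centre of a 1280x720 frame.
    print(scale_points([(320.0, 180.0)], (1280, 720)))
```

Keeping the images at (640, 360), as recommended above, avoids this conversion entirely.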

Download the dominant plane masks for image generation

- Download links: [GoogleDriver], [BaiduYun] (key:j1zw)

- Unzip the masks to the directory "./Homo_data/mask"
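
Before training, it can help to confirm that both directories mentioned above are in place. The following is a small sketch, assuming only the two paths named in this README ("./Homo_data/img" and "./Homo_data/mask"); the helper name `check_layout` is hypothetical, and nothing is assumed about the file names inside each directory.

```python
from pathlib import Path
from typing import List, Sequence


def check_layout(root: str = "./Homo_data",
                 subdirs: Sequence[str] = ("img", "mask")) -> List[str]:
    """Return a description of every expected sub-directory that is missing or empty."""
    problems = []
    for name in subdirs:
        directory = Path(root) / name
        if not directory.is_dir():
            problems.append(f"{directory}: missing")
        elif not any(directory.iterdir()):
            problems.append(f"{directory}: empty")
    return problems


if __name__ == "__main__":
    issues = check_layout()
    if issues:
        for issue in issues:
            print(issue)
    else:
        print("Homo_data layout looks complete.")
```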

Pre-trained model

| Model | RE | LT | LL | SF | LF | Avg | Download |
|---|---|---|---|---|---|---|---|
| Pre-trained | 0.22 | 0.35 | 0.44 | 0.42 | 0.29 | 0.34 | [Google] [Baidu] (key:qqed) |
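
As a quick sanity check on the reported numbers: assuming Avg is the unweighted mean of the five category columns (the README does not state this explicitly), the figures are consistent. The variable and function names below are illustrative only.

```python
# Per-category errors of the pre-trained model, copied from the table above.
errors = {"RE": 0.22, "LT": 0.35, "LL": 0.44, "SF": 0.42, "LF": 0.29}


def mean_error(per_category: dict) -> float:
    """Unweighted mean over categories (an assumption about how Avg is computed)."""
    return sum(per_category.values()) / len(per_category)


if __name__ == "__main__":
    print(f"{mean_error(errors):.2f}")  # prints 0.34
```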

How to train?

Modify dataset/data_loader.py slightly to match your environment, then run:

python train.py --model_dir experiments/Base/ 

How to test?

python evaluate.py --model_dir experiments/Base/ --restore_file Iter2_0.3445.pth

Supplementary Material

- Download links: [GoogleDriver], [BaiduYun] (key:gwry)

Citation

If you use this code or ideas from the paper for your research, please cite our paper:

@InProceedings{Jiang_2023_ICCV,
    author    = {Jiang, Hai and Li, Haipeng and Han, Songchen and Fan, Haoqiang and Zeng, Bing and Liu, Shuaicheng},
    title     = {Supervised Homography Learning with Realistic Dataset Generation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {9806-9815}
}
