Code Monkey home page Code Monkey logo

df-net's Introduction

DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency

A TensorFlow re-implementation for DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency. There are some minor differences from the model described in the paper:

  • Model in the paper uses 2-frame as input, while this code uses 5-frame as input (you might use any odd numbers of frames as input, though you would need to tune the hyper-parameters)
  • FlowNet in the paper is pre-trained on SYNTHIA, while this one is pre-trained on Cityscapes

Please see the project page for more details.

Prerequisites

This codebase was developed and tested with the following settings:

Python 3.6
TensorFlow 1.2.0 (this is the only supported version)
g++ 4.x (this is the only supported version)
CUDA 8.0
Unbuntu 14.04
4 Tesla K80 GPUs (w/ 12G memory each)

Some Python packages you might not have

pypng
opencv-python

Installation

  1. Clone this repository
git clone [email protected]:vt-vl-lab/DF-Net.git
cd DF-Net
  1. Prepare models and training data
chmod +x ./misc/prepare.sh
./misc/prepare.sh

NOTE: Frames belonging to KITTI2012/2015 train/test scenes have been excluded in the provided training set. Add these frames back to the training set would improve the performance of DepthNet.

Data preparation (for evaluation)

After accepting their license conditions, download KITTI raw, KITTI flow 2012, KITTI flow 2015.

Then you can make soft-link for them

cd dataset
mkdir KITTI
cd KITTI

ln -s /path/to/KITTI/raw raw
ln -s /path/to/KITTI/2012 flow2012
ln -s /path/to/KITTI/2015 flow2015

(Optional) You can add those KITTI2012/2015 frames back to the training set, by commenting line81~line85 in data/kitti/kitti_raw_loader.py, and do

python data/prepare_train_data.py --dataset_name='kitti_raw_eigen' --dump_root=/path/to/save/ --num_threads=4

Training

export CUDA_VISIBLE_DEVICES=0,1,2,3
python train_df.py --dataset_dir=/path/to/your/data --checkpoint_dir=/path/to/save/your/model

For the first time, custom CUDA operations for FlowNet will be compiled. If you have any compliation issues, please check core/UnFlow/src/e2eflow/ops.py

  • Line31: specify your CUDA path
  • Line32: Add -I $CUDA_HOME/include, where $CUDA_HOME is your CUDA directory
  • Line38: specify your g++ version

Testing

Test DepthNet on KITTI raw (You can use the validation set to selct the best model.)

python test_kitti_depth.py --dataset_dir=/path/to/your/data --output_dir=/path/to/save/your/prediction --ckpt_file=/path/to/your/ckpt --split="val or test"
python kitti_eval/eval_depth.py --pred_file=/path/to/your/prediction --split="val or test"

Test FlowNet on KITTI 2012 (Please use training set)

python test_flownet_2012.py --dataset_dir=/path/to/your/data --ckpt_file=/path/to/your/ckpt

Test FlowNet on KITTI 2015 (Please use training set)

python test_flownet_2015.py --dataset_dir=/path/to/your/data --ckpt_file=/path/to/your/ckpt

NOTE: For KITTI 2012/2015

  • If you want to generate visualization colormap for training set, you can specify output_dir
  • If you want to test on test set and upload it to KITTI server, you can specify output_dir and test on test set.

Pre-trained model performance

You should get the following numbers if you use the pre-trained model pretrained/dfnet

DepthNet (KITTI raw test test)

abs rel sq rel rms log rms a1 a2 a3
0.1452 1.2904 5.6115 0.2194 0.8114 0.9394 0.9767

FlowNet (KITTI 2012/2015 training set)

KITTI 2012 EPE KITTI 2015 EPE KITTI 2015 F1
3.1052 7.4482 0.2695

Citation

If you find this code useful for your research, please consider citing the following paper:

@inproceedings{zou2018dfnet,
author    = {Zou, Yuliang and Luo, Zelun and Huang, Jia-Bin}, 
title     = {DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency}, 
booktitle = {European Conference on Computer Vision},
year      = {2018}
}

Acknowledgement

Codes are heavily borrowed from several great work, including SfMLearner, monodepth, and UnFlow. We thank Shih-Yang Su for the code review.

df-net's People

Contributors

zhangconghhh avatar yuliang-zou avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.