
wild_deep_mvs's Introduction

Deep MVS gone wild

PyTorch implementation of "Deep MVS gone wild", published at 3DV 2021 (Paper | website).

This repository provides the code to reproduce the experiments of the paper. It implements an extensive comparison of deep MVS architectures, training data, and supervision.

If you find this repository useful for your research, please consider citing:

@inproceedings{darmon2021deepmvs,
  author    = {Darmon, Fran{\c{c}}ois and
               Bascle, B{\'{e}}n{\'{e}}dicte and
               Devaux, Jean{-}Cl{\'{e}}ment and
               Monasse, Pascal and
               Aubry, Mathieu},
  title     = {Deep Multi-View Stereo gone wild},
  booktitle = {International Conference on 3D Vision},
  year      = {2021},
}

Installation

  • Python packages: see requirements.txt

  • Fusibile:

git clone https://github.com/YoYo000/fusibile
cd fusibile
cmake .
make
ln -s EXE ./fusibile   # EXE stands for the path of the built fusibile executable
  • COLMAP: see the GitHub repository for installation details, then link the colmap executable with ln -s COLMAP_DIR/build/src/exe/colmap colmap. A quick check of both links is sketched below.
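
A minimal sanity check, assuming both symlinks were created in the repository root (the exact link targets are placeholders and depend on where each tool was built):

# from the repository root, both links should resolve to real executables
ls -l fusibile colmap
./colmap help    # should list the available COLMAP commands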

Training

You may find all the pretrained models here (120 MB), or alternatively you can train models using the following instructions.

Data

Download the following data and extract it into the datasets folder.

The directory structure should be as follows:

datasets
├─ blended
├─ dtu_train
├─ MegaDepth_v1
├─ undistorted_md_geometry

The data is already preprocessed for DTU and BlendedMVS. For MegaDepth, run python preprocess.py to generate the training data.
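
A minimal setup sketch; the archive names below are placeholders for whatever files the download links actually provide:

mkdir -p datasets
# extract each downloaded archive so the tree matches the layout above
unzip blended.zip -d datasets/                   # -> datasets/blended
unzip dtu_train.zip -d datasets/                 # -> datasets/dtu_train
tar -xzf MegaDepth_v1.tar.gz -C datasets/        # -> datasets/MegaDepth_v1
unzip undistorted_md_geometry.zip -d datasets/   # -> datasets/undistorted_md_geometry
# MegaDepth then needs the extra preprocessing pass
python preprocess.py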

Script

The training script is train.py; run python train.py --help for all the options. For example:

  • python train.py --architecture vis_mvsnet --dataset md --supervised --logdir best_sup --world_size 4 --batch_size 4 for training the best performing setup for images in the wild.
  • python train.py --architecture mvsnet-s --dataset md --unsupervised --upsample --occ_masking --epochs 5 --lrepochs 4:10 --logdir best_unsup --world_size 3 for the best unsupervised model.

The models are saved in the trained_models folder.
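
As a sketch of how the pieces fit together: the --logdir value becomes the folder name under trained_models that the evaluation scripts later expect as --model. The single-GPU flag values and the folder layout below are assumptions:

# the --logdir value becomes the folder name under trained_models/
python train.py --architecture mvsnet-s --dataset md --supervised --logdir my_run --world_size 1 --batch_size 2
ls trained_models/my_run    # saved checkpoints (assumed layout)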

Evaluations

We provide code for both depthmap evaluation and 3D reconstruction evaluation.

Data

Download the following data and extract it into datasets:

  • BlendedMVS (27.5 GB): same link as the BlendedMVS training data

  • YFCC depth maps (1.1 GB)

  • DTU MVS benchmark: create the directory datasets/dtu_eval and extract the following files into it (a setup sketch follows this list).

    In the end, the folder structure should be:

    datasets
    ├─ dtu_eval
        ├─ ObsMask
        ├─ images
        ├─ Points
            ├─ stl
    
  • YFCC 3D reconstruction (1.5 GB)
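
A sketch of the DTU benchmark setup, assuming the downloaded archives unpack into ObsMask, images, and Points/stl; the archive names below are placeholders:

mkdir -p datasets/dtu_eval
# unpack each archive so its contents land directly under datasets/dtu_eval
unzip ObsMask.zip -d datasets/dtu_eval    # -> datasets/dtu_eval/ObsMask
unzip images.zip -d datasets/dtu_eval     # -> datasets/dtu_eval/images
unzip Points.zip -d datasets/dtu_eval     # -> datasets/dtu_eval/Points/stl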

Depthmap evaluation

python depthmap_eval.py --model MODEL --dataset DATA

  • MODEL is the name of a folder found in trained_models
  • DATA is the evaluation dataset, either yfcc or blended (see the example below)
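
For instance, to evaluate the models trained above (any folder name under trained_models works as --model, including the provided pretrained models):

python depthmap_eval.py --model best_sup --dataset yfcc
python depthmap_eval.py --model best_unsup --dataset blended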

3D reconstruction

See python reconstruction_pipeline.py --help for a complete list of parameters for 3D reconstruction. To run the whole evaluation for a trained model with the parameters used in the paper, run:

  • scripts/eval3d_dtu.sh --model MODEL (--compute_metrics) for DTU evaluation
  • scripts/eval3d_yfcc.sh --model MODEL (--compute_metrics) for YFCC 3D evaluation

The reconstructions will be located in datasets/dtu_eval/Points or datasets/yfcc_data/Points.
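
For example, to reproduce the paper's setup for the best supervised model (the model name is any folder under trained_models):

scripts/eval3d_dtu.sh --model best_sup --compute_metrics
scripts/eval3d_yfcc.sh --model best_sup --compute_metrics
ls datasets/dtu_eval/Points datasets/yfcc_data/Points    # resulting point clouds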

Acknowledgments

This repository is inspired by the MVSNet_pytorch and MVSNet repositories. We also adapt the official implementations of Vis_MVSNet and CVP_MVSNet.

Copyright

Deep MVS Gone Wild. All rights reserved to Thales LAS and ENPC.

This code is freely available for academic use only and is provided “as is” without any warranty.

Modifications are allowed for academic research provided that the following conditions are met:
  * Redistributions of source code or any format must retain the above copyright notice and this list of conditions.
  * Neither the name of Thales LAS and ENPC nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.

wild_deep_mvs's People

Contributors

fdarmon


wild_deep_mvs's Issues

Expired download links

Hello, I found that the download link for BlendedMVS has expired. Can you provide a new one? Thanks very much!

Re-computing f-scores reported in paper

Hi there,

Thanks for the great work!

I'm trying to re-compute the metrics reported in Table 4 of your paper (prec., rec., f-score on YFCC evaluation) but there doesn't appear to be code in your repo for doing this. I have my own function for computing the metrics, but am a bit confused about setting a threshold.

First, do you have the thresholds you used for each YFCC scene stored anywhere? I tried using the values stored in the text files in yfcc_data/gt_resolution, but using your provided trained models with this threshold doesn't give me the same results you report in Table 4.

Second, is Eq. 7 in the paper correct? I understand the goal of Eq. 7, but shouldn't the distance argument be ||K^-1 (D(p)p) - K^-1 (D(p')p')||? This way, you find the median distance in scene space between back-projected points 2 pixels away from each other. Is this perhaps what you used, or did you use Eq. 7 as is?

Thanks!
Alex

Ground-truth depth map fusion parameters

Hi, thanks for the great work.
Could you provide the exact parameters you used to generate the ground truth by fusing depth maps from IMC? I fused these scenes with the parameters specified in the paper ("reprojection error below half a pixel and depth error below 1%"), but the fusion result is not as good as the ground truth you provided. Thanks!

Urban datasets

Hi, thank you for sharing both the paper and the code. I'm working on something similar, so I was very happy to read your results.

I would like to ask if you ever considered urban datasets in your evaluation, especially multi-camera datasets such as nuScenes or DDAD. I'm asking this for two main reasons:

  1. Internet data are surely "in the wild", but they mostly focus on a single large object (e.g., a building) for which many diverse views can be captured. Urban data, on the other hand, have no clear subject, many dynamic objects, and several textureless areas to deal with, which is an even harder test for MVS networks.
  2. MVS networks cannot be trained in a supervised way on urban data, so your insights on unsupervised methods would be interesting to validate on this kind of data as well.

What are your thoughts on this?
