
Comments (8)

MichaelRamamonjisoa commented on June 2, 2024

Hi,

Indeed, your reported results are quite strange. I expect some gap between different runs, but not that much.
There are a few things we need to check:

  • Can you confirm that you are using depth hints?
  • Can you evaluate your epoch 19 result, since the log indices start at 0?
  • Can you run training again, setting --num_epochs 20?

Do you run the following command for evaluation?

python evaluate_depth.py \
  --data_path <your_KITTI_path> \
  --encoder_type resnet --num_layers 50 \
  --width 1024 --height 320 \
  --load_weights_folder <path_to_model> \
  --use_wavelets \
  --eval_stereo \
  --eval_split eigen \
  --post_process

I unfortunately cannot run experiments at the moment.
It would help us debug if you could run the same experiments without wavelets, and compare against the "Depth hints Resnet 50" results.


ruili3 commented on June 2, 2024

Hi,

Thank you for your reply :D
Regarding the things to check, I can answer some of them now:

  • I can confirm that I use depth hints. I set --use_depth_hints and --depth_hint_path in the training command, and I monitor the depth hints loss in TensorBoard. As shown below, the loss curve is quite smooth. Maybe you can help check whether the loss values are reasonable.
    [Image: TensorBoard curve of the depth hints loss]

  • The result is exactly from the epoch indexed 19 during training; I renamed it to 20 in the previous post.

  • Yes, I can run it again with --num_epochs set to 20. Based on my previous experience running Monodepth2 and ManyDepth, training for more than 20 epochs has little influence on the final performance.

When I run the evaluation, the command is the same as the one you suggest above. The code performs stereo evaluation with a fixed scaling factor of 5.4.
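(For clarity, my understanding of that scaling step is roughly the following; this is a monodepth2-style sketch with assumed names, not the exact code:)

  import numpy as np

  STEREO_SCALE_FACTOR = 5.4  # fixed KITTI stereo baseline factor for stereo-trained models

  def scale_pred_depth(pred_depth, gt_depth, eval_stereo=True):
      # Sketch of the scaling applied before computing metrics (names assumed).
      if eval_stereo:
          # Stereo training fixes the metric scale up to the known baseline,
          # so a constant factor is used instead of per-image median scaling.
          return pred_depth * STEREO_SCALE_FACTOR
      # Monocular models are scale-ambiguous: fall back to median scaling.
      return pred_depth * (np.median(gt_depth) / np.median(pred_depth))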

I would like to run DepthHints without wavelets at 1024x320 resolution to see whether the depth-hints baseline already deviates from the expected results in my setting. I'll post the result once training finishes. Thanks a lot for your help with further debugging!


ruili3 commented on June 2, 2024

Hi,

I ran the same experiment without wavelets (1024x320, depth hints, epoch 19), and the scores still fall behind the expected results:

abs_rel | sq_rel | rmse   | rmse_log | a1     | a2     | a3
0.1136  | 0.8743 | 4.8357 | 0.2026   | 0.8599 | 0.9514 | 0.9779

The values for "Depth hints Resnet 50 (1024x320)" in your paper are:

abs_rel | sq_rel | rmse   | rmse_log | a1     | a2     | a3
0.096   | 0.710  | 4.393  | 0.185    | 0.890  | 0.962  | 0.981

It seems the performance of both settings drops simultaneously here.
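(For reference, these columns are the standard KITTI depth metrics; roughly, they are computed as in monodepth2-style evaluation code, sketched below:)

  import numpy as np

  def compute_errors(gt, pred):
      # Standard KITTI depth metrics over valid ground-truth pixels (sketch).
      thresh = np.maximum(gt / pred, pred / gt)
      a1 = (thresh < 1.25).mean()
      a2 = (thresh < 1.25 ** 2).mean()
      a3 = (thresh < 1.25 ** 3).mean()
      rmse = np.sqrt(((gt - pred) ** 2).mean())
      rmse_log = np.sqrt(((np.log(gt) - np.log(pred)) ** 2).mean())
      abs_rel = np.mean(np.abs(gt - pred) / gt)
      sq_rel = np.mean(((gt - pred) ** 2) / gt)
      return abs_rel, sq_rel, rmse, rmse_log, a1, a2, a3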

It's weird, because I cloned the code directly from your repository and ran training and evaluation following the commands in README.md. Could this be caused by the training data or the running environment? I use the .jpg images (as processed by Monodepth2) for training and also use these images to generate the depth hints files. Can you provide more details or share the environment file of your conda environment?


MichaelRamamonjisoa commented on June 2, 2024

Hi,

Thanks a lot for these results!

The good news is that you get the correct main result: wavelets do not change the original performance, since the results are similar with or without wavelets.

I made a few modifications to the original depth-hints code to simplify it before release, so the issue might also come from that, but this needs more investigation, and unfortunately I cannot access GPUs right now to retrain at 1024x320 resolution with ResNet50.

Looking at the original depth-hints, I see that --disparity_smoothness 0 and --scheduler_step_size 5 are added when training with depth hints. These are missing from my training command line, as sketched below.
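For reference, the full training command would then look roughly like this (assuming the entry point is train.py; paths are placeholders and the flag names are the ones from the monodepth2 / depth-hints options, so please double-check them against the repository):

python train.py \
  --data_path <your_KITTI_path> \
  --encoder_type resnet --num_layers 50 \
  --width 1024 --height 320 \
  --use_wavelets \
  --use_depth_hints \
  --depth_hint_path <your_depth_hints_path> \
  --disparity_smoothness 0 \
  --scheduler_step_size 5 \
  --num_epochs 20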

Regarding jpg vs png, it seems that the depth hints are generated from the .jpg format: https://github.com/nianticlabs/depth-hints/blob/aa2ecf7bc88ef2edbd434fbf064ac024cec8e85d/precompute_depth_hints.py#L178, so you should be OK.

I will leave this issue open until we solve it, but let me know if setting these flags changes the final scores. In any case, a performance increase in depth hints should translate directly into a similar performance increase in WaveletMonoDepth.

In the meantime, if you'd like to try using our trained WaveletMonoDepth, I have put a link to trained weights in the README files.


ruili3 commented on June 2, 2024

Hi,

I think I find the reason :D

In L15 of KITTI/networks/network_constructors.py, ResNet pretraining is disabled in your code, so the performance is worse than with a pre-trained backbone:

encoder = encoders.ResnetEncoder(opts.num_layers, False)

After using a pre-trained ResNet 50 with wavelets, I get results similar to those reported in your paper:

abs_rel | sq_rel | rmse   | rmse_log | a1     | a2     | a3
0.0979  | 0.7363 | 4.4214 | 0.1847   | 0.8902 | 0.9629 | 0.9819
You could change this line and the result will be good :D
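For example, something like this (assuming a monodepth2-style opts.weights_init option; otherwise simply hard-code True):

  # enable ImageNet pre-training for the ResNet encoder
  encoder = encoders.ResnetEncoder(opts.num_layers, opts.weights_init == "pretrained")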

Meanwhile, I have a small question regarding the network design in the paper. I wonder why, in L134-L135 of depth_decoder.py,

yh = 2**(scale-1) * self.sigmoid(self.convs[("waveconv", scale, 1)](input_features)).unsqueeze(1) - \

the outputs of the two "waveconv" modules are scaled by 2**(scale-1) according to the current scale, and the final yh is computed as the difference of the two "waveconv" outputs. Should I follow the same practice if I want to build a new network module with a similar multi-level output? Thanks a lot!


MichaelRamamonjisoa commented on June 2, 2024

Hey!

Good spot, thanks a lot for your help with debugging!

I'll put back the pretraining option then.
Regarding your question, the scaling factor is meant to make sure yh stays within the [-2**(scale-1), 2**(scale-1)] range. The IDWT will then output an LL with values in the range [0, 2**(scale-2)].

The "2-layer" design was used to have two separate layers for negative and positive values. You could also use a tanh activation and remove the need for two layers; it will work fine too, although you might see a slight decrease in performance.

From previous experiments, I found that you can also use linear activations for the waveconv layers and let the network learn to predict values in the right range, but I decided to keep these activations to improve training stability.
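To make this concrete, here is a rough self-contained sketch of the idea (simplified PyTorch, not the exact repository code):

  import torch
  import torch.nn as nn

  class WaveCoeffHead(nn.Module):
      # Predicts the high-frequency wavelet coefficients (LH, HL, HH) at one scale (sketch).
      def __init__(self, in_channels, scale, two_layer=True):
          super().__init__()
          self.scale = scale
          self.two_layer = two_layer
          self.conv_pos = nn.Conv2d(in_channels, 3, kernel_size=3, padding=1)  # positive part
          self.conv_neg = nn.Conv2d(in_channels, 3, kernel_size=3, padding=1)  # negative part

      def forward(self, x):
          amp = 2 ** (self.scale - 1)  # keeps yh within [-2**(scale-1), 2**(scale-1)]
          if self.two_layer:
              # Difference of two sigmoid branches: one layer covers positive values,
              # the other covers negative values.
              yh = amp * torch.sigmoid(self.conv_pos(x)) - amp * torch.sigmoid(self.conv_neg(x))
          else:
              # A single tanh branch covers the same range with one layer.
              yh = amp * torch.tanh(self.conv_pos(x))
          # Add the channel dimension expected by pytorch_wavelets' IDWT: (B, 1, 3, H, W).
          return yh.unsqueeze(1)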


ruili3 commented on June 2, 2024

Yes, thanks for your explanation!


MichaelRamamonjisoa commented on June 2, 2024

Fixed in commit 1d9f945. Closing.

