pam's People

Contributors

longguangwang

pam's Issues

left_disp and right_disp take part in training: is it still unsupervised?

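Hello, thank you for your work and for open-sourcing the code; I have learned a lot from it.
I have one question I would like to ask:

According to the code in train.py,
mask_left = ((disp_left > 0) & (disp_left < 192)).float()
mask_right = ((disp_right > 0) & (disp_right < 192)).float()
and
loss_PAM_P = loss_pam_photometric(img_left, img_right, att, valid_mask, [mask_left, mask_right])
the information in disp_left and disp_right is used during training. Does this conflict with unsupervised training?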
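For readers unfamiliar with the quoted lines, here is a minimal sketch of what such a mask computes, written in NumPy rather than PyTorch so it runs anywhere. The helper name `valid_disparity_mask` and the toy values are assumptions for illustration; only the bounds `0` and `192` come from the quoted code.

```python
import numpy as np

MAX_DISP = 192  # upper bound taken from the quoted train.py lines


def valid_disparity_mask(disp, max_disp=MAX_DISP):
    """Hypothetical helper: 1.0 where 0 < disp < max_disp, else 0.0."""
    return ((disp > 0) & (disp < max_disp)).astype(np.float32)


# Toy 2x3 disparity map: 0.0 stands for a missing value, 250.0 is out of range.
disp_left = np.array([[0.0, 12.5, 250.0],
                      [3.0, 191.9, 192.0]])
mask_left = valid_disparity_mask(disp_left)
print(mask_left)
```

The mask only selects which pixels contribute to the loss; the disparity values themselves are not regressed against in this sketch.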

[Test interface for a single image pair]

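Hello! Thank you very much for your outstanding work. I see that test.py has no interface for testing a single image pair; could you provide one when you have time? Also, can PAM output disparity maps for image pairs of arbitrary resolution?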

Generalization performance and fine-tuning

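Hello, thank you very much for your work. How well does this model generalize? Does it give good results in real-world conditions? If I want to train on my own real data, do I still need to label it?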

Small mistake about the metric.

Hi, thank you very much for your great work!
There may be a small mistake in your metric code.
According to the KITTI benchmark, 'D1' is the percentage of stereo disparity outliers in the first frame.
Outliers are defined as pixels whose disparity error is larger than max(3 px, 0.05 d_truth), where d_truth denotes the ground-truth disparity.
I guess your 'D3' refers to 'bad3', which is the percentage of "bad" pixels whose error is greater than 3 px.
If I am wrong, please tell me.
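The difference between the two metrics can be seen in a small sketch. The function names `bad3` and `d1_as_quoted` are hypothetical, the D1 threshold follows the definition quoted in this issue rather than the repository's code, and treating `disp_gt > 0` as the valid-pixel test is an assumption.

```python
import numpy as np


def bad3(disp_est, disp_gt):
    """Fraction of valid pixels whose absolute error exceeds 3 px."""
    valid = disp_gt > 0  # assumption: non-positive ground truth means "no label"
    err = np.abs(disp_est - disp_gt)[valid]
    return float(np.mean(err > 3.0))


def d1_as_quoted(disp_est, disp_gt):
    """Fraction of valid pixels whose error exceeds max(3 px, 0.05 * d_truth)."""
    valid = disp_gt > 0
    err = np.abs(disp_est - disp_gt)[valid]
    thresh = np.maximum(3.0, 0.05 * disp_gt[valid])
    return float(np.mean(err > thresh))


disp_gt = np.array([10.0, 100.0, 100.0, 40.0])
disp_est = np.array([10.5, 104.0, 106.0, 44.0])

print(bad3(disp_est, disp_gt))          # 0.75: errors 4, 6, 4 all exceed 3 px
print(d1_as_quoted(disp_est, disp_gt))  # 0.5: the 4 px error at d_truth = 100 is within 5%
```

The two numbers diverge only at large ground-truth disparities, where the 5% term raises the threshold above 3 px.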

the release time

@LongguangWang
Mr Wang,
I am very interested in your work. Given your excellent disparity results, I would like to use your code as a reference for some unsupervised stereo matching work. When will the code be released? Thank you very much!

Questions about the epipolar line and disparity in parallax attention

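Hello, sorry to bother you when you are busy. I am working on a project that I hope can build on your work, and I have two questions:
1. If the offset between the left image and the right image is not horizontal, for example when a person views an object with their head tilted, does the algorithm still apply? As I understand it, the epipolar line is horizontal, and the algorithm multiplies the Q and K matrices row by row, which ensures that attention is computed only along the epipolar line.
2. If the disparity between the left image and the right image is small, for example a shift of only 2 pixels, does the algorithm still apply? Thank you!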

Can the code be trained on the KITTI dataset?

Hello, can the code be trained on the KITTI dataset? I see that train.py only has train_set = SceneFlowDatset(), but you also provide KITTIDataset(). Can I replace SceneFlowDatset with KITTIDataset for training?

forward() missing 2 required positional arguments: 'x_left' and 'x_right'

Mr Wang,
I'm very interested in your work, but I have run into problems when training on the SceneFlow dataset.
When debugging with two GPUs:
if the batch size is set to 1, I get "TypeError: forward() missing 2 required positional arguments: 'x_left' and 'x_right'";
if the batch size is set to 2, I get "RuntimeError: CUDA error: out of memory".
How can I solve this?

debug

@LongguangWang
Mr Wang,

File "/home/PAM-master/PASMnet/models/modules.py", line 130, in forward
cost_right2left = torch.tril(cost_right2left)
RuntimeError: invalid argument 1: expected a matrix at /opt/conda/conda-bld/pytorch_1544202130060/work/aten/src/THC/generic/THCTensorMathPairwise.cu:174

I'm very interested in your work. This error occurs while training on the SceneFlow dataset. What causes it, and how can I solve it?
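One possible cause, which is an assumption and not confirmed in this thread, is that the installed PyTorch build accepts only 2-D input to torch.tril while cost_right2left carries a batch dimension. A workaround that does not depend on batched support is to multiply by a 2-D lower-triangular mask that broadcasts over the batch. The sketch below shows the idea in NumPy with an assumed shape `(B, W, W)`; `batched_tril` is a hypothetical helper, and the same mask-and-multiply pattern would need to be ported to tensors.

```python
import numpy as np


def batched_tril(cost):
    """Zero the strict upper triangle of the last two dims of `cost`."""
    w = cost.shape[-1]
    mask = np.tril(np.ones((w, w), dtype=cost.dtype))  # 2-D lower-triangular mask
    return cost * mask  # the mask broadcasts over all leading dimensions


cost = np.arange(2 * 3 * 3, dtype=np.float64).reshape(2, 3, 3)
lower = batched_tril(cost)
print(lower[0])
```

Multiplying by a mask assumes the entries are finite; infinities or NaNs above the diagonal would not be zeroed this way.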
