
dxslam's Introduction

DXSLAM

DXSLAM is a visual SLAM system based on deep CNN feature extraction. Please

  • clone this repo if you want to run an offline evaluation with e.g. the TUM dataset, or
  • clone dxslam_ros and deep_features if you want a ROS version to work with a live camera or ROS bags e.g. from the OpenLORIS-Scene datasets, or
  • clone deep_features if you are interested in deep feature extraction only.

Technical details are described in this paper (to be published in IROS 2020):

Dongjiang Li, Xuesong Shi, Qiwei Long, Shenghui Liu, Wei Yang, Fangshi Wang, Qi Wei, Fei Qiao, "DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features," arXiv preprint arXiv:2008.05416, 2020.

@article{li2020dxslam,
  title={{DXSLAM}: A Robust and Efficient Visual {SLAM} System with Deep Features},
  author={Dongjiang Li and Xuesong Shi and Qiwei Long and Shenghui Liu and Wei Yang and Fangshi Wang and Qi Wei and Fei Qiao},
  journal={arXiv preprint arXiv:2008.05416},
  year={2020}
}

The SLAM pipeline in this repo is customized from ORB-SLAM2.

1. Prerequisites

We have tested the library on Ubuntu 16.04 and 18.04, but it should be easy to compile on other platforms.

  • C++11 or C++0x Compiler
  • Pangolin
  • OpenCV
  • Eigen3
  • DBoW, Fbow and g2o (included in the Thirdparty folder)
  • TensorFlow (1.12)

2. Building DXSLAM library and examples

Clone the repository:

git clone https://github.com/ivipsourcecode/dxslam.git dxslam

We provide a script build.sh to build the Thirdparty libraries and DXSLAM. Please make sure you have installed all required dependencies (see section 1). Execute:

cd dxslam
chmod +x build.sh
./build.sh

This will create libDXSLAM.so in the lib folder and the executable rgbd_tum in the Examples folder.

3. RGB-D Example

TUM Dataset

  1. Download a sequence from http://vision.in.tum.de/data/datasets/rgbd-dataset/download and uncompress it.

  2. Associate RGB images and depth images using the python script associate.py (a minimal sketch of the association idea is given after this list):

python associate.py PATH_TO_SEQUENCE/rgb.txt PATH_TO_SEQUENCE/depth.txt > associations.txt

  3. Get the HF-Net output:

cd hf-net
python3 getFeature.py image/path/to/rgb output/feature/path

  4. Execute the following command. Change TUMX.yaml to TUM1.yaml, TUM2.yaml or TUM3.yaml for freiburg1, freiburg2 and freiburg3 sequences respectively. Change PATH_TO_SEQUENCE_FOLDER to the uncompressed sequence folder. Change ASSOCIATIONS_FILE to the path to the corresponding associations file. Change OUTPUT/FEATURE/PATH to the feature folder produced in step 3.

./Examples/RGB-D/rgbd_tum Vocabulary/DXSLAM.fbow Examples/RGB-D/TUMX.yaml PATH_TO_SEQUENCE_FOLDER ASSOCIATIONS_FILE OUTPUT/FEATURE/PATH
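For reference, the association step pairs each RGB timestamp with the closest depth timestamp within a small tolerance. The following is a minimal Python sketch of that idea only, not the TUM benchmark's associate.py itself (which also removes already-matched entries and offers more options):

    # Minimal sketch of the RGB/depth timestamp association idea behind
    # associate.py: pair each RGB timestamp with the closest depth timestamp
    # within a tolerance (the TUM tool defaults to 0.02 s).
    def read_timestamps(path):
        """Return {timestamp: filename} from a TUM-style rgb.txt or depth.txt."""
        entries = {}
        with open(path) as f:
            for line in f:
                if line.startswith("#") or not line.strip():
                    continue
                stamp, name = line.split()[:2]
                entries[float(stamp)] = name
        return entries

    def associate(rgb, depth, max_difference=0.02):
        """Greedy nearest-timestamp matching; yields (t_rgb, rgb_file, t_depth, depth_file)."""
        pairs = []
        depth_stamps = sorted(depth)
        for t_rgb in sorted(rgb):
            t_depth = min(depth_stamps, key=lambda t: abs(t - t_rgb))
            if abs(t_depth - t_rgb) < max_difference:
                pairs.append((t_rgb, rgb[t_rgb], t_depth, depth[t_depth]))
        return pairs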

4. Processing your own sequences

You will need to create a settings file with the calibration of your camera. See the settings file provided for the TUM RGB-D cameras. We use the calibration model of OpenCV. RGB-D input must be synchronized and depth registered.
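As an illustration of the OpenCV calibration model mentioned above, here is a minimal Python sketch that undistorts an image using intrinsics fx, fy, cx, cy and distortion coefficients k1, k2, p1, p2. The numeric values are placeholders (roughly the TUM freiburg3 intrinsics) and the file name is hypothetical, so substitute your own calibration:

    # Minimal sketch of the OpenCV pinhole + radial-tangential model used by
    # the settings files.  Values are placeholders; replace them with your
    # own camera calibration.
    import numpy as np
    import cv2

    K = np.array([[535.4,   0.0, 320.1],
                  [  0.0, 539.2, 247.6],
                  [  0.0,   0.0,   1.0]])      # fx, fy, cx, cy
    dist = np.array([0.0, 0.0, 0.0, 0.0])      # k1, k2, p1, p2

    rgb = cv2.imread("rgb/frame.png")          # hypothetical input frame
    undistorted = cv2.undistort(rgb, K, dist)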

5. SLAM and Localization Modes

You can change between the SLAM and Localization mode using the GUI of the map viewer.

SLAM Mode

This is the default mode. The system runs three threads in parallel: Tracking, Local Mapping and Loop Closing. The system localizes the camera, builds a new map and tries to close loops.

Localization Mode

This mode can be used when you have a good map of your working area. In this mode the Local Mapping and Loop Closing are deactivated. The system localizes the camera in the map (which is no longer updated), using relocalization if needed.

dxslam's People

Contributors

cedrusx, ivipsourcecode, oldaaaa

dxslam's Issues

How to run RGB-D Example

I know the command is as follows:

./Examples/RGB-D/rgbd_tum Vocabulary/DXSLAM.fbow Examples/RGB-D/TUMX.yaml PATH_TO_SEQUENCE_FOLDER ASSOCIATIONS_FILE OUTPUT/FEATURE/PATH

my command is:

./Examples/RGB-D/rgbd_tum Vocabulary/DXSLAM.fbow  Examples/RGB-D/TUM3.yaml dataset/TUM/rgbd_dataset_freiburg3_walking_halfsphere/ ./ hf-net/output/feature/path/

When I run this command, nothing happens.
I want to know where the ASSOCIATIONS_FILE comes from.

terminate called after throwing an instance of 'std::runtime_error'

I am trying to run dxslam. There were a few files missing, like ORBvoc.txt, so I got it from the ORB-SLAM folder. Then the TUM1.yaml file was also missing, which I also copied from the ORB-SLAM folder. After running the RGB-D example I get the following error.

Loading ORB Vocabulary. This could take a while... terminate called after throwing an instance of 'std::runtime_error' what(): Vocabulary::fromStream invalid signature Aborted (core dumped)

Separate scripts to download vocabulary and hf-net model

It is annoying that the build.sh script tries to download some data every time it runs, even if the data has already been downloaded.
How about moving the wget and tar commands out of build.sh into one or two separate scripts?
Thanks.

Also, it may be better to add a -p param to all the mkdir commands.
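A possible shape for such a helper, sketched in Python with a placeholder URL and paths (the actual files fetched by build.sh are not listed here), would skip the download when the archive already exists and create directories idempotently, mirroring mkdir -p:

    # Hypothetical helper sketching the requested behaviour: download the
    # vocabulary / HF-Net model only when missing, and create directories
    # idempotently (the Python equivalent of `mkdir -p`).
    import os
    import tarfile
    import urllib.request

    def fetch_once(url, archive_path, extract_dir):
        os.makedirs(extract_dir, exist_ok=True)     # mkdir -p
        if not os.path.exists(archive_path):        # skip re-download
            urllib.request.urlretrieve(url, archive_path)
        with tarfile.open(archive_path) as tar:
            tar.extractall(extract_dir)

    # Placeholder URL/paths, not the real ones used by build.sh:
    fetch_once("https://example.com/vocabulary.tar.gz",
               "Vocabulary/vocabulary.tar.gz", "Vocabulary")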

ATE.RMSE

I used TUM's evaluate_ate.py to get the ATE RMSE, but I can't reproduce the paper's result. Also, why is the trajectory result different between runs when I use the same hf-net output?
What tool did you use to get the ATE RMSE? Can you tell me? I want to reproduce your experiment.
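For reference, the usual way to obtain an ATE RMSE number is to rigidly align the estimated positions to the associated ground-truth positions (Horn/Kabsch) and take the RMSE of the remaining translation errors. The sketch below illustrates only that computation and is not the TUM benchmark's evaluate_ate.py (which additionally associates by timestamp and offers plotting options):

    # Minimal sketch of the ATE RMSE computation: rigidly align the estimated
    # positions to the ground truth, then take the RMSE of the residual
    # translations.  Assumes the two trajectories are already associated by
    # timestamp and given as N x 3 arrays of positions.
    import numpy as np

    def ate_rmse(est, gt):
        mu_e, mu_g = est.mean(axis=0), gt.mean(axis=0)
        E, G = est - mu_e, gt - mu_g
        U, _, Vt = np.linalg.svd(E.T @ G)                 # cross-covariance SVD
        S = np.diag([1.0, 1.0, np.sign(np.linalg.det(U @ Vt))])
        R = Vt.T @ S @ U.T                                # rotation est -> gt
        t = mu_g - R @ mu_e
        err = est @ R.T + t - gt                          # residuals after alignment
        return np.sqrt((err ** 2).sum(axis=1).mean())     # ATE RMSE in meters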

OpenVINO model conversion

Hi, first of all thanks for your work.

I am trying to reproduce the conversion of the HF-Net model used in your code to the OpenVINO IR format.
Could you clarify the following:

  1. Which version of the Model Optimizer did you use to convert the model?
  2. Is it possible to share the parameters for mo_tf.py (the Model Optimizer conversion script)?
    When I tried to convert HF-Net, the Model Optimizer could not infer shapes/values for each output node (global descriptor, local descriptor and keypoints).
    I assume you used the HF-Net TensorFlow model as in the README, with the saved_model_dir parameter of mo_tf.py.

Also, as mentioned in your paper, "Most of the layers in HF-Net can be directly processed by the Model Optimizer, except the bilinear interpolation operation for local descriptor upsampling, which is not yet supported". Could you point out/release the part of the code for the post-processing stage after OpenVINO inference?
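As a rough illustration of what such a post-processing stage typically does (this is a generic sketch, not the authors' released code): the dense local-descriptor map produced by the network is sampled at sub-pixel keypoint locations with bilinear interpolation and then L2-normalized.

    # Generic sketch of bilinear descriptor sampling as a post-processing step
    # (not the authors' code).  desc_map: (Hc, Wc, D) dense descriptor map at
    # 1/stride resolution; keypoints: (N, 2) array of (x, y) in image pixels.
    import numpy as np

    def sample_descriptors(desc_map, keypoints, stride=8):
        coords = keypoints / float(stride)              # to descriptor-grid coords
        x, y = coords[:, 0], coords[:, 1]
        x0f, y0f = np.floor(x), np.floor(y)
        wx, wy = (x - x0f)[:, None], (y - y0f)[:, None]  # bilinear weights
        Hc, Wc = desc_map.shape[0], desc_map.shape[1]
        x0 = np.clip(x0f.astype(int), 0, Wc - 1)
        y0 = np.clip(y0f.astype(int), 0, Hc - 1)
        x1 = np.clip(x0 + 1, 0, Wc - 1)
        y1 = np.clip(y0 + 1, 0, Hc - 1)
        d = (desc_map[y0, x0] * (1 - wx) * (1 - wy) +
             desc_map[y0, x1] * wx       * (1 - wy) +
             desc_map[y1, x0] * (1 - wx) * wy +
             desc_map[y1, x1] * wx       * wy)
        return d / np.linalg.norm(d, axis=1, keepdims=True)   # L2-normalize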

Tracking lost easily on TUM fr3_walking_xyz sequence

Hi, I followed the instructions and successfully ran DXSLAM on the TUM fr3_walking_xyz sequence. Everything seems fine at the beginning, but soon tracking gets lost and never recovers.

I use TUM3.yaml from ORB-SLAM2 and DXSLAM.fbow from Vocabulary. The output is as follows:

 DXSLAM 


Loading ORB Vocabulary. This could take a while...
Vocabulary loaded!
load time:37.5962


Camera Parameters: 
- fx: 535.4
- fy: 539.2
- cx: 320.1
- cy: 247.6
- k1: 0
- k2: 0
- p1: 0
- p2: 0
- fps: 30
- color order: RGB (ignored if grayscale)

Depth Threshold (Close/Far Points): 2.98842

-------
Start processing sequence ...
Images in the sequence: 827

New map created with 340 points
match numbers: 468
nmatchesMap: 206
match numbers: 297
nmatchesMap: 32
match numbers: 222
nmatchesMap: 24
match numbers: 278
nmatchesMap: 33
match numbers: 298
nmatchesMap: 56
match numbers: 303
nmatchesMap: 54
match numbers: 143
nmatchesMap: 41
match numbers: 164
nmatchesMap: 36
match numbers: 169
nmatchesMap: 42
-------

median tracking time: 0.368993
mean tracking time: 0.284506

Saving keyframe trajectory to KeyFrameTrajectory.txt ...

trajectory saved!

There may be an error in the README

Thank you for generously sharing the open source code.
In README.md
3. RGB-D Example
3.4 Execute the following......
./Examples/RGB-D/rgbd_tum Vocabulary/ORBvoc.txt Examples/RGB-D/TUMX.yaml ./Examples/RGB-D/rgbd_tum Vocabulary/ORBvoc.txt Examples/RGB-D/TUMX.yaml PATH_TO_SEQUENCE_FOLDER ASSOCIATIONS_FILE OUTPUT/FEATURE/PATH
Following this instruction will cause an error. Perhaps DXSLAM.fbow should be used here instead of ORBvoc.txt:
./Examples/RGB-D/rgbd_tum Vocabulary/DXSLAM.fbow Examples/RGB-D/TUMX.yaml ./Examples/RGB-D/rgbd_tum Vocabulary/ORBvoc.txt Examples/RGB-D/TUMX.yaml PATH_TO_SEQUENCE_FOLDER
After the replacement, it works fine.

Online version

Hi, I came across this paper today and it looks really nice!

I'm really interested in the online CPU implementation of the feature detector using the OpenVINO framework. Are you planning to release that part of the code?

Best,
Matias

The error obtained when reproducing the TUM RGB-D data set is very large.

I downloaded the image (yerld/dxslam-built) in Docker and tested the TUM RGB-D rgbd_dataset_fr3_walking_xyz dataset. But the KeyFrameTrajectory.txt trajectory drawn using evo is very different from the ground truth, and the same is true for rgbd_dataset_freiburg1_xyz.
What's wrong with my experiment?

Is the clone code in README correct?

Hi, Thank you for your open source.

I found that the git clone command is the following:
git clone https://github.com/raulmur/DXSLAM.git DXSLAM
Is the GitHub user name correct?

This may be a BUG in /src/Matcher.cc

I think this may be a bug in src/Matcher.cc, on lines 68 to 75. The original code is as follows:

        // The size of the window will depend on the viewing direction
        float r = RadiusByViewingCos(pMP->mTrackViewCos);

        if(bFactor)
            r*=th;

        const std::vector<size_t> vIndices =
                F.GetFeaturesInArea(pMP->mTrackProjX,pMP->mTrackProjY,1,nPredictedLevel-1,nPredictedLevel);

The comment says "The size of the window will depend on the viewing direction", but here the size passed to GetFeaturesInArea is fixed to 1 pixel, which is strange. After testing, I found that setting the size to 1.2*r makes tracking more robust.
The modified code is as follows:

        // The size of the window will depend on the viewing direction
        float r = RadiusByViewingCos(pMP->mTrackViewCos);

        if(bFactor)
            r*=th;

        const std::vector<size_t> vIndices =
                F.GetFeaturesInArea(pMP->mTrackProjX,pMP->mTrackProjY,1.2*r,nPredictedLevel-1,nPredictedLevel);

Compile error, what is going on?

CMakeFiles/DXSLAM.dir/build.make:446: recipe for target 'CMakeFiles/DXSLAM.dir/src/Initializer.cc.o' failed
make[2]: *** [CMakeFiles/DXSLAM.dir/src/Initializer.cc.o] Error 1
CMakeFiles/Makefile2:104: recipe for target 'CMakeFiles/DXSLAM.dir/all' failed
make[1]: *** [CMakeFiles/DXSLAM.dir/all] Error 2
Makefile:83: recipe for target 'all' failed
make: *** [all] Error 2

Exact version of prerequisites

Hi,
I'm trying to run the TUM RGB-D demo, but I'm a total newcomer to this field and have spent a lot of time configuring the environment. Could you please provide the exact versions of the prerequisites? Which CUDA and OpenCV versions can run this work?
Looking forward to your reply. Thank you so much.

compile error

‘int DXSLAM::Initializer::CheckRT(const cv::Mat&, const cv::Mat&, const std::vector<cv::KeyPoint>&, const std::vector<cv::KeyPoint>&, const std::vector<std::pair<int, int> >&, std::vector<bool>&, const cv::Mat&, std::vector<cv::Point3_<float> >&, float, std::vector<bool>&, float&)’:
/home/mengzhe/dxslam/src/Initializer.cc:822:40: error: ‘isfinite’ was not declared in this scope
if(!isfinite(p3dC1.at<float>(0)) || !isfinite(p3dC1.at<float>(1)) || !isfinite(p3dC1.at<float>(2)))

Get features from a GPU-trained hf-net

First of all, thank you for your great work.
I want to use DXSLAM to get features from an HF-Net re-trained on GPU. I compared the two graphs (yours, optimized with OpenVINO, and mine, trained on GPU) in TensorBoard and observed a significant difference. The optimized graph has two extra parts, simple_nms and top_k_keypoints, which include the two tensors used in the getFeature.py script:
pred/simple_nms/radius:0
pred/top_k_keypoints/k:0
These do not exist in the re-trained HF-Net.
Is this difference a modification that was made for some reason, or is it just the result of the OpenVINO optimization?

How could I get features from an HF-Net re-trained on GPU?

how do I train the bag of words in this code?

Sorry to ask a potentially basic question: how do I train the bag of words in this code? To be precise, how can fbow be used to train a bag of words for HF-Net features? The fbow example code only covers several mainstream features, such as ORB and SIFT. Thank you for your answers.
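For what it's worth, bag-of-words vocabularies of the kind fbow loads are generally built by hierarchical k-means over a large sample of descriptors. The sketch below only illustrates that idea in Python, under the assumption that HF-Net local descriptors are stacked into an N x 256 array; it is not the fbow training code, which is written in C++.

    # Conceptual sketch of vocabulary-tree training: hierarchical k-means over
    # a large descriptor sample.  Not the fbow tool itself, just the idea.
    import numpy as np
    from sklearn.cluster import KMeans

    def build_tree(descriptors, k=10, depth=4):
        """Recursively cluster an (N, D) descriptor array into a k-ary tree."""
        if depth == 0 or len(descriptors) < k:
            return {"center": descriptors.mean(axis=0), "children": []}
        km = KMeans(n_clusters=k, n_init=3).fit(descriptors)
        children = [build_tree(descriptors[km.labels_ == i], k, depth - 1)
                    for i in range(k)]
        return {"center": descriptors.mean(axis=0), "children": children}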
