Code Monkey home page Code Monkey logo

monodepth's Introduction

Mono Depth ROS

Configuration

  • Topics subscribed by the ROS node
    • /image/camera_raw - Input image from camera (can be changed on the parameter topic_color)
  • Topics published by the ROS node, containing depth and point cloud data generated.
    • /image/depth - Image message containing the depth image estimated (can be changed on the parameter topic_depth).
    • /pointcloud - Pointcloud2 message containing a estimated point cloud (can be changed on the parameter topic_pointcloud).
  • Parameters that can be configurated
    • frame_id - TF Frame id to be published in the output messages.
    • debug - If set true a window with the output result if displayed.
    • min_depth, max_depth - Min and max depth values considered for scaling.
    • batch_size - Batch size used when predicting the depth image using the model provided.
    • model_file - Keras model file used, relative to the monodepth package.

Setup

  • Install Python 2 and ROS dependencies
apt-get install python python-pip curl
pip install rosdep rospkg rosinstall_generator rosinstall wstool vcstools catkin_tools catkin_pkg
  • Install project dependencies
pip install tensorflow keras pillow matplotlib scikit-learn scikit-image opencv-python pydot GraphViz tk
  • Clone the project into your ROS workspace and download pretrained models
git clone https://github.com/tentone/monodepth.git
cd monodepth/models
curl –o nyu.h5 https://s3-eu-west-1.amazonaws.com/densedepth/nyu.h5

Launch

  • Example ROS launch entry provided bellow, for easier integration into your already existing ROS launch pipeline.
<node pkg="monodepth" type="monodepth.py" name="monodepth" output="screen" respawn="true">
    <param name="topic_color" value="/camera/image_raw"/>
    <param name="topic_depth" value="/camera/depth"/>
</node>

Pretrained models

Datasets for training

  • NYU Depth V2 (50K)
    • The NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinect.
    • Download dataset (4.1 GB)
  • KITTI Dataset (80K)
    • Datasets captured by driving around the mid-size city of Karlsruhe, in rural areas and on highways. Up to 15 cars and 30 pedestrians are visible per image.

monodepth's People

Contributors

tentone avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

monodepth's Issues

Slow performance

Hello sir, good job on the implementation but I was wondering..... Is there a faster way to get pointcloud from mono image?? I think the nested loop inside the "create_pointcloud_msg" function is the reason behind the slow performance

GPU utilization is very low(26%), GPU‘s memory is almost full

When I run this project, the depth map will change after the object in the camera changes for about 8 seconds. The release frequency of the depth image is about 3hz, and the GPU's memory is almost full, but the GPU utilization rate is very low, close to 26%.The GPU is an Nvidia RTX 2060.how to solve this problem?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.