Code Monkey home page Code Monkey logo

meta-rangeseg's Introduction

Meta-RangeSeg

Song Wang, Jianke Zhu*, Ruixiang Zhang

This is the official implementation of Meta-RangeSeg: LiDAR Sequence Semantic Segmentation Using Multiple Feature Aggregation [Paper] [Video]. arXiv PWC

Prediction Groud Truth
Perspective View z z
Bird's-Eye View z z

Demo

Model Zoo

Model Task mIoU(paper)
[on test set]
mIoU(reprod.)
[on test set]
Results
Meta-RangeSeg_v1 multiple scans semantic segmentation 49.5 49.7 valid_pred
test_pred

Abstract

LiDAR sensor is essential to the perception system in autonomous vehicles and intelligent robots. To fulfill the real-time requirements in real-world applications, it is necessary to efficiently segment the LiDAR scans. Most of previous approaches directly project 3D point cloud onto the 2D spherical range image so that they can make use of the efficient 2D convolutional operations for image segmentation. Although having achieved the encouraging results, the neighborhood information is not well-preserved in the spherical projection. Moreover, the temporal information is not taken into consideration in the single scan segmentation task. To tackle these problems, we propose a novel approach to semantic segmentation for LiDAR sequences named Meta-RangeSeg, where a novel range residual image representation is introduced to capture the spatial-temporal information. Specifically, Meta-Kernel is employed to extract the meta features, which reduces the inconsistency between the 2D range image coordinates input and Cartesian coordinates output. An efficient U-Net backbone is used to obtain the multi-scale features. Furthermore, Feature Aggregation Module (FAM) aggregates the meta features and multi-scale features, which tends to strengthen the role of range channel. We have conducted extensive experiments for performance evaluation on SemanticKITTI, which is the de-facto dataset for LiDAR semantic segmentation. The promising results show that our proposed Meta-RangeSeg method is more efficient and effective than the existing approaches.

TO DO

This paper is under review currently. Code will be made publicly available soon!

  • Add pretrained model and results for multiple scans
  • Add pretrained model and results for single scan
  • Add code for inference
  • Add demo for Meta-RangeSeg
  • Add full training code and results

Citations

@misc{wang2022metarangeseg,
      title={Meta-RangeSeg: LiDAR Sequence Semantic Segmentation Using Multiple Feature Aggregation}, 
      author={Song Wang and Jianke Zhu and Ruixiang Zhang},
      year={2022},
      eprint={2202.13377},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

meta-rangeseg's People

Contributors

songw-zju avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.