Code Monkey home page Code Monkey logo

2017-lidar-segmentation-reading-list's Introduction

2017-Summer-Reading-List

My reading list for my Summer 2017 internship at Voyage. Material that I found useful or interesting during work.

Paper Link Paper Title Paper Description
Link Multi-View 3D Object Detection Network for Autonomous Driving Baidu paper that uses object region proposals followed by ROI pooling to generate feature maps for multiple perspeectives (LiDAR Bird's Eye View, LiDAR Fron View, RGB Front View) and then a deep fusion network to generate 3D bounding boxes and class scores.
Link Real-time Object Classification in 3D Point Clouds Using Point Feature Histograms Uses a 2D occupancy grid and connected components to generate non-overlapping bounding boxes. Then generates feature histrograms for each object proposal to classify each object with an SVM. Operates at 10 hz with good performance
Link Segmentation of 3D Lidar Data in non-flat Urban Environments using a Local Convexity Criterion Uses a heuristic on LiDAR scanners to generate graph directly from the LiDAR scan (without going to point cloud). This graph is then used to quickly generate a attribute at each point which is an approximation for the surface normal. They use this to estimate object segmentations by considering two points in the same object if they are "locally convex".
Link On the Segmentation of 3D LIDAR Point Clouds Provides a few different methods for scene segmentation from 3D point clouds. Discusses different representations of 3D point clouds that make segmentation more efficient. Shows that removal of ground points is a good way to improve performance.
Link Deep Watershed Transform for Instance Segmentation Trains a network to take in RGB image data and learn two masks. The first mask represents the direction at each pixel to the nearest object boundary and the second mask represents the distance to the nearest object boundary. This distance mask is then used as a representation of watershed energy and object boundaries are inferred. Very good instance segmentation results.
Link Rich feature hierarchies for accurate object detection and semantic segmentation (R-CNN) The R-CNN paper. Extracts region proposals and generates class scores for each one.
Link Fast R-CNN Speeds up R-CNN by sharing computation on object proposals
Link Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Uses k anchors and a sliding window to generate region proposals.
Link Mask R-CNN Improves the idea further by using aligning the regions of interest before transforming a region of interest into a fixed size region of interest feature map. They call this RoI align and it improves performance.
Link A Brief History of CNNs in Image Segmentation: From R-CNN to Mask R-CNN Good review of the history of R-CNN and related papers.
Link PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation Generates semantic segmentation from unordered point cloud data.
Link You Only Look Once: Unified, Real-Time Object Detection Convolutional object detection but with a single pass. Uses anchors and fixed grid to generate a fixed number of bounding boxes with class scores and probabilities. Idea is that R-CNN and region proposal networks are over complicated and that just making it a single network is a good simple idea to explore.
Link YOLO9000: Better, Faster, Stronger YOLO v2 proposes improvements to their previous work that improve performance. Removes fully connected layers, increases resolution so that their detector network doesnt have to learn a new resolution, bounds offsets with a logistic activation so that the model cant put proposals anywhere in the image but only in the local anchor area this makes the model more stable, changes classification vector to not be 1000 long but 1369 long to represent probabilities of intermediate nodes in imagenet and softmaxes along branching points
Link Deep Semantic Classification for 3D LiDAR Data They use both an objectness score and a “dynamicity” score to detect and classify objects. Also some other things going on, good read.

Also this:

https://github.com/takeitallsource/awesome-autonomous-vehicles

2017-lidar-segmentation-reading-list's People

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.