Light

coderhaoranlee / 2017-lidar-segmentation-reading-list Goto Github PK

View Code? Open in Web Editor NEW

0.0 2.0 0.0 13 KB

My reading list for my Summer 2017 internship at Voyage. Material that I found useful or interesting during work.

2017-lidar-segmentation-reading-list's Introduction

2017-Summer-Reading-List

My reading list for my Summer 2017 internship at Voyage. Material that I found useful or interesting during work.

Paper Link	Paper Title	Paper Description
Link	Multi-View 3D Object Detection Network for Autonomous Driving	Baidu paper that uses object region proposals followed by ROI pooling to generate feature maps for multiple perspeectives (LiDAR Bird's Eye View, LiDAR Fron View, RGB Front View) and then a deep fusion network to generate 3D bounding boxes and class scores.
Link	Real-time Object Classification in 3D Point Clouds Using Point Feature Histograms	Uses a 2D occupancy grid and connected components to generate non-overlapping bounding boxes. Then generates feature histrograms for each object proposal to classify each object with an SVM. Operates at 10 hz with good performance
Link	Segmentation of 3D Lidar Data in non-flat Urban Environments using a Local Convexity Criterion	Uses a heuristic on LiDAR scanners to generate graph directly from the LiDAR scan (without going to point cloud). This graph is then used to quickly generate a attribute at each point which is an approximation for the surface normal. They use this to estimate object segmentations by considering two points in the same object if they are "locally convex".
Link	On the Segmentation of 3D LIDAR Point Clouds	Provides a few different methods for scene segmentation from 3D point clouds. Discusses different representations of 3D point clouds that make segmentation more efficient. Shows that removal of ground points is a good way to improve performance.
Link	Deep Watershed Transform for Instance Segmentation	Trains a network to take in RGB image data and learn two masks. The first mask represents the direction at each pixel to the nearest object boundary and the second mask represents the distance to the nearest object boundary. This distance mask is then used as a representation of watershed energy and object boundaries are inferred. Very good instance segmentation results.
Link	Rich feature hierarchies for accurate object detection and semantic segmentation (R-CNN)	The R-CNN paper. Extracts region proposals and generates class scores for each one.
Link	Fast R-CNN	Speeds up R-CNN by sharing computation on object proposals
Link	Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks	Uses k anchors and a sliding window to generate region proposals.
Link	Mask R-CNN	Improves the idea further by using aligning the regions of interest before transforming a region of interest into a fixed size region of interest feature map. They call this RoI align and it improves performance.
Link	A Brief History of CNNs in Image Segmentation: From R-CNN to Mask R-CNN	Good review of the history of R-CNN and related papers.
Link	PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation	Generates semantic segmentation from unordered point cloud data.
Link	You Only Look Once: Unified, Real-Time Object Detection	Convolutional object detection but with a single pass. Uses anchors and fixed grid to generate a fixed number of bounding boxes with class scores and probabilities. Idea is that R-CNN and region proposal networks are over complicated and that just making it a single network is a good simple idea to explore.
Link	YOLO9000: Better, Faster, Stronger	YOLO v2 proposes improvements to their previous work that improve performance. Removes fully connected layers, increases resolution so that their detector network doesnt have to learn a new resolution, bounds offsets with a logistic activation so that the model cant put proposals anywhere in the image but only in the local anchor area this makes the model more stable, changes classification vector to not be 1000 long but 1369 long to represent probabilities of intermediate nodes in imagenet and softmaxes along branching points
Link	Deep Semantic Classification for 3D LiDAR Data	They use both an objectness score and a “dynamicity” score to detect and classify objects. Also some other things going on, good read.

Also this:

https://github.com/takeitallsource/awesome-autonomous-vehicles

2017-lidar-segmentation-reading-list's People

Watchers

Forkers

jjdblast jianshuzhao einsnull nivir hyh21521038 sanketgujar nishanthjois wutaoo

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.