nuInsSeg

This repository provides comprehensive information about the nuInsSeg dataset.

[Figure: nuInsSeg instance segmentation annotation demo]

1.Introduction

Existing large-scale autonomous driving datasets provide multi-modal data, but we found that they lack annotations that are both multi-modal and consistent across modalities. We therefore supplement the nuScenes dataset with image instance segmentation annotations and call the result nuInsSeg. As far as we know, this is the first open-source autonomous driving dataset that contains LiDAR point clouds, RGB images, manual 2D annotations (2D detection boxes, 2D instance segmentation), and 3D annotations (3D detection boxes...), with instance-level consistency between the 2D and 3D annotations. In total, we added 947,072 2D annotations (detection boxes and instance segmentation masks) to the 204,894 images in the nuScenes-trainval data.

We hope nuInsSeg helps researchers study large-scale autonomous driving scenarios, especially from the perspective of multimodal sensor fusion.

2.Statistics

The nuScenes training and validation sets contain 1,166,187 3D detection box annotations. Because many objects are occluded or unclear from the cameras' point of view, there are fewer corresponding 2D annotations. The statistics for the nuInsSeg training and validation sets are given below; the target categories are the same as in the nuScenes detection task.

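nuInsSeg training set class distribution

| category | car | bus | motorcycle | barrier | truck | const.veh | pedestrian | trailer | bicycle | traff.cone | total |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| instances | 357006 | 12917 | 7483 | 103028 | 63532 | 8413 | 143857 | 14824 | 7241 | 70892 | 789193 |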

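nuInsSeg validation set class distribution

| category | car | bus | motorcycle | barrier | truck | const.veh | pedestrian | trailer | bicycle | traff.cone | total |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| instances | 68578 | 3084 | 1934 | 21964 | 14179 | 1836 | 27778 | 2695 | 1942 | 13889 | 157879 |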
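As a quick sanity check, the per-class counts in the two tables above can be summed to confirm the stated totals and the overall figure of 947,072 2D annotations. The snippet below is only an illustrative sketch; the variable names are our own and are not part of any nuInsSeg tooling.

```python
# Per-class instance counts copied from the two tables above.
CLASSES = ["car", "bus", "motorcycle", "barrier", "truck",
           "const.veh", "pedestrian", "trailer", "bicycle", "traff.cone"]
TRAIN = [357006, 12917, 7483, 103028, 63532, 8413, 143857, 14824, 7241, 70892]
VAL = [68578, 3084, 1934, 21964, 14179, 1836, 27778, 2695, 1942, 13889]

train_total = sum(TRAIN)
val_total = sum(VAL)
grand_total = train_total + val_total

# Share of each class in the training set, as a percentage.
train_share = {name: 100.0 * n / train_total for name, n in zip(CLASSES, TRAIN)}

print(train_total, val_total, grand_total)  # 789193 157879 947072
```

The training set is dominated by cars (roughly 45% of instances), while classes such as bicycle and motorcycle each account for about 1%, which is worth keeping in mind when reading per-class results.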

3.Benchmark

Here we provide benchmarks for instance segmentation and weakly supervised instance segmentation on the nuInsSeg dataset. The categories are the same as in the nuScenes detection task.

We adopt the standard instance segmentation evaluation metrics used in Mask R-CNN: AP (average precision averaged over IoU thresholds), AP50, AP75, and APs, APm, APl (AP for small, medium, and large objects). In our experiments, models are implemented using their official codebases without modification.
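To make the metric names concrete, here is a minimal sketch of the ingredients behind them, assuming the COCO conventions that Mask R-CNN reports: mask IoU, the ten IoU thresholds from 0.50 to 0.95, and the pixel-area buckets for small, medium, and large objects. The function names are hypothetical, and this is not the evaluation code used for the benchmark; a full AP computation also needs confidence-ranked matching and precision-recall integration, which are omitted here.

```python
def mask_iou(pred, gt):
    """IoU of two binary masks given as equal-sized nested lists of 0/1."""
    inter = union = 0
    for row_p, row_g in zip(pred, gt):
        for p, g in zip(row_p, row_g):
            inter += 1 if (p and g) else 0
            union += 1 if (p or g) else 0
    return inter / union if union else 0.0

# COCO-style IoU thresholds 0.50, 0.55, ..., 0.95. AP averages over all ten;
# AP50 and AP75 use the single thresholds 0.50 and 0.75.
IOU_THRESHOLDS = [round(0.50 + 0.05 * i, 2) for i in range(10)]

def size_bucket(area):
    """Assumed COCO area buckets (in pixels) behind APs / APm / APl."""
    if area < 32 ** 2:
        return "small"
    if area < 96 ** 2:
        return "medium"
    return "large"

pred = [[0, 1, 1, 0],
        [0, 1, 1, 0],
        [0, 0, 0, 0]]
gt = [[0, 1, 1, 1],
      [0, 1, 1, 1],
      [0, 0, 0, 0]]

iou = mask_iou(pred, gt)  # intersection 4 pixels, union 6 pixels
# Thresholds at which this prediction would count as a true positive.
matched_at = [t for t in IOU_THRESHOLDS if iou >= t]
print(round(iou, 3), matched_at)
```

In this toy case the prediction counts as a match at AP50 but not at AP75, which is why AP75 is usually the stricter number.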

Task1: Instance Segmentation

Instance segmentation aims to recognize distinct object instances in an image by predicting a pixel-level category and instance identity, and it benefits a wide range of applications such as autonomous driving. Here we provide the results of several competitive models on the nuInsSeg dataset (official code).

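| Model | Backbone | Lr_schedule | Mask_AP | Download | Log |
| --- | --- | --- | --- | --- | --- |
| Mask R-CNN | R-50 | 1x | 47.55 | link | link |
| Mask R-CNN | R-101 | 1x | 49.13 | link | link |
| PointRend | R-50 | 1x | 48.03 | link | link |
| PointRend | R-101 | 1x | 49.12 | link | link |
| CondInst | R-50 | 1x | 44.88 | link | link |
| CondInst | R-101 | 1x | 46.83 | link | link |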

R-50 and R-101 are pretrained on ImageNet. 'Lr_schedule' denotes the number of training iterations; '1x' corresponds to 70k iterations.

Task2: Weakly Supervised Instance Segmentation

Weakly supervised instance segmentation aims to segment objects using simple, cheap annotations such as image-level tags, scribbles, and bounding boxes instead of expensive pixel-level annotations.

| Model | Backbone | Annotations | Lr_schedule | Mask_AP | Download | Log |
| --- | --- | --- | --- | --- | --- | --- |
| BoxInst | R-50 | box | 1x | 33.65 | link | link |
| BoxInst | R-101 | box | 1x | 34.39 | link | link |
| PointSup | R-50 | box+point | 1x | 43.80 | link | link |
| PointSup | R-101 | box+point | 1x | 44.72 | link | link |
| LWSIS+BoxInst | R-50 | 3dbox+pc | 1x | 35.65 | link | link |
| LWSIS+BoxInst | R-101 | 3dbox+pc | 1x | 36.22 | link | link |
| LWSIS+PointSup | R-50 | 3dbox+pc | 1x | 45.46 | link | link |
| LWSIS+PointSup | R-101 | 3dbox+pc | 1x | 46.17 | link | link |

The annotation types used in the experiments are as follows. 'box' means using only the 2D bounding box annotation for each instance; 'point' means using a fixed number of human-annotated points, each labeled as background or foreground; '3dbox' means using the 3D bounding box annotation for each instance; and 'pc' means the original point cloud.
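To illustrate how much weaker the 'box' and 'point' signals are than a full mask, the sketch below derives both from a toy binary mask. It is only an illustration under our own assumptions: the function names are hypothetical, and the points are sampled at random and labeled from the mask, whereas the 'point' annotations described above are labeled by humans.

```python
import random

def mask_to_box(mask):
    """Tight 2D box (x_min, y_min, x_max, y_max) around the nonzero pixels.

    Assumes the mask contains at least one foreground pixel.
    """
    ys = [y for y, row in enumerate(mask) if any(row)]
    xs = [x for row in mask for x, v in enumerate(row) if v]
    return min(xs), min(ys), max(xs), max(ys)

def sample_points(mask, box, n, seed=0):
    """Pick n pixels inside the box; label each 1 (foreground) or 0 (background)."""
    rng = random.Random(seed)
    x0, y0, x1, y1 = box
    points = []
    for _ in range(n):
        x, y = rng.randint(x0, x1), rng.randint(y0, y1)
        points.append((x, y, int(bool(mask[y][x]))))
    return points

mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 1, 1, 1, 1, 0],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]

box = mask_to_box(mask)                 # 'box' supervision: 4 numbers per instance
points = sample_points(mask, box, n=5)  # 'point' supervision: n labeled pixels
print(box, points)
```

The box covers 12 pixels here while the object itself covers only 7, so a box alone leaves the label of the remaining pixels ambiguous; a handful of labeled points narrows that ambiguity at low annotation cost.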

LWSIS is the method used in our paper "LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving."

4.Tutorial

We provide a development toolkit for easy access to the instance segmentation results: nuInsSeg-devkit.

5.Download

Here we provide the download links for the nuInsSeg dataset.

  1. nuInsSeg

  2. nuInsSeg-mini

6.Acknowledgement

We only add instance segmentation annotations on top of the nuScenes dataset, with the aim of supporting research on multimodal perception. Many thanks to nuScenes for its contributions to open-source data.

nuinsseg's Issues

missing nuinsseg.json from v1.0-trainval

Hi, thanks a lot for providing the instance GT. However, it seems that the GT file for v1.0 is the same as the one for v1.0-mini. Could you perhaps provide that annotation file, too? Many thanks in advance!
