Comments (5)
At present, there are only several public semantic segmentation datasets available, which focus only on certain applications. MS COCO [1] provides semantic segmentation dataset containing common objects recognition in common scenes, and its semantic labelling task focuses on person, car, animal and different stuffs. Pascal VOC dataset [2] also provides objects like bus, car, cow, dog for semantic segmentation task. Other semantic segmentation datasets are designed for street scene objects recognition. Their target objects include pedestrians, cars, roads, lanes, traffic lights, trees and other street scene related objects. Specially, CamVid [3] provides continuously labelled driving frames, which can be used for temporal consistency evaluation. Highway Driving dataset [4] provides 30Hz labels that are even denser in temporal domain, and it is designed for semantic video segmentation for driving scenes. Daimler Urban Segmentation dataset [5] is also a video dataset for street scene understanding, but its labels are sparser in temporal domain. Cityscapes dataset [6] focuses more on data variation as it is much larger in the number of labelled frames, which are collected from 50 cities, making it closer to real world complexity. Each frame is much larger in size compared with CamVid. The newly published Berkeley Deep Drive dataset [7] has even more image labels with medium image size across multiple street scenes. The KITTI Vision Benchmark Suite [8] also provides images of medium size for the task. To help learning models to generalize well across different scenes, ADE20K dataset [9] contributes as it spans more diverse scenes, and objects from much more different categories are labelled. ADE20K dataset brings more variability and complexity for general object representations in images. For remote sensing community, aerial image dataset is provided for ISPRS 2D semantic labelling contest [10]. All datasets above have had great impacts on the development of current state-of-the-art semantic segmentation methods.
目前,只有几种公共语义分割数据集可用,它们只关注某些应用。 MS COCO [1]提供了在常见场景中包含共同对象识别的语义分割数据集,其语义标注任务侧重于人,车,动物和不同的东西。 Pascal VOC数据集[2]还提供了用于语义分割任务的公共汽车,汽车,牛,狗等对象。其他语义分割数据集被设计用于街道场景对象识别。他们的目标对象包括行人,汽车,道路,车道,交通灯,树木和其他街道场景相关的物体。特别地,CamVid [3]提供连续标记的驾驶图像,可用于时间一致性评估。高速公路驾驶数据集[4]提供30Hz标签,在时域中更加密集,它被设计用于驾驶场景的语义视频分割。戴姆勒城市分割数据集[5]也是用于街道场景理解的视频数据集,但其标签在时域中较为稀疏。城市景观数据集[6]更侧重于数据变化,因为标记帧的数量要大得多,这些帧从50个城市收集,使其更接近现实世界的复杂性。与CamVid相比,每个框架的尺寸都要大得多。新发布的Berkeley Deep Drive数据集[7]在多个街景中拥有更多具有中等图像尺寸的图像标签。 KITTI Vision Benchmark Suite [8]还提供适合该任务的中等大小的图像。为了帮助学习模型在不同场景中得到很好的推广,ADE20K数据集[9]有助于跨越更多样化的场景,并标记来自更多不同类别的对象。 ADE20K数据集为图像中的一般对象表示带来了更多的可变性和复杂性。对于遥感社区,为ISPRS 2D语义标记竞赛[10]提供了航空图像数据集。上述所有数据集都对当前最先进的语义分割方法的发展产生了很大的影响。
As present, most of the modern visual semantic segmentation tasks use information acquired on the ground. However, another data acquisition platform is more and more utilized, which is the unmanned aerial vehicle(UAV). Compact and light weighted UAVs are a trend for future data acquisition. The UAVs make image retrieval in large area cheaper and more convenient, which allows quick access to useful information around certain area. Distinguished from collecting images by satellites, UAVs capture images from the sky with flexible flying schedule and higher resolution, bringing the possibility to monitor and analyze landscape at specific location and time swiftly. These abilities make UAVs an effective data collection means for various applications.
目前大部分视觉语义分割数据集从地面进行采集。UAV作为数据采集平台,具有cheaper和more convenient的优点,同时具有更高的分辨率。
from semseg.
数据集示例
标注的类别示例
标注类别的像素直方图
数据集分割
from semseg.
网络架构
from semseg.
实验结果
from semseg.
总结与展望
from semseg.
Related Issues (20)
- DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs HOT 3
- 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation
- ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network
- Improving Semantic Segmentation via Video Propagation and Label Relaxation
- Autofocus Layer for Semantic Segmentation
- Path Aggregation Network for Instance Segmentation
- Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds
- CFUN: Combining Faster R-CNN and U-net Network for Efficient Whole Heart Segmentation
- Tree-structured Kronecker Convolutional Networks for Semantic Segmentation
- Scale-adaptive Convolutions for Scene Parsing
- Adaptive Affinity Fields for Semantic Segmentation
- BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation HOT 4
- Learning Deconvolution Network for Semantic Segmentation HOT 3
- Bridging Category-level and Instance-level Semantic Image Segmentation
- Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes HOT 4
- Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network HOT 4
- Context Encoding for Semantic Segmentation HOT 3
- gcn HOT 1
- 请问这个项目可以预测视频分割吗,里面有代码吗 HOT 3
- 我用camvid数据集来训练drn时,显示分类权重错误
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from semseg.