Code Monkey home page Code Monkey logo

persformer_3dlane's Introduction

PersFormer

pipeline

PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark
Li Chen∗†, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan.

equal contributions.
corresponding authors
arXiV 2203.11089

Introduction

This repository is the PyTorch implementation for PersFormer.

PersFormer is an end-to-end monocular 3D lane detector with a novel Transformer-based spatial feature transformation module. Our model generates BEV features by attending to related front-view local regions with camera parameters as a reference. It adopts a unified 2D/3D anchor design and an auxiliary task to detect 2D/3D lanes simultaneously, enhancing the feature consistency and sharing the benefits of multi-task learning.

Changelog

2022-5-9: We compared our method on ONCES_3DLanes Dataset, where PersFormer also outperforms other methods.
2022-4-12: We released the v1.0 code for PersFormer.

Get Started

Installation

  • To run PersFormer, make sure you are using a machine with at least one GPU.
  • Please follow INSTALL.md to setup the environment.

Dataset

  • Please refer to OpenLane for downloading OpenLane Dataset.
  • Please refer to Gen-LaneNet for downloading Apollo 3D Lane Synthetic Dataset.

Training and evaluation

  • Please follow TRAIN_VAL.md to train and evaluate the model.

Benchmark

  • 3D Lane Detection Results (F-Score) in OpenLane.
Method All Up &
Down
Curve Extreme
Weather
Night Intersection Merge&
Split
GenLaneNet 29.7 24.2 31.1 26.4 17.5 19.7 27.4
3DLaneNet 40.2 37.7 43.2 43.0 39.3 29.3 36.5
PersFormer 47.8 42.4 52.8 48.7 46.0 37.9 44.6
  • 2D Lane Detection Results (F-Score) in OpenLane. Note that the baseline of 2D branch in PersFormer is LaneATT.
Method All Up&
Down
Curve Extreme
Weather
Night Intersection Merge&
Split
LaneATT-S 28.3 25.3 25.8 32.0 27.6 14.0 24.3
LaneATT-M 31.0 28.3 27.4 34.7 30.2 17.0 26.5
PersFormer 42.0 40.7 46.3 43.7 36.1 28.9 41.2
CondLaneNet-S 52.3 55.3 57.5 45.8 46.6 48.4 45.5
CondLaneNet-M 55.0 58.5 59.4 49.2 48.6 50.7 47.8
CondLaneNet-L 59.1 62.1 62.9 54.7 51.0 55.7 52.3
Method F1(%) Precision(%) Recall(%) CD error(m)
3DLaneNet 44.73 61.46 35.16 0.127
GenLaneNet 45.59 63.95 35.42 0.121
SALAD (paper of ONCE 3DLanes ) 64.07 75.90 55.42 0.098
PersFormer 74.33 80.30 69.18 0.074

Visualization

Following are the visualization results of PersFormer on OpenLane dataset and Apollo dataset.

  • OpenLane visualization results
    openlane_vis
  • Apollo 3D Synthetic visualization results
    apollo_vis

Citation

Please use the following citation if you find our repo or our paper PersFormer useful:

@article{chen2022persformer,
  title={PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark},
  author={Chen, Li and Sima, Chonghao and Li, Yang and Zheng, Zehan and Xu, Jiajie and Geng, Xiangwei and Li, Hongyang and He, Conghui and Shi, Jianping and Qiao, Yu and Yan, Junchi},
  journal={arXiv preprint arXiv:2203.11089},
  year={2022}
}  

Acknowledgements

We would like to acknowledge the great support from SenseBee labelling team at SenseTime Research, and the fruitful discussions and comments for this project from Zhiqi Li, Yuenan Hou, Yu Liu. We thank for the code implementation from Gen-LaneNet, LaneATT and Deformable DETR.

License

All code within this repository is under Apache License 2.0.

persformer_3dlane's People

Contributors

chonghaosima avatar dyfcalid avatar hli2020 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.