Code Monkey home page Code Monkey logo

auto-birds-eye's Introduction

Bird's Eye view generation and Mapping through end-to-end deep learning.

Open In Colab

First of all, this project wouldn't had been possible without Maciek Dziubiński.

The repository works on building a model that takes in input the front camera image and generates the top-down view as well as simultaneous map generation (Can be thought of 2D SLAM).

Tesla's autopilot results:

drawing

Our results (In CARLA Simulator):

drawing

Training:

3 Models were trained:

  • U-Net
  • Autoencoder
  • Deeper Autoencoder

Model was trained using following losses:

  • SSIM
  • Dice Loss
  • Cross Entropy Loss

drawing

drawing

Front view images:

drawing

Predicted Bird's eye view:

drawing

Ground Truth:

drawing

Testing on real world data (Comma10k dataset):

The model was trained on a simulator, but I was able to obtain good results after finetuning the model and some further image processing. See how the car is correctly localized to the right and th road curve is correctly detected.

drawing

Map Generation (In Progress) :

This is based on image stitching using map coordinate metadata. I noticed that SIFT/SURF, etc based image feature extractors and matchers do not work in this case since the images produced are very symmetrical and sparse in colors, therefore, no good/unique features can be extracted from these images to perform stitching based on pixels.

Hence, we use location coordintes based stitching. Images are simply overlapped along with correct rotation.

drawing

Dataset:

Dataset can be found here.

drawing

Although, the dataset contains images from five camera:

  • Front
  • Top
  • Left
  • Right
  • Rear

Note: For training the model, only front images were used to predict the half top-down image.

Try this out on Colab:

Open In Colab

References:

https://mono.software/2018/03/14/Image-stitching/

https://medium.com/asap-report/from-semantic-segmentation-to-semantic-birds-eye-view-in-the-carla-simulator-1e636741af3f

auto-birds-eye's People

Contributors

mankaransingh avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.