Code Monkey home page Code Monkey logo

object-detection-rcnn's Introduction

Object-Detection-RCNN Project

This project using RCNN (Region-CNN) for object detection.

The dataset is from CrowdAI, it contains 9,421 images with 72,064 entries of labels in three classes (car, truck, and pedestrain).

Project Setup

The pre-trained model is faster_rcnn_resnet101_coco available on TensorFlow website.

1. Set up the environment

  • Install Ubuntu 18.04
  • Install Nvidia driver 390 and Nvidia Docker 2
  • Pull the tensorflow:1.12.0-gpu-py3 image and run it using runtime of nvidia
  • Get into the container
  • Install the TensorFlow Object Detection APIs and COCO APIs

2. Prepare the scripts needed

3. Prepare tfrecord file using the dataset

  • Download the dataset mentioned above and extract it using tar -xvzf
  • Download the corrected .csv file
  • Split the .csv file into train.csv and test.csv
  • Used generate_tfrecord.py to convert the csv files into tfrecord

4. Retrieve pre-trained model

  • Download the pre-trained model from TensorFlow website
  • Extract

5. Config pipeline

  • Modify the pipeline.config file in the model
  • Set the classnames, tfrecords, and other parameters

6. Training

  • Set environment parameters
  • Run model_main.py with parameters to train
  • Launch Tensorboard for monitoring

7. Inference on specific images

  • Use the helper code from object_detection_tutorial.ipynb
  • Set the path to test images and image parameters
  • Show image with inferred labels

Result Evaluation

The model took 60 hours to train. After 50,000 steps of training, the Mean Average Precision (mAP) reached 0.3693. The mAP for large objects reached 0.5546.

object-detection-rcnn's People

Contributors

freshmand avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.