The evaluating-fusion-points-for-multi-stream-networks-handling-cross-modal-data from maodong2056

maodong2056 / evaluating-fusion-points-for-multi-stream-networks-handling-cross-modal-data Goto Github PK

View Code? Open in Web Editor NEW

This project uses RGB and Depth images as input into two different convolutional network of same architecture (namely VGGNet, RESNet, AlexNet) and fuses them

License: MIT License

Python 100.00%

evaluating-fusion-points-for-multi-stream-networks-handling-cross-modal-data's Introduction

Evaluating Fusion points for Multi-Stream Networks handling cross modal data

Object detection using RGB-D images has become a trending topic these days due to its numerous applications. This project uses RGB and Depth images as input into two different convolutional network of same architecture (namely VGGNet, RESNet, AlexNet) and fuses them in deeper layers to improves the class prediction performance with respect to various metrics like run-time, number of parameters and accuracy. Our approach compares the different possible fusion points in a network to come up with the best tradeoff between complexity and prediction.

Dataset:

We experimented with NYUD V2 dataset which is a collection of video sequences from various indoor scenes recorded by both RGB and depth cameras from Microsoft Kinect. To balance the number of examples in each class, the images are shifted in space or inverted vertically and/or horizontally adding some noise to generate new images for each class.

Pre-Requisites

Python 3.0 or higher
Tensorflow (Runs better on Tensorflow-gpu)
Opencv
Numpy
Matplotlib

Installing

Download or git clone the repoaitory to local machine. Change the input directory location in the fused_classifier.py file

Running The tests

To execute a fusion point test, Change to corresponding function name at line 208 in the fusedClassifier.py file

Alexnet

Fuse Points ==> Name of the function
- Fusion at 2 ==> alexnet_fused2
- Fusion at 3 ==> alexnet_fused3
- Fusion at 4 ==> alexnet_fused4
- Fusion at 5 ==> alexnet_fused5
- Fusion at 6 ==> alexnet_fused6

VGGnet

Fuse points ==> Name of the function
- Fusion at 2 ==> vggnet_fused2
- Fusion at 3 ==> vggnet_fused3
- Fusion at 4 ==> vggnet_fused4
- Fusion at 5 ==> vggnet_fused5
- Fusion at 6 ==> vggnet_fused6

Resnet

Fuse points ==> Name of the function
- Fusion at 2 ==> resnet_fused2
- Fusion at 3 ==> resnet_fused3
- Fusion at 4 ==> resnet_fused4
- Fusion at 5 ==> resnet_fused5
- Fusion at 6 ==> resnet_fused6

Contributing Authors

Kausic Gunasekkar - Profile

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Recommend Projects

maodong2056 / evaluating-fusion-points-for-multi-stream-networks-handling-cross-modal-data Goto Github PK

evaluating-fusion-points-for-multi-stream-networks-handling-cross-modal-data's Introduction

Evaluating Fusion points for Multi-Stream Networks handling cross modal data

Dataset:

Pre-Requisites

Installing

Running The tests

Contributing Authors

License

evaluating-fusion-points-for-multi-stream-networks-handling-cross-modal-data's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent