Code Monkey home page Code Monkey logo

Comments (13)

sunshineatnoon avatar sunshineatnoon commented on May 13, 2024

Same question here.

from mxnet.

antinucleon avatar antinucleon commented on May 13, 2024

I will add feature to write handcraft kernel directly in op, and we always welcome you to send PR to contribute to MXNet

from mxnet.

sunshineatnoon avatar sunshineatnoon commented on May 13, 2024

@antinucleon Thanks!

from mxnet.

futurely avatar futurely commented on May 13, 2024

Fast and Faster R-CNN's changes to Caffe are all in this commit and the object detection application is here.

Fast and Faster R-CNN change set
 - smooth l1 loss
 - roi pooling
 - expose phase in pycaffe
 - dropout scaling at test time (needed for MSRA-trained ZF network)

from mxnet.

kaishijeng avatar kaishijeng commented on May 13, 2024

Any chance to add layers which are required by faster-rcnn in near future?

from mxnet.

loweew avatar loweew commented on May 13, 2024

are there any plans for a faster-rcnn type example in the image-classification section? This would be greatly appreciated, if possible.

from mxnet.

ijkguo avatar ijkguo commented on May 13, 2024

New operator mxnet.symbol.ROIPooling is slightly tricky. Here are some key information that are missing from documentation.

import mxnet as mx
data = mx.sym.Variable('data')
# [batch_size, channel, height, width]
rois = mx.sym.Variable('rois')
# [roi_number, 5]
# last dimension is [batch index of image, x1, y1, x2, y2]
# some convolutional layer
roi_pool = mx.sym.ROIPooling(data=data, rois=rois, pooled_size=(6, 6), spatial_scale=0.0625)
# please note that batch_size changes from batch_size to roi_number after ROI pooling.

from mxnet.

jonbakerfish avatar jonbakerfish commented on May 13, 2024

@precedenceguo How to train network with mxnet.symbol.ROIPooling? Any example?

from mxnet.

jonbakerfish avatar jonbakerfish commented on May 13, 2024

I noticed that the executor_manager.DataParallelExecutorGroup uses the same slices for both input data and labels. But in the case of fast-rcnn, during training, the input images are in shape (2,3,H,W) while the labels' are (128,) for the 128 ROIS. How can we change the code for training?

from mxnet.

ijkguo avatar ijkguo commented on May 13, 2024

Multiple devices training split data into slices for devices. In this example, each data batch has image shape (2, 3, H, W) and label shape (128, ). Splitting data batches does not compromise each batch since loader supply data and label together in each batch. Therefore no change is necessary except the number of training devices. For the training of Fast R-CNN, it is recommended to see the original caffe version for now.

from mxnet.

jonbakerfish avatar jonbakerfish commented on May 13, 2024

I'm not using multiple devices for training. I use the python api model.fit which actually calls _train_multi_device. The executor_manager inside _train_multi_device uses the same slices for both input data and labels. It works for the case where the batch sizes are the same for both input data and label. I'm looking for advice on how to change the code accordingly when input data and label have different batch sizes.

from mxnet.

ijkguo avatar ijkguo commented on May 13, 2024

There could exist some issue about varying batch size with the FeedForward API. Is there any error message or anomaly?

from mxnet.

tqchen avatar tqchen commented on May 13, 2024

c.f. https://github.com/dmlc/mxnet/tree/master/example/rcnn

from mxnet.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.