Code Monkey home page Code Monkey logo

Comments (5)

wy1iu avatar wy1iu commented on June 2, 2024

Hi, thanks for your interest in our work.

"If the matrix M is block-diagonal, the result of equation 2 is equivalent to summing the output of the each channel."
-- When the matrix M is block-diagonal, you can view the equation 2 as multiple inner products in each channel, and then sum them up. In general, your understanding is right. In this case, it is equivalent to summing up the output of each channel, but the computation in each channel is like a small bilinear inner prodcut.

"This operator can be replaced by averaging pooling or something else."
-- If you can have the ouput from each channel (which is a small bilinear inner product like the form "WMX"), of course you can then use something like an average pooling to sum them up (or aggregate them). But this will restrict to the case that M is a block-diagonal matrix. The bilinear form could be more general than that.

from nsl.

peter943 avatar peter943 commented on June 2, 2024

@wy1iu Thanks a lot. I was wondering whether it is possible that I could ask you a more question.

In your paper, you say: In order to further stablize the training, we learn the residual of an identity similarity matrix instead of directly learning the entire similarity matrix.

I am not quite sure how to achieve it. Maybe I should initialize a random weight matrix W and identity matrix I and add them together which is similar to ResNet. Just change WX to (W+I)X and the rest of module stays the same. I do not know whether this understand is right?

I'm using the zero matrix now to initialize W (W+I is an identity matrix corresponding to the traditional Conv), I do not know if that's appropriate. Thanks again for your kind help.

from nsl.

wy1iu avatar wy1iu commented on June 2, 2024

Your understanding is basically right. But there are some details that you may need to pay attention to. To help you better implement the method, we will release our implementation in 1-2 days. Thanks

from nsl.

lzzcd001 avatar lzzcd001 commented on June 2, 2024

We have released the code for image recognition. Hope that clarifies things for you.

from nsl.

peter943 avatar peter943 commented on June 2, 2024

@lzzcd001 @wy1iu Thank you very much. It helps me a lot.

from nsl.

Related Issues (2)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.