Comments (3)
Hi, sorry for the late reply.
Can you talk about the inspiration for doing this, or explain it, I am confused about it
Sure. The main idea is to use regular convolution-based operations across three different "rotated" copies of the 3D input tensor. We hypothesized that this allows the network to capture cross-dimensional features in a way that regular 2D convolutions could not. For example, if the channel dimension represents time (such as in a video where frames are stacked and passed through a CNN), the network could learn features that correspond to an object moving up and down or side-to-side.
You can find more information about this idea of cross-dimensional interaction, the motivation for our method, and our analysis in our paper.
from triplet-attention.
Thank you very much for your full response!
What I mean is that this operation is actually a variant of spatial attention. So, have you considered doing the same on channel attention?
from triplet-attention.
What I mean is that this operation is actually a variant of spatial attention.
Yep, you're right. Triplet Attention applies a variant of CBAM's spatial attention across different permutations of the input.
So, have you considered doing the same on channel attention?
No, not really. But I suppose this would work as well, to a degree.
As we mention in the paper, one of the main motivations of Triplet Attention is efficiently learning how to extract "cross-dimentional" features. So by using spatial attention, each branch is able to learn features that span across two of the three dimensions.
If you used three channel attention branches, you wouldn't be able to learn those kinds of cross-dimensional feature extractors, since the canonical channel attention only operates across one dimension of a tensor.
from triplet-attention.
Related Issues (20)
- wandb error HOT 4
- No multiprocessing option HOT 5
- Pre-training weights loading HOT 5
- seems like the params are not used in `TripletAttention`'s definition? HOT 1
- The weights in google drive is demage!!! HOT 2
- The code for HeatMap visualization HOT 1
- load error HOT 3
- 怎么修改能一键替换SE 和CBAM HOT 3
- torch.cat( (torch.max(x, 1)[0].unsqueeze(1), torch.mean(x, 1).unsqueeze(1)), dim=1 ) HOT 1
- Hi,I wanna ask that how to calculate the AP value of. HOT 1
- error HOT 3
- Triplet Attention can be applied to EfficientNet in place SE? HOT 3
- How did you initialize TA weights? HOT 1
- A little question about adding attention to the network HOT 1
- I Have a question HOT 5
- Pretrain weights and experiment set on resnet20,32 HOT 3
- mmdetection_fasterrcnn HOT 3
- data loader issue in the training stage HOT 7
- 预训练模型加载 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from triplet-attention.