Comments (9)
@gss-ucas Thank you for your response. However, the caption of Table 2 in the paper states that "All models are based on Imagenet pre-trained Inception-v1 ...". What I want to know is the accuracy of the 3D Inception-v1 model when it is trained from scratch on the UCF101 dataset, without any tricks. That number is needed to validate the effect of the inflating technique.
In addition, the concept of inflating is somewhat ambiguous to me. I think it is vitally important to show the effect of the inflating technique on 3D CNN models, and from this point of view I think the paper has some shortcomings.
from kinetics-i3d.
@ahkarami Hi, I think you can find the answer in Table 2 of the paper.
@ahkarami The authors may not have trained from scratch on UCF101. As far as I understand, the intention of this architecture is to leverage successful ImageNet architecture designs and even their parameters, so we don't need to worry about how well it performs when trained from scratch; we can just use the existing ImageNet classification model. (Of course, this is just my humble opinion; please tell me if I'm wrong.)
Dear @gss-ucas,
Thank you very much for your time and helpful opinion. However, I think it would have been better if the authors had reported the accuracy of the 3D Inception-v1 model (without inflating) on the UCF101 dataset. With that number, one could easily see the effect of the inflating technique. In addition, I think we can consider inflating as just a kind of weight initialization for 3D CNN models, and I doubt that it changes the results significantly (it may just speed up convergence and perhaps improve accuracy a little). Moreover, the authors didn't release the code for the inflating technique. It is worth noting that, because the Kinetics dataset is very large, training a 3D CNN model on it and then fine-tuning on a small dataset (e.g., UCF101) will probably give good accuracy without any trick such as inflating. But that approach is completely different from the inflating technique, and I think the paper conflates the two.
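For reference, the paper describes inflation as repeating the weights of each 2D filter N times along the time dimension and dividing by N, so that a video made of one repeated frame gives the same filter response as the 2D network gives on that frame. Below is a minimal NumPy sketch of that idea. It is not the authors' code: `inflate_2d_kernel` is a hypothetical helper, and the (H, W, C_in, C_out) kernel layout is an assumption.

```python
import numpy as np


def inflate_2d_kernel(kernel_2d, time_depth):
    """Inflate a 2D conv kernel into a 3D one (hypothetical helper).

    kernel_2d:  array of shape (H, W, C_in, C_out) -- assumed layout.
    time_depth: temporal size N of the resulting 3D kernel.
    Returns an array of shape (N, H, W, C_in, C_out).
    """
    if time_depth < 1:
        raise ValueError("time_depth must be a positive integer")
    kernel_2d = np.asarray(kernel_2d, dtype=np.float64)
    # Repeat the 2D weights N times along a new leading time axis.
    kernel_3d = np.repeat(kernel_2d[np.newaxis, ...], time_depth, axis=0)
    # Divide by N so that summing over time recovers the 2D weights.
    return kernel_3d / time_depth
```

Because the N temporal slices sum back to the original 2D kernel, a clip of N identical frames produces the same pre-activation as the 2D filter applied to a single frame. Seen this way, inflation does act as a weight initialization for the 3D model.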
@gss-ucas @ahkarami Have you tried inflating ImageNet pre-trained models into 3D models in Caffe?
Dear @baiyancheng20, no, I haven't tried it in Caffe.
Have you implemented this paper on UCF101? Could you share your implementation? Thank you! I am confused about how to fine-tune the pre-trained model on UCF-101.
@panna19951227 Any update on your query?
Hi,
The numbers for two-stream I3D on UCF101 are 88.8% when trained from scratch vs. 93.4% when starting from ImageNet. Check Table 4 in the most recent version of the arXiv paper.