Code Monkey home page Code Monkey logo

Comments (9)

ahkarami avatar ahkarami commented on September 24, 2024 1

@gss-ucas Thank you for your response. However, on Table 2 of paper mentioned that "All models are based on Imagenet pre-trained Inception-v1 ...". But I want to know the accuracy of training the 3D Inception V1 Model without any trick from scratch on UCF101 Data Set (It is important to know for validating the effect of Inflating Technique).
In addition, the concept of Inflating has some ambiguity for me. In fact, I think it is vitally important to show the effect of Inflating Technique on 3D CNN models, and as this view point, I think the paper has some shortcomings.

from kinetics-i3d.

shuangshuangguo avatar shuangshuangguo commented on September 24, 2024

@ahkarami Hi, I think you can get answer on table.2 in paper

from kinetics-i3d.

shuangshuangguo avatar shuangshuangguo commented on September 24, 2024

@ahkarami The author may not train from scratch on UCF101. As far as I am concerned, the intension of this architerture is leveraging successful ImageNet architecture designs and even their parameters, so we don't need to bother whether it's good when train from scratch, just use the existing Imagenet classification model. (of course, this is my humble opinion, please tell me if I'm wrong.

from kinetics-i3d.

ahkarami avatar ahkarami commented on September 24, 2024

Dear @gss-ucas,
Thank you very much for your time and helpful opinion. However, I think it was better that the authors report the accuracy of the 3D Inception V1 Model (without Inflating) on UCF101 Dataset. If they reported the mentioned accuracy, then one can easily figure out the effect of Inflating Technique. In addition, I think we can consider the Inflating technique just as a kind of weight initialization for 3D CNN models, and I doubt that this technique change the result of models significantly (maybe just accelerate the convergence rate, and maybe improve the accuracy of model a little). Moreover, the authors didn't release the code of Inflating Technique. It is worth noting that because the Kinetics data set is very large-scale, so if we train a 3D CNN model on it and then fine-tune it on a small-scale data set (e.g., UCF101) probably results a good accuracy (without using any trick such as inflating). But this method is completely differ from the Inflating Technique, and I think in the paper, these mentioned methods confused with each other.

from kinetics-i3d.

baiyancheng20 avatar baiyancheng20 commented on September 24, 2024

@gss-ucas @ahkarami Have you tried to inflate imagnet pretrained models into 3D models on CAFFE?

from kinetics-i3d.

ahkarami avatar ahkarami commented on September 24, 2024

Dear @baiyancheng20, No I haven't try it on Caffe.

from kinetics-i3d.

panna19951227 avatar panna19951227 commented on September 24, 2024

have you implemented this paper on UCF101?Can you share about it.Thank you! I am confused when I am faced with fine-tuning the pre-trained model on UCF-101.

from kinetics-i3d.

smittal6 avatar smittal6 commented on September 24, 2024

@panna19951227 Any update on your query?

from kinetics-i3d.

joaoluiscarreira avatar joaoluiscarreira commented on September 24, 2024

Hi,

the numbers for two-stream I3D from scratch on UCF101 are 88.8%, vs 93.4% when starting from ImageNet. Check table 4 in the most recent version of the arxiv paper.

from kinetics-i3d.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.