Code Monkey home page Code Monkey logo

Comments (3)

YuanxunLu avatar YuanxunLu commented on July 21, 2024

If your data doesn't include head poses and upper body motions, e.g., MEAD, of course, predicting head pose is meaningless because actually there're no head poses exist in the dataset right (heads in all training frames are fixed)?

Meanwhile, if your dataset doesn't include shoulder motions (or on a very small scale), removing upper body motion features may work. But it also depends on how you cut and crop the face region, e.g., if all the training frames keep the shoulders almost in the same location, I think upper body motion is not needed because there is no ambiguity about the bottom part of the images, i.e., the shoulder keeps fixed. After all, the shoulder line is designed to remove the ambiguity in the training set (One could move his shoulder while keeping his head fixed).

Also, if the background of your dataset is something like a green screen and your camera is fixed, i.e., camera parameters are fixed for all training frames, I think the candidate image set could also be removed.

Finally, I think every design should be considered and checked in the experiments, right?

from livespeechportraits.

forest520 avatar forest520 commented on July 21, 2024

谢谢Dr. Lu详细的解答!
再请教训练过程中,每个形象的3分钟视频,是使用的4个固定的candidate image么?还是根据时间窗口,4个candidate image会相应变动?

from livespeechportraits.

YuanxunLu avatar YuanxunLu commented on July 21, 2024

The number of the candidate images set of each video is not fixed. As mentioned above, if your data doesn't include changing camera parameters -- the influence of candidate images set is not obvious. For example, if one video contains 3 different camera parameters, I will select 3 different candidate image set during the training. During testing, you should keep the candidate images fixed.

from livespeechportraits.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.