Code Monkey home page Code Monkey logo

Comments (5)

ailingzengzzz avatar ailingzengzzz commented on August 25, 2024

Hi @mangguoxia,

Similar to other sliding-window-based filters, SmoothNet is also a near-online method. To modify it to an online inference, you can first take the first T frame as the latency, and you will get the first T smoothed poses. After that, you can input the following frames one by one (input the [1, 2, ..., T, T+1] poses), and obtain the smoothed (T+1)_th pose from the last frame from the output of SmoothNet.

Hope it will be helpful!

from smoothnet.

mangguoxia avatar mangguoxia commented on August 25, 2024

Hi @mangguoxia,

Similar to other sliding-window-based filters, SmoothNet is also a near-online method. To modify it to an online inference, you can first take the first T frame as the latency, and you will get the first T smoothed poses. After that, you can input the following frames one by one (input the [1, 2, ..., T, T+1] poses), and obtain the smoothed (T+1)_th pose from the last frame from the output of SmoothNet.

Hope it will be helpful!

栓Q so much !

Actually, I used a queue of size T. I copied the first frame T times to fullfill the queue, then put the queue into smoothnet, and obtain the smoothed (T)_th pose from the last frame output by smoothnet.
After that , I delete the first frame in the queue and add a new frame from the video into queue.

But the result is very poor, even I trained the model using my own dataset. (I also tested using pre-train model you provided, the result is similiar.)

I want to know whether there is a problem with my inference method or whether my task (facial landmark detection) is not suitable for smoothnet ?
If input the [1, 2, ..., T, T+1] poses as you said, the length of input will increase gradually , is this correct ?

from smoothnet.

ailingzengzzz avatar ailingzengzzz commented on August 25, 2024

Hi @mangguoxia ,

We have tested SmoothNet on the whole body 3d poses (including face, hand, and body detection), and the results look good.
If input the [1, 2, ..., T, T+1] poses, we always keep the fixed length T as a sliding window.
I'm concerned about the normalization of your data since we need to normalize the data into [-1, 1].

from smoothnet.

mangguoxia avatar mangguoxia commented on August 25, 2024

Thanks ! I will try to normalize the data into [-0.5, 0.5] (right? I checked the training code for dataloader)

BTW, which pretrained model is suggested for the facial landmark detection ? And the best windowsize ?

from smoothnet.

ailingzengzzz avatar ailingzengzzz commented on August 25, 2024

The performance depends on the movement features and jitter degrees of your data. You can try all these models.
For the offline test, the longer the sliding window length, the smoother the result.

from smoothnet.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.