Code Monkey home page Code Monkey logo

facediffuser's People

Contributors

annonymoustomato avatar galib360 avatar stefan-st avatar uuembodiedsocialai avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

facediffuser's Issues

Pretrained weights

Hi @uuembodiedsocialai ,
thanks a lot for sharing this insightful work !
I wanted to ask if you have plans to release the pretrained weights for the models (particularly the one trained with vocaset). That would be very helpful !

Thanks a lot :)

Lip Sync on vocaset

Thanks for your contribution! Will you give the code for calculate the lip error on vocaset?
22

can't find templates/face_template.obj

when I ran "python predict.py --dataset vocaset ..." .
it reported an error "can't find templates/face_template.obj".
where can I download this .obj file?

Details about training and prediction on the BEAT dataset

Could you please provide some specific code for training and prediction on the BEAT dataset? It seems that this part is not included in the current code.

I tried to modify the code you provided to reproduce the results on the BEAT data set, but the test results were far from those reported in the paper.

It would be appreciated that if you could provide the relevant code and model training weights for the BEAT dataset.

e7f1bee0c80cb2c50fc8496436589dd

What is listener_path

The is a line

listener_path = os.path.join(args.data_path, args.dataset, args.listener_path)

which gives an error because there is no args.listener_path

What is it meant to be? There is nothing in the data that could fit.

BIWI training sequence indices

Thanks for your interesting work! I want to ask whether the model on BIWI is trained with only e sequences or not-e sequences or both.

In the paper it says only use the emotional sequences, but in the data_loader the training splits is range(1, 33), which IMO is the non-emotional sequences?

About the .flv file in BIWI dataset

Thanks again for the code.

I am one more question about pre-processing the BIWI dataset.
I have requested and obtained the download link of BIWI, which looks like this
スクリーンショット 2023-11-09 155408

It seems like no .flv files are included in the download link, which should be included as suggested in the download page:

  • 'videos' contains videos (.flv) of the rendered 3D geometries and original audio (sampling rate: 44.1kHz).

I wonder if i missed something and how the authors handle this problem.
Thanks again if you could kindly reply.

BIWI dataset preprocessing

Sorry to disturb.
I see the BIWI dataprepocessing requires only to download the [faces0x.tgz] files, which is different from other preprocessing ways such as CodeTalker that requires other files like [scans0x.tgz].

I wonder if the provided preprocessing code outputs the exactly same precessed files as CodeTalker.

Thanks again for the good work.

How to define the mask of mouth and upper face?

mouth_mask = list(range(94, 114)) + list(range(146, 178)) + list(range(183, 192))
upper_mask = [x for x in range(192) if x not in mouth_mask]
In the evalution of the code, I looked for how to calculate the LVE and FDD because the facial vector as a whole, how to know which positional information values represent the lip and the other region.
So how to define the mask?
Thanks.

license?

Thanks so much for the great work!
Could you please let us know the license of this work?

About the Multiface dataset

Thanks for the great work and code!
I have one question prior to downloading the Multiface dataset.
Does the training involves using the whole Multiface dataset (which is quite large), or just downloading the mini version suffices?

Questions about training on VOCASET

Overfitting happens when the model is trained on VOCASET. The training loss is descending while the validation loss is rising step by step when training.
So is VOCASET too small? Dose anyone meets the same problem?

Help! mistakes in dataloader

  1. I guess this should be train+val+test, otherview, I got empty train dataset

    all_subjects = args.test_subjects.split() + args.val_subjects.split() + args.test_subjects.split()

  2. wrong key building for one hot generation

    if self.data_type == "train":
    subject = file_name.split("_")[0]
    one_hot = self.one_hot_labels[self.subjects_dict["train"].index(subject)]

75c897106ad5b424dae1b0ad8714d16

Prediction on BEAT

Hi

Can you please mention how exactly prediction is done on beat dataset? Please share the command to predict on beat as well.

Also the beat model is not available. Can you please upload the trained model on beat?

Thanks.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.