uuembodiedsocialai / facediffuser

facediffuser's Issues

Pretrained weights

Hi @uuembodiedsocialai,
thanks a lot for sharing this insightful work!
I wanted to ask whether you plan to release the pretrained weights for the models (particularly the one trained on vocaset). That would be very helpful!

Thanks a lot :)

Lip Sync on vocaset

Thanks for your contribution! Will you share the code for calculating the lip error on vocaset?

can't find templates/face_template.obj

When I ran "python predict.py --dataset vocaset ...",
it reported the error "can't find templates/face_template.obj".
Where can I download this .obj file?

ZeroDivisionError: division by zero

print('Diversity: {:.4e}'.format(diversity / num_seq))
ZeroDivisionError: division by zero
Why is num_seq 0?
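
The traceback is consistent with no sequence having been counted before the average is taken, for example because the evaluation loop found no matching files. A minimal sketch of a guard, assuming `diversity` is a running sum and `num_seq` a counter as in the quoted line; the helper name `format_diversity` is hypothetical and not part of the repository:

```python
def format_diversity(diversity, num_seq):
    """Return the report line, or a diagnostic when no sequences were counted."""
    if num_seq == 0:
        # Assumption: a zero count means no test sequences were found or matched.
        return "Diversity: n/a (num_seq is 0; check that test sequences were found)"
    return "Diversity: {:.4e}".format(diversity / num_seq)


print(format_diversity(0.0, 0))
print(format_diversity(1.5, 3))
```

A guard like this only turns the crash into a readable message; the empty sequence list itself still needs to be tracked down.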

Details about training and prediction on the BEAT dataset

Could you please provide some specific code for training and prediction on the BEAT dataset? It seems that this part is not included in the current code.

I tried to modify the code you provided to reproduce the results on the BEAT dataset, but the test results were far from those reported in the paper.

It would be appreciated if you could provide the relevant code and trained model weights for the BEAT dataset.

What is listener_path

There is a line

listener_path = os.path.join(args.data_path, args.dataset, args.listener_path)

which gives an error because there is no args.listener_path

What is it meant to be? There is nothing in the data that could fit.

BIWI training sequence indices

Thanks for your interesting work! I want to ask whether the model on BIWI is trained on only the emotional (e) sequences, only the non-emotional sequences, or both.

The paper says that only the emotional sequences are used, but in the data_loader the training split is range(1, 33), which IMO covers the non-emotional sequences?

About the .flv file in BIWI dataset

Thanks again for the code.

I have one more question about pre-processing the BIWI dataset.
I have requested and obtained the download link for BIWI.

It seems that no .flv files are included in the download, although the download page suggests they should be:

'videos' contains videos (.flv) of the rendered 3D geometries and original audio (sampling rate: 44.1kHz).

I wonder if I missed something, and how the authors handled this problem.
Thanks again if you could kindly reply.

BIWI dataset preprocessing

Sorry to disturb.
I see that the BIWI data preprocessing only requires downloading the [faces0x.tgz] files, which differs from other preprocessing pipelines such as CodeTalker, which also require files like [scans0x.tgz].

I wonder if the provided preprocessing code outputs exactly the same processed files as CodeTalker.

Thanks again for the good work.

How to define the mask of mouth and upper face?

mouth_mask = list(range(94, 114)) + list(range(146, 178)) + list(range(183, 192))
upper_mask = [x for x in range(192) if x not in mouth_mask]
In the evaluation code, I looked for how LVE and FDD are calculated. Since the face is represented as a single vector, how do we know which positions correspond to the lips and which to the other regions?
So how is the mask defined?
Thanks.
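
For reference, the two lists above split the 192 feature indices into 61 "mouth" indices and 131 remaining ones. The snippet does not say how those indices map to face regions, so the sketch below only shows how such index masks select entries from a per-frame feature vector. The `select` helper and the sample `frame` are hypothetical:

```python
# Masks copied from the evaluation snippet quoted above.
mouth_mask = list(range(94, 114)) + list(range(146, 178)) + list(range(183, 192))
upper_mask = [x for x in range(192) if x not in mouth_mask]


def select(frame, mask):
    """Pick the entries of one 192-dimensional frame listed in mask."""
    return [frame[i] for i in mask]


# Hypothetical frame whose value at each position equals its index.
frame = [float(i) for i in range(192)]

print(len(mouth_mask), len(upper_mask))  # 61 131
print(select(frame, mouth_mask)[:3])     # [94.0, 95.0, 96.0]
```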

I guess this should be train+val+test; otherwise, I get an empty train dataset

FaceDiffuser/data_loader.py

Line 62 in b1f386f

all_subjects = args.test_subjects.split() + args.val_subjects.split() + args.test_subjects.split()
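
The quoted line concatenates `test_subjects` twice, so training subjects never enter `all_subjects`. A sketch of the change the title suggests, assuming `args` also carries a `train_subjects` string in the same whitespace-separated format; the helper name and the sample values are made up for illustration:

```python
from types import SimpleNamespace


def collect_subjects(args):
    """Concatenate train, val and test subjects, each a whitespace-separated string."""
    return (args.train_subjects.split()
            + args.val_subjects.split()
            + args.test_subjects.split())


# Hypothetical subject names, only to show the resulting list.
args = SimpleNamespace(train_subjects="A B", val_subjects="C", test_subjects="D")
print(collect_subjects(args))  # ['A', 'B', 'C', 'D']
```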

wrong key building for one hot generation

FaceDiffuser/data_loader.py

Lines 31 to 33 in b1f386f

    
    if self.data_type == "train":
        subject = file_name.split("_")[0]
        one_hot = self.one_hot_labels[self.subjects_dict["train"].index(subject)]
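
The issue does not say what exactly goes wrong. One possible reading is that `file_name.split("_")[0]` keeps only the first token, which fails when subject identifiers themselves contain underscores. A sketch under that assumption, with file names of the form `<subject>_<sequence>.<ext>`; the naming pattern, the helper and the sample names are assumptions, not confirmed by the repository:

```python
def subject_from_file_name(file_name):
    """Drop the extension, then everything after the last underscore."""
    stem = file_name.rsplit(".", 1)[0]
    return stem.rsplit("_", 1)[0]


# Hypothetical training subjects and file name.
train_subjects = ["FaceTalk_170728_03272_TA", "F2"]
name = "FaceTalk_170728_03272_TA_sentence01.npy"

print(name.split("_")[0])            # FaceTalk
print(subject_from_file_name(name))  # FaceTalk_170728_03272_TA
print(train_subjects.index(subject_from_file_name(name)))  # 0
```

With the first-token key, `train_subjects.index("FaceTalk")` would raise a ValueError here, while the last-underscore split finds the subject.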

Prediction on BEAT

Can you please explain exactly how prediction is done on the BEAT dataset? Please share the command to predict on BEAT as well.

Also, the BEAT model is not available. Can you please upload the model trained on BEAT?

Thanks.
