Comments (5)
Try to set face erode to something like 40 ,mask blur to 30 and use gfpgan, Let me know
from sd-wav2lip-uhq.
Try to set face erode to something like 40 ,mask blur to 30 and use gfpgan, Let me know
just applied your suggested parameters, and the result is kind of off, here is the result video:
where the teeth seem crooked and over-crowded. not sure if it is because the mask isnt covering the teeth in the original video
https://github.com/numz/sd-wav2lip-uhq/assets/76946041/0a87d151-20e6-4ace-a68d-1bb20b7da032
from sd-wav2lip-uhq.
active debug and look into debug folder to check mask composition and see if it missed something or if erode or mask blur is not good enough
from sd-wav2lip-uhq.
active debug and look into debug folder to check mask composition and see if it missed something or if erode or mask blur is not good enough
the upper one is the "restored face video" and the bottom one is the "generated video". the "restored face video" has overall better looking eyes but only problem is the rectangular shape which appears around the mouth and it cuts off the tip of the chin a bit; however, the "generated video" looks closer to the original video, but the dark batches and shadows around the mouth makes it unpleasant and unnatural. i would prefer the "restored face video" without the rectangular shape.
here are the "restored face video" and the "generated video" for a clearer view of what I meant:
tsts1.mp4
tsts2.mp4
here is the setting im using now:
from sd-wav2lip-uhq.
You can try "only mouth" option, mouton mask dilate something like 30 and "mask blur" between 30 and 60.
Let me know
from sd-wav2lip-uhq.
Related Issues (20)
- RuntimeError: Detected that PyTorch and TorchAudio were compiled with different CUDA versions. PyTorch has CUDA version 11.8 whereas TorchAudio has CUDA version 11.7. HOT 1
- ImportError: numpy.core.multiarray failed to import
- ValueError: max() arg is an empty sequence HOT 3
- What is the required CUDA version to run this repo reliably? HOT 1
- What does this do?
- 现在可以命令行运行吗?
- No module named 'scripts.wav2lip_uhq_extend_paths' HOT 1
- this extension killed my A1111 i have to install it again (maybe) HOT 2
- Torch not compiled with CUDA enabled HOT 3
- Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check HOT 5
- TypeError: unsupported operand type(s) for -: 'NoneType' and 'int'
- Automatic1111 Ext not work HOT 3
- The mouth does not match on the X-axis. How can I fix this?
- 是缺少模型吗?下载链接有吗
- hi
- Who needs high-quality lip sync - contact me!
- Stuck in faceswap
- "Usage: Choose a video..." - but were is it at all ?
- Pay for stand alone version
- upload timeout,file size:27.6M
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sd-wav2lip-uhq.