Comments (5)
@alexanderrichard
Could you clarify a bit the intermediate representations of each model. I think the final visulization is done with the avatar renderer from the Driving-Signal Aware Full-Body Avatars paper, which I think isn't directly portable to any other renderer that relies on a combination of mesh, skeleton bones and blend shapes. Whether SMPL, or any other output, it would be helpful to know what are the intermediate representations from each network. If they are "proprietary" latent values of an avatar renderer, it will be way more challenging to add any cross platform functionality.
I know I could probably dig into the code and paper a bit more and get the answer, but you can potentially save me a few hours, with a direct answer.
and of course congratulations on the paper and results!
from audio2photoreal.
Sure. The body decoder consumes body pose (as joint angles) and face expression codes (following https://arxiv.org/abs/1808.00362) as input, and is conditioned on a view direction from which the avatar can then be rendered.
There is no compatibility with blendshape models if you want to render textured avatars. For that, you would need to solve a retargeting problem, which will not be perfect and deteriorate quality. With this in mind, there are two (lengthy, complicated) ways to make this approach run on other representations:
- Re-rigging/Re-targeting. Let's forget about the textures for a moment. The core problem to generalize to other body representations is the different rig. You'd have to build a mapping from the joint angles of our rig to the representation (typically also in joint angles, but usually a different number of joints) of your target rig. Probably our representation is not a superset of your target representation, so this might be difficult.
- Fit a SMPL model to "ground truth". You can of course use the provided renderer and render ground truth avatars from different view-points. You can then use the resulting multiview dataset to fit a SMPL model as if it was a "true" multiview capture.
None of this is trivial unless you already have a body tracking pipeline for (2) from other multiview/3D projects you worked on.
Sorry that there is no "quick and easy" solution to this.
from audio2photoreal.
Hi! We currently don't have plans to provide a mapping to SMPL but since both the joint angles and the body renderer for this project are released in this repository, please feel free to contribute a mapping to this repo :)
from audio2photoreal.
there are many existing body tracking pipelines
is there a trivial solution for 1 (?)
from audio2photoreal.
I tried it, but it didn't work very well.
Code:
Dump motion: https://github.com/see2023/audio2photoreal
Send to MetaHuman: https://github.com/see2023/Bert-VITS2-ext
Demo: https://www.bilibili.com/video/BV1dg4y1m7sX/
from audio2photoreal.
Related Issues (20)
- How to change the position of camera/model? HOT 1
- Training the model with different data format HOT 1
- The lips regressor predicts unexpected result HOT 5
- Switching from Recording to Uploading Audio in a Demo: Is it Possible? HOT 1
- Why the data is not as in the README ? HOT 2
- Models and pre-requisites models unavailable HOT 3
- Does it support languages other than English? HOT 1
- Models and pre-requisites models unavailable HOT 3
- What model was used to extract the body pose ? HOT 4
- Multiple GPUs DDP error HOT 5
- Data acquisition and processing HOT 3
- The evaluation code for lip reconstructions HOT 1
- Is it possible to run the demo in a laptop without GPU? HOT 3
- Training inference time and test data HOT 2
- How to train a new model from scratch HOT 1
- Visualize 2 avatars in the same scene, just like the introduction page HOT 1
- Replancement of fairseq HOT 1
- Video data HOT 1
- 1
- when i run python -m demo.demo, I test a audio,have a error? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from audio2photoreal.