Comments (3)
Hi! Thank you for your interest. The data acquisition directly follows from this repository. The technical report for that codebase provides details on number of cameras, positioning of cameras, etc. For the body pose, we used a similar setup to collect the dataset.
For the raw audio, we fetched this from the dome. Unfortunately I'm not sure I know much more about the details about the actual audio capture. From that .mp4 style files, I just dumped them directly into .npy files. So no actual audio processing happened there.
Lastly, the data processing step requires the full multiview to do the reconstruction. To get a 3D avatar, we render from many sides and do reconstruction across different viewpoints.
Hope this helps!
from audio2photoreal.
Thank you very much for your response!
I still have something that needs to be clarified:
- The body poses and face expression extraction step needs the full multiview recorded video, right?
- When do you plan to release the video from the dataset and processing code?
from audio2photoreal.
Regarding 1. yep, both the extraction for face and body needs the full multiview recorded video, which unfortunately we will not be releasing at the moment, and 2. sadly that is still up in the air. At the moment, we are just releasing the photoreal renderings, and 3d pose estimates provided in this repo. If anything changes, I will update the repo to reflect this.
from audio2photoreal.
Related Issues (20)
- How can I manually rotate an avatar's head? HOT 2
- How to pass avatar renderer conditions HOT 1
- How to change the position of camera/model? HOT 1
- Training the model with different data format HOT 1
- The lips regressor predicts unexpected result HOT 5
- Switching from Recording to Uploading Audio in a Demo: Is it Possible? HOT 1
- Why the data is not as in the README ? HOT 2
- Models and pre-requisites models unavailable HOT 3
- Does it support languages other than English? HOT 1
- Models and pre-requisites models unavailable HOT 3
- What model was used to extract the body pose ? HOT 4
- Multiple GPUs DDP error HOT 5
- The evaluation code for lip reconstructions HOT 1
- Is it possible to run the demo in a laptop without GPU? HOT 3
- Training inference time and test data HOT 2
- How to train a new model from scratch HOT 1
- Visualize 2 avatars in the same scene, just like the introduction page HOT 1
- Replancement of fairseq HOT 1
- Video data HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from audio2photoreal.