Probabilistic Triangulation V2

Code of ICCV 2023 paper: Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation

Abstract: 3D human pose estimation has been a long-standing challenge in computer vision and graphics, where multi-view methods have significantly progressed but are limited by the tedious calibration processes. Existing multi-view methods are restricted to fixed camera pose and therefore lack generalization ability. This paper presents a novel Probabilistic Triangulation module that can be embedded in a calibrated 3D human pose estimation method, generalizing it to uncalibration scenes. The key idea is to use a probability distribution to model the camera pose and iteratively update the distribution from 2D features instead of using camera pose. Specifically, We maintain a camera pose distribution and then iteratively update this distribution by computing the posterior probability of the camera pose through Monte Carlo sampling. This way, the gradients can be directly back-propagated from the 3D pose estimation to the 2D heatmap, enabling end-to-end training. Extensive experiments on Human3.6M and CMU Panoptic demonstrate that our method outperforms other uncalibration methods and achieves comparable results with state-of-the-art calibration methods. Thus, our method achieves a trade-off between estimation accuracy and generalizability.

version update

Accelerated the model by replacing backbone with mobileone;
changed the sampling logic to speed up multi-view fusion;
Now the model can be reasoned in real time on iphone.

Getting started

1. Dataset

Download and preprocess the dataset by following the instructions in h36m-fetch and learnable triangulation.

The directory structure after completing all processing：

human3.6m
├── extra
│   ├── bboxes-Human36M-GT.npy
│   ├── human36m-multiview-labels-GTbboxes.npy
│   └── una-dinosauria-data
└── processed
    ├── S1
    ├── S11
    ├── S5
    ├── S6
    ├── S7
    ├── S8
    └── S9

2. Quick Start

Use conda to create an environment, or a newer version of pytorch：

conda env create -f environment.yaml

Perform inference on pretrained models:

python infer3d.py

The following results will be obtained, where x3d/l2 is mpjpe:

loss 4.177016958594322
loss/hm 2.6981436171952415
loss/x3d 5.655890337684575
x2d/l1 16.779179317109726
x2d/l2 13.22389983604936
x3d/l1 38.431529241449695
x3d/l2 26.103624186095068

Train the 3d estimator, which by default will use the pretrained model of the 2d backbone:

python train3d.py

Train the 2d backbone:

python train2d.py

3. Some training suggestions

While training, it was found that the estimation accuracy of the 2D pose very much affects the results, the mpjpe can be up to 26mm for 384x384 inputs, but only 34mm for 256x256 inputs.
When the model parameter count is small, Human3.6m has a single background that tends to overfit the model (probably because the model uses color as a key point feature). We added some color and brightness data augmentation during training to combat it. But this can't completely solve the field scene. Pre-training the model with a broader dataset would solve this problem.
Human3.6m has a lot of data duplicates, and spaced use can quickly validate the training results.
The voxel fusion multi-view approach leads to a rich physics prior, but there is a bottleneck in acceleration. In the new version of the code, we use orientation + sampled features as inputs, which can greatly speed up the speedup.
In training, the fusion part uses the generated data for pre-training and is fine-tuned in subsequent training, which can achieve better generalization.

Citation

If you find this project useful for your research, please consider citing:

@article{hu2023pose,
  title={Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation},
  author={Boyuan Jiang, Lei Hu, Shihong Xia}
  journal={IEEE International Conference on Computer Vision},
  year={2023},
  publisher={IEEE}
}

dwro-creatz / probabilistic_triangulation Goto Github PK

probabilistic_triangulation's Introduction

Probabilistic Triangulation V2

version update

Getting started

1. Dataset

2. Quick Start

3. Some training suggestions

Citation

probabilistic_triangulation's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent