Code Monkey home page Code Monkey logo

chen8023 / cfte_dcc2022 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from berlin0610/cfte_dcc2022

0.0 0.0 0.0 43.47 MB

This repository contains the source code for the paper Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression by Bolin Chen, Zhao Wang, Binzhe Li, Rongqun Lin, Shiqi Wang, and Yan Ye.

License: Apache License 2.0

Shell 0.23% C++ 4.05% Python 95.73%

cfte_dcc2022's Introduction

Temporal-Evolution-Inference-with-Compact-Feature-Representation-for-Talking-Face-Video-Compression

This repository contains the source code for the paper Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression by Bolin Chen, Zhao Wang, Binzhe Li, Rongqun Lin, Shiqi Wang, and Yan Ye.

The DCC keynote video presented by Dr. Yan Ye can be found in https://www.youtube.com/watch?v=7en3YYT1QfU.

The overall implementation codes and pretrained checkpoint can be found under following link: OneDrive.

Installation

We support python3. To install the dependencies run:

pip install -r requirements.txt

In addition, please activate the VVC codec run

sudo chmod -R 777 vtm

Training

To train a model on VoxCeleb dataset, please follow the instruction from https://github.com/AliaksandrSiarohin/video-preprocessing.

When finishing the downloading and pre-processing the dataset, you can train the model,

python run.py

The code will create a folder in the log directory (each run will create a time-stamped new directory). Checkpoints will be saved to this folder. To check the loss values during training see log.txt. You can also check training data reconstructions in the train-vis subfolder. You can change the training settings in corresponding ./config/vox-256.yaml file.

Inference

To encode a sequence, please put the provided testing sequence in ./testing_data/ file and run

python Encoder.py

After obtaining the bistream, please run

python Decoder.py

For the testing sequence, it should be in the format of RGB:444 at the resolution of 256*256.

Evaluate

In ./evaluate/multiMetric.py file, we provide the corresponding quality measures, including DISTS, LPIPS, PSNR and SSIM.

Additional notes

Reference

The training code refers to the FOMM: https://github.com/AliaksandrSiarohin/first-order-model.

The arithmetic-coding refers to https://github.com/nayuki/Reference-arithmetic-coding.

Citation:

@INPROCEEDINGS{CHEN_DCC2022,
  author={Chen, Bolin and Wang, Zhao and Li, Binzhe and Lin, Rongqun and Wang, Shiqi and Ye, Yan},
  booktitle={2022 Data Compression Conference (DCC)}, 
  title={Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression}, 
  year={2022},
  volume={},
  number={},
  pages={13-22},
  doi={10.1109/DCC52660.2022.00009}}

cfte_dcc2022's People

Contributors

berlin0610 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.