AgeVoxCeleb

AgeVoxCeleb DATABASE

AgeVoxCeleb is a large-scale, balanced, multi-modal age corpus that contains both video and speech of the same speakers across a wide age range.

AgeVoxCeleb contains nearly 168k videos of approximately 5,000 speakers selected from VoxCeleb2. Every video is labeled with the speaker's real age, estimated from the celebrity's name and the title of the original YouTube video.

For details on how we built AgeVoxCeleb, please refer to our paper presented at ICASSP 2021.

USAGE

This repository distributes only the annotation labels. To obtain the data itself, including audio files, cropped video files, and other meta information, please visit the original VoxCeleb2 web page and download it from there.

Data description

Each line in utt2spk.train and utt2spk.test pairs a segment-id with its annotated age label, for the training and test sets respectively. The segment-id is composed of the speaker-ID, video-ID, and segment-ID defined in the original VoxCeleb.

Note also that the division of the data into training and test sets differs from the one defined in the original VoxCeleb dataset.
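
As a rough illustration of the file layout described above, the following Python sketch reads label lines into structured records. It rests on several assumptions not confirmed by this README: that the segment-id and the label are separated by whitespace, that the three parts of the segment-id are joined by a single separator character (`-` here, purely as a guess), and that the label can be kept as an uninterpreted string. The names `AgeLabel` and `parse_utt2spk` are hypothetical. Please check a few lines of the actual files and adjust `sep` before relying on it.

```python
from typing import Iterable, List, NamedTuple


class AgeLabel(NamedTuple):
    """One annotation record (hypothetical container, not part of the dataset)."""
    speaker_id: str
    video_id: str
    segment_id: str
    age: str  # kept as a string because the exact label format is an assumption


def parse_utt2spk(lines: Iterable[str], sep: str = "-") -> List[AgeLabel]:
    """Parse lines of the assumed form '<speaker><sep><video><sep><segment> <label>'."""
    records = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        # First whitespace-delimited field is the segment-id, the rest is the label.
        utt_id, label = line.split(maxsplit=1)
        # Speaker is the first part and segment the last part; whatever lies
        # between is the video-ID, which may itself contain the separator.
        speaker_id, rest = utt_id.split(sep, 1)
        video_id, segment_id = rest.rsplit(sep, 1)
        records.append(AgeLabel(speaker_id, video_id, segment_id, label))
    return records
```

To read a real file, one could pass an open file handle, for example `parse_utt2spk(open("utt2spk.train", encoding="utf-8"))`. Splitting the speaker from the left and the segment from the right is a deliberate choice so that a video-ID containing the separator character is not cut apart.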

Related work

Another work [1] is closely related to ours but was conducted independently. The key difference is that its authors annotated age labels automatically and more strictly. Specifically, they use video semantics and identity information taken from multiple data sources to obtain each speaker's accurate birth year and the recording date. Please refer to their arXiv paper [1] and GitHub repository for detailed information.

[1] Khaled Hechmi, Trung Ngo Trong, Ville Hautamaki, Tomi Kinnunen, "VoxCeleb Enrichment for Age and Gender Recognition," arXiv:2109.13510, 2021.

Citation

If you use this dataset, please cite:

@inproceedings{tawara2021age,
  title={Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation},
  author={Tawara, Naohiro and Ogawa, Atsunori and Kitagishi, Yuki and Kamiyama, Hosana},
  booktitle={ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages={6963--6967},
  year={2021},
  organization={IEEE}
}
