Code Monkey home page Code Monkey logo

youmakeup's Introduction

Introduction

YouMakeup is a large-scale multimodal instructional video dataset introduced in YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension (EMNLP2019).

It contains 2,800 videos from YouTube, spanning more than 420 hours in total. Each video is annotated with a sequence of steps, including temporal boundaries, grounded facial areas and natural language descriptions of each step.

image

Makeup activities are fine-grained in nature. Different makeup steps share the same facial background but contain at least one subtle but critical difference in action, tool, product or facial area. Therefore, it requires fine-grained discrimination within temporal and spatial context to distinguish them. Our YouMakeup dataset can support various tasks for fine-grained semantic comprehension.

image

CVPR 2020 Workshop: YouMakeup VQA chanllenge

We propose two video question answering tasks in the CVPR 2020 workshop Language & Vision with applications to Video Understanding. The details of the chanllenge are introduced in CVPR_2020_YouMakeup_VQA_chanllenge.

Facial Image Ordering

As instructional video present steps for accomplishing a certain task, tracking the changes of object after the steps is crucial for procedure understanding. The effects of makeup are fine-grained changes of facial appearances. Therefore, we propose the facial image ordering task, which is to sort a set of facial images from a video into the correct order according to the given step descriptions.

image

Step Ordering

Cross-model semantic alignment is important in the field of visual and language. In the step ordering task, we ask models to sort a set of step descriptions into the right order as these actions are performed in the video. Models need to align textual step descriptions with corresponding video contents to solve the task. Since the makeup videos contain a sequence of actions and texts are composed of multi-sentences, the task also requires long-term temporal action reasoning and text understanding.

image

Data Download

The data license form should be signed before accessing the data. Please sign the form

data/Liscense Agreement.pdf

and send it to [email protected] and we will provide the link for data download.

Citation

@inproceedings{wang2019youmakeup,
  title={YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension},
  author={Wang, Weiying and Wang, Yongcheng and Chen, Shizhe and Jin, Qin},
  booktitle={Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)},
  pages={5136--5146},
  year={2019}
}

If you have any questions about YouMakeup dataset, please contact us by [email protected].

If you have any questions about the YouMakeup VQA chanllenge, please contact us by [email protected]

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.