KinetiX-Downloader

A Python 3 script for downloading the Kinetics 400, 600, and 700 datasets using PyTube3.

Kinetics is a large-scale, high-quality dataset of URL links to approximately 650,000 video clips that cover 700 human action classes, including human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands and hugging. Each action class has at least 600 video clips. Each clip is annotated by a human with a single action class and lasts around 10 seconds.

Available versions of Kinetics: Kinetics-400, Kinetics-600, and Kinetics-700.

Prerequisites

  • pytube3
  • pandas
  • tqdm

Download dataset

The script download.py downloads the videos of a given Kinetics version (400, 600, or 700) and a given dataset subset (train, test, validate, or all of them):

usage: Kinetics dataset downloader [-h] [-v {400,600,700}] [-s {train,test,validate,all}] [-w WORKERS]

optional arguments:
  -h, --help            show this help message and exit
  -v {400,600,700}, --version {400,600,700}
                        choose dataset version ('400', '600', '700')
  -s {train,test,validate,all}, --subset {train,test,validate,all}
                        choose dataset subset ('train', 'test', 'validate', or 'all')
  -w WORKERS, --workers WORKERS
                        Set number of multiprocessing workers
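
For readers who want to see how such an interface is typically wired up, here is a minimal sketch of an argparse parser that mirrors the usage text above. It is not the code of download.py: the function name build_parser and all default values are assumptions made for illustration, since the usage text does not state any defaults.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the usage text (hypothetical helper)."""
    parser = argparse.ArgumentParser(prog="Kinetics dataset downloader")
    # The defaults below are assumptions; the usage text does not list them.
    parser.add_argument("-v", "--version", type=str, default="700",
                        choices=("400", "600", "700"),
                        help="choose dataset version ('400', '600', '700')")
    parser.add_argument("-s", "--subset", type=str, default="all",
                        choices=("train", "test", "validate", "all"),
                        help="choose dataset subset "
                             "('train', 'test', 'validate', or 'all')")
    parser.add_argument("-w", "--workers", type=int, default=None,
                        help="Set number of multiprocessing workers")
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["-v", "400", "-s", "train", "-w", "4"])
print(args.version, args.subset, args.workers)
```

Passing a value outside the listed choices (for example `-v 500`) makes argparse exit with an error message, which matches the restricted sets shown in the usage line.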

Dataset videos will be saved under the directory Kinetics<version>/. Some videos are expected to be unavailable for downloading. This may happen for various reasons (e.g., an unexpected PyTube error, or the YouTube video no longer being available). An error log file (Kinetics<version>_<subset>_errors.log) is created as soon as the first download error occurs, and stores the YouTube IDs of all videos that could not be downloaded. After the download process is complete, you may re-run the script to retry the failed videos; videos that have already been downloaded successfully are skipped.
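
The skip-and-log behaviour described above can be sketched as follows. This is an illustration under stated assumptions, not the actual logic of download.py: the function download_subset, the `<id>.mp4` file naming, and the `fetch` callable (a stand-in for the PyTube-based download) are all hypothetical.

```python
from pathlib import Path


def download_subset(video_ids, out_dir, error_log, fetch):
    """Try each video once; skip files already on disk and log failures.

    `fetch(video_id, target_path)` is a hypothetical callable standing in
    for the real download step; it is expected to raise on failure.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    failed = []
    for video_id in video_ids:
        # Assumed naming scheme: one file per YouTube ID.
        target = out_dir / f"{video_id}.mp4"
        if target.exists():
            # Already downloaded in a previous run, so skip it.
            continue
        try:
            fetch(video_id, target)
        except Exception:
            failed.append(video_id)
            # Append mode creates the log on the first error only.
            with open(error_log, "a") as log:
                log.write(video_id + "\n")
    return failed
```

Calling the function a second time with the same ID list only attempts the videos whose files are still missing. Note that this sketch appends to the log, so an ID that fails on every run would be listed once per run.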

TODO

  • Add script for extracting frames from downloaded video. In the meantime, you may take a look at this repo.
