Code Monkey home page Code Monkey logo

vizwiz-captions's Introduction

VizWiz Captions Evaluation

Update 2020-02-27:

Thank you for your interest in VizWiz captions. Active development and maintenance responsibilities of this project has been transferred. Please visit this repository to obtain the latest codes, and for your queries about this project.


Code for the VizWiz API and evaluation of generated captions.

Requirements

  • python 3
  • java 1.8.0 (for caption evaluation)

Files

./

  • demo_vizwiz_caption_evaluation.ipynb (tutorial notebook)

./vizwiz_api

  • vizwiz.py: This file contains the VizWiz API class that can be used to load VizWiz dataset JSON files and analyze them.

./annotation

  • VizWiz_Captions_v1_val.json (VizWiz Captions v1 validation set)
  • Dataset shares the same data format as MS COCO.

./results

  • VizWiz_Fake_Captions.json (an example of fake results for running demo)
  • Dataset shares the same data format as MS COCO.

./vizwiz_eval_cap: The folder where all caption evaluation codes are stored.

  • evals.py: This file includes the VizWizEvalCap class that can be used to evaluate results on VizWiz.
  • tokenizer: Python wrapper of Stanford CoreNLP PTBTokenizer
  • bleu: Bleu evalutation codes
  • rouge: Rouge-L evaluation codes
  • cider: CIDEr evaluation codes
  • spice: SPICE evaluation codes

Setup

  • The primary VizWiz API is standalone.
  • For caption evaluation, you will first need to download the Stanford CoreNLP 3.6.0 code and models for use by SPICE. To do this, run: ./get_stanford_models.sh
    • To run shell scripts in Windows, you can setup Windows Subsystem for Linux.
    • The command for Windows will then be bash get_stanford_models.sh
  • Note: SPICE will try to create a cache of parsed sentences in ./vizwiz_eval_cap/spice/cache/. This dramatically speeds up repeated evaluations. The cache directory can be moved by setting 'CACHE_DIR' in ./vizwiz_eval_cap/spice. In the same file, caching can be turned off by removing the '-cache' argument to 'spice_cmd'.

References

Developers

Acknowledgement

This work is closely adapted from MS COCO API and MS COCO Caption Evaluation API.

Original Developers

  • Xinlei Chen (CMU)
  • Hao Fang (University of Washington)
  • Tsung-Yi Lin (Cornell)
  • Ramakrishna Vedantam (Virgina Tech)

Original Acknowledgements

  • David Chiang (University of Norte Dame)
  • Michael Denkowski (CMU)
  • Alexander Rush (Harvard University)

vizwiz-captions's People

Contributors

nilavra avatar

Stargazers

 avatar  avatar Xinxin Zhu avatar Zhi Rao avatar 陈崇彦 avatar Jakub Langr avatar  avatar

Watchers

 avatar paper2code - bot avatar

Forkers

zmcmcc

vizwiz-captions's Issues

Offline Demo and Data loading

Dear @nilavra ,

Thank you for the awesome dataset demo code! It helped me a lot to understand :)

I wanted your aid in modifying the code for two things:

  1. Instead of downloading images all the time, how can I modify the code such that it refers to the local path and reads images sequentially from a local folder?

  2. Where can I find the json file with captions for Train, Test, and Val images?

Thank you

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.