Code Monkey home page Code Monkey logo

who-data-mining's Introduction

World Health Organization - PDF Data Mining

Python algorithm for converting WHO official COVID daily reporting tables (PDF) into simple data format (Pandas / CSV file).

Project Overview

This code is an adaptation of Lain's tutorial code, posted at: http://www.degeneratestate.org/posts/2016/Jun/15/extracting-tabular-data-from-pdfs/

The main adaptation is this code suitable for reading the tables of daily reports of the World Health Organization (WHO) on COVID.

Features

  • Converts Official COVID PDF tables to CSV.
  • Query Algorithms for data visualization.

Core Libraries

How it works?

  • Place a PDF* inside input folder.
  • Run "WHO_PDF_MINER.py" and set pdf_file_name variable as the name of your PDF.
  • The Output CSV will be created in the output folder with the same name as the PDF.

*You can download official WHO pdfs from this link: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/


PDF Miner Algorithm:

Input PDF (PdfMiner)

Output CSV (Pandas)


Query Algorithms (Pandas / Seaborn):

One Country Plot -> By Given Dates & Attributes (Full Dataset)

One Country Plot -> By Given Dates & Column/Attribute

  

One Date Plot -> By Given Column/Attribute


who-data-mining's People

Contributors

lrmendes avatar

Watchers

 avatar

Forkers

kumarsumii

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.