af-elections-data's People

Contributors

danielfdsilva, karitotp, lanesgood, olafveerman, piligab, rub21, vgeorge, yunica

af-elections-data's Issues

Review and add geoID to 2010 data

  • review the geoId of the provinces and make sure we are using the correct standard
  • add the geoId to the districts

We should be using the 419 AGCHO standard.

Add prior years' base data to provinces.csv and districts.csv

Each election year has a provinces.csv and districts.csv file that currently includes only the province/district name, id, and geoId. To keep track of the origin of all base, non-processed data, we would like to add the following data columns to each CSV for each year:

  • Population
  • Registered Voters
  • Observers
  • ECC Complaints

For example, 2018's provinces.csv will contain the following:

id | name  | geoId | totalPopulationWomen | totalPopulationMen | totalPopulation | registeredMen | registeredWomen | registeredTotal | Observers | eccComplaints
1  | Kabul | 1     | x                    | x                  | x               | x             | x               | x               | x         | x

See the recent PR on 2018 for an example: #50
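A quick way to see which of the requested columns a given year's file still lacks is to compare its header against the 2018 column list above. This is only a minimal sketch: the `missing_columns` helper and the sample file contents are hypothetical, and the column names are copied from the example row rather than from any agreed schema.

```python
import csv
import io

# Column names copied from the 2018 provinces.csv example in this issue.
EXPECTED_COLUMNS = [
    "id", "name", "geoId",
    "totalPopulationWomen", "totalPopulationMen", "totalPopulation",
    "registeredMen", "registeredWomen", "registeredTotal",
    "Observers", "eccComplaints",
]


def missing_columns(csv_file, expected=EXPECTED_COLUMNS):
    """Return the expected column names that are absent from the CSV header."""
    reader = csv.DictReader(csv_file)
    header = reader.fieldnames or []  # None for an empty file
    return [col for col in expected if col not in header]


# Hypothetical contents: a provinces.csv that only has the three base columns.
sample = io.StringIO("id,name,geoId\n1,Kabul,1\n")
print(missing_columns(sample))
```

For a real file, pass an open file object instead of the `io.StringIO` sample, e.g. `open(path, newline="")` for whichever year's provinces.csv or districts.csv is being checked.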

Review 2014 geoId

The geoId columns of the 2014 provinces and districts should be verified. These files are currently part of this PR, but should soon be on master: #16

According to the documentation, the geoId column follows one of the AGCHO standards. Check whether the values match the 421 CSO standard that we settled on as our default.

Add Kuchi and Sikh metadata to their provinces

Currently, Kuchi and Sikh metadata such as registeredVoters is included in the actual provinces (Kabul, Kapisa, etc.).

id | name  | registeredMen | registeredWomen | registeredNomad | registeredHinduSikh | registeredTotal
1  | KABUL | 1041185       | 522214          | 68039           | 759                 | 1632197

Since the IEC vote counts for Kuchi and Sikh are reported in separate, special provinces, their metadata should follow the same logic. If it doesn't, voter participation rates will be off (e.g. Total votes Kabul / Registered voters Kabul).
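To illustrate the effect on participation rates, here is a rough sketch using the Kabul row above. It assumes that registeredNomad corresponds to the Kuchi constituency and that registeredTotal is the sum of the four registered columns (which holds for this row). The `split_special_constituencies` helper, its output keys, and the vote count are hypothetical.

```python
def split_special_constituencies(row):
    """Split one province row into general, Kuchi, and Sikh registration counts.

    Assumes registeredNomad maps to Kuchi and registeredHinduSikh to Sikh.
    The output keys are illustrative, not an agreed schema.
    """
    return {
        "general": int(row["registeredMen"]) + int(row["registeredWomen"]),
        "kuchi": int(row["registeredNomad"]),
        "sikh": int(row["registeredHinduSikh"]),
    }


def turnout(votes, registered):
    """Participation rate as a fraction of registered voters."""
    return votes / registered


# Kabul row as it currently appears in the issue.
kabul = {
    "registeredMen": 1041185,
    "registeredWomen": 522214,
    "registeredNomad": 68039,
    "registeredHinduSikh": 759,
    "registeredTotal": 1632197,
}

parts = split_special_constituencies(kabul)
# The three parts account for the whole registeredTotal in this row.
assert sum(parts.values()) == kabul["registeredTotal"]

votes_kabul = 500_000  # hypothetical vote count, for illustration only
print("with Kuchi/Sikh in denominator:", turnout(votes_kabul, kabul["registeredTotal"]))
print("general constituency only:     ", turnout(votes_kabul, parts["general"]))
```

With Kuchi and Sikh registrations left in the Kabul denominator, the computed rate comes out lower than it would against the general constituency alone, which is the distortion this issue describes.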

Clean 2010 candidates

Clean up the 2010 candidates.csv:

  • English names
  • incumbent
  • elected preliminary
  • elected final

The 2010 result pages show this information, but the CSV files with raw result data only contain the names in Dari, with no indication of incumbents or elected candidates.

Readme

Each election should have its own Readme in which we document the source of each dataset, a data dictionary, and whether we've done any processing.

Review population district 2018 data

The population data for some districts is repeated:
dist.json.zip
https://github.com/developmentseed/af-elections-data/blob/master/2018-wolesi-jirga/districts.csv
example:

        "name": "Provincial Capital Of Herat (herat)",
        "name_dari": "هرات",
        "id": 320101,
        "dist_id": 3201,
        "province_name": "Hirat",
        "province_name_dari": "هرات",
        "province_id": 32,
        "admin_type": "district",
        "unit_type": "District",
        "distId": 320101,
        "provId": 32,
        "geoId": 3201,
        "registeredMen": 6611,
        "registeredWomen": 4660,
        "registeredTotal": 11271,
        "femalePopulationRural": 0,
        "malePopulationRural": 0,
        "totalPopulationRural": 0,
        "femalePopulationUrban": 270069,
        "malePopulationUrban": 268604,
        "totalPopulationUrban": 538673,
        "totalPopulationFemale": 270069,
        "totalPopulationMale": 268604,
        "totalPopulation": 270069,
        "womenCandidates": 28,

In the example above ☝️, totalPopulation and totalPopulationFemale have the same value (270069), although totalPopulationFemale + totalPopulationMale is 538673. Could you check it?

cc. @karitotp @LanesGood
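A small consistency check could help find the other affected districts. The sketch below flags any record whose totalPopulation differs from totalPopulationFemale + totalPopulationMale. The `population_mismatches` helper is hypothetical, and the sample record contains only the relevant fields from the Herat example above.

```python
def population_mismatches(districts):
    """Return (id, totalPopulation, female + male) for inconsistent records."""
    mismatches = []
    for d in districts:
        expected = d["totalPopulationFemale"] + d["totalPopulationMale"]
        if d["totalPopulation"] != expected:
            mismatches.append((d["id"], d["totalPopulation"], expected))
    return mismatches


# Relevant fields from the Herat record quoted in this issue.
herat = {
    "id": 320101,
    "totalPopulationFemale": 270069,
    "totalPopulationMale": 268604,
    "totalPopulation": 270069,
}

print(population_mismatches([herat]))  # → [(320101, 270069, 538673)]
```

To run this over the full dataset, the list of district records would first need to be loaded (for example with `json.load`) from the attached dist.json, assuming it holds a list of objects shaped like the example.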

Make this repo temporarily private

NDI has requested that we make this repo private for now, as they review observer data and want to ensure that we don't make certain data available to the public.
