nditech / af-elections-data
Base data for the Afghanistan Elections site
We should be using the `419 AGCHO` standard.
Each election year has a `provinces.csv` and a `districts.csv` file that currently include only the province/district name, id, and geoId. To keep track of the origin of all base, non-processed data, we would like to add the following data columns to each CSV for each year:
For example, 2018's provinces.csv will contain the following:
id | name | geoId | totalPopulationWomen | totalPopulationMen | totalPopulation | registeredMen | registeredWomen | registeredTotal | Observers | eccComplaints |
---|---|---|---|---|---|---|---|---|---|---|
1 | Kabul | 1 | x | x | x | x | x | x | x | x |
See the recent PR on 2018 for an example: #50
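As a rough sketch of what adding the columns could look like, the snippet below pads an existing `provinces.csv` out to the column set shown in the 2018 example table, leaving the new cells empty until sourced values are filled in. The function name `extend_columns` is hypothetical and not part of this repo; the column list is copied from the table above.

```python
import csv
import io

# Column names copied from the 2018 example table.
TARGET_COLUMNS = [
    "id", "name", "geoId",
    "totalPopulationWomen", "totalPopulationMen", "totalPopulation",
    "registeredMen", "registeredWomen", "registeredTotal",
    "Observers", "eccComplaints",
]


def extend_columns(csv_text, columns=TARGET_COLUMNS):
    """Return CSV text containing every target column.

    Hypothetical helper: cells for columns missing from the input stay
    empty. A column in the input that is not in `columns` raises
    ValueError (csv.DictWriter's default), so nothing is dropped silently.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=columns, restval="",
                            lineterminator="\n")
    writer.writeheader()
    for row in reader:
        writer.writerow(row)
    return out.getvalue()


if __name__ == "__main__":
    print(extend_columns("id,name,geoId\n1,Kabul,1\n"))
```

Keeping the new cells empty rather than zero makes it clear which values still need a documented source.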
We have to update the `geoId` for provinces according to the `421 CSO`, which is the same as the `419 AGCHO`.
ref: https://github.com/developmentseed/af-elections/issues/97#issuecomment-484117765
The `geoId` columns of the 2014 provinces and districts should be verified. These files are currently part of this PR, but should soon be on `master`: #16
According to the documentation, the `geoId` column follows one of the `agcho` standards. Check if they match the `421 CSO` that we settled on as our default.
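One way to run this check, sketched under assumptions: load the `geoId` values from the CSV and compare them against a reference list of codes from the chosen standard. The helper `find_unmatched_geo_ids` is hypothetical, and the reference codes have to come from the standard itself; the values in the usage lines are placeholders, not real codes.

```python
def find_unmatched_geo_ids(rows, reference_ids):
    """Return the rows whose geoId is absent from the reference code list.

    Hypothetical helper. `rows` are dicts as produced by csv.DictReader;
    `reference_ids` is any iterable of codes from the chosen standard.
    Codes are compared as stripped strings so 3201 and "3201 " match.
    """
    reference = {str(code).strip() for code in reference_ids}
    return [
        row for row in rows
        if str(row.get("geoId", "")).strip() not in reference
    ]


if __name__ == "__main__":
    # Placeholder data for illustration only.
    rows = [{"name": "A", "geoId": "101"}, {"name": "B", "geoId": "999"}]
    print(find_unmatched_geo_ids(rows, ["101", "102"]))
```

An empty result only shows that every code exists in the reference list; it does not prove that each code is attached to the right district, so names should be spot-checked as well.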
Currently, Kuchi and Sikh metadata like `registeredVoters` are included in the actual provinces (Kabul, Kapisa, etc).
id | name | registeredMen | registeredWomen | registeredNomad | registeredHinduSikh | registeredTotal |
---|---|---|---|---|---|---|
1 | KABUL | 1041185 | 522214 | 68039 | 759 | 1632197 |
Since the IEC vote counts for Kuchi and Sikh are reported in separate, special provinces, their metadata should follow the same logic. If we don't do this, voter participation rates will be off (e.g. total votes Kabul / registered voters Kabul).
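A minimal sketch of the split, using the Kabul row above. In that row the four registration columns add up to `registeredTotal`, so the sketch treats them as disjoint groups; that should be confirmed for the other provinces. The function name and the shape of the returned dicts are assumptions for illustration, not the repo's actual format.

```python
def split_special_electorates(row):
    """Separate Kuchi (nomad) and Hindu/Sikh registrations from a province row.

    Hypothetical helper. Assumes the registration columns are disjoint,
    which holds for the Kabul example row.
    """
    men = int(row["registeredMen"])
    women = int(row["registeredWomen"])
    nomad = int(row["registeredNomad"])
    hindu_sikh = int(row["registeredHinduSikh"])
    province = {
        "id": row["id"],
        "name": row["name"],
        "registeredMen": men,
        "registeredWomen": women,
        "registeredTotal": int(row["registeredTotal"]) - nomad - hindu_sikh,
    }
    special = {"registeredNomad": nomad, "registeredHinduSikh": hindu_sikh}
    return province, special


if __name__ == "__main__":
    kabul = {
        "id": "1", "name": "KABUL",
        "registeredMen": "1041185", "registeredWomen": "522214",
        "registeredNomad": "68039", "registeredHinduSikh": "759",
        "registeredTotal": "1632197",
    }
    province, special = split_special_electorates(kabul)
    print(province["registeredTotal"], special)
```

With the special registrations moved out, the participation rate for Kabul would be computed against 1563399 registered voters instead of 1632197.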
Compile base data for 2005 from data on https://afghanistanelectiondata.org/open/data:
provinces.csv
districts.csv
polling-centers.csv
Following this spec: https://github.com/developmentseed/af-elections-api
Clean up the 2010 candidates.csv:
The 2010 result pages show this information, but the CSV files with raw result data contain only the names in Dari, with no indication of which candidates were incumbents or were elected.
For:
See docs for spec: https://github.com/developmentseed/af-elections-api/tree/feature/candidate-meta#candidatescsv
The breakdown of observers per province should be compiled from the multiple sheets in the workbook found here:
https://docs.google.com/spreadsheets/d/1H6T57vHCOJfofok4fMRdQ2ViUbEGmt867N8uyBarpRw/edit?usp=sharing
While the 2018 data story requests only total observers per province, I suggest we add all data points per province for posterity.
Review the provinces and districts of 2018, and ensure that they use the `419 AGCHO` standard.
See: https://github.com/developmentseed/af-elections/issues/97#issuecomment-479578737
Based on the 2010 election results, determine which candidates are incumbent in 2018.
Each election should have its own Readme in which we document the source of each dataset, a data dictionary, and whether we've done any processing.
Generate:
results.sqlite
candidates.csv
The population data for some districts is repeated:
dist.json.zip
https://github.com/developmentseed/af-elections-data/blob/master/2018-wolesi-jirga/districts.csv
example:
"name": "Provincial Capital Of Herat (herat)",
"name_dari": "هرات",
"id": 320101,
"dist_id": 3201,
"province_name": "Hirat",
"province_name_dari": "هرات",
"province_id": 32,
"admin_type": "district",
"unit_type": "District",
"distId": 320101,
"provId": 32,
"geoId": 3201,
"registeredMen": 6611,
"registeredWomen": 4660,
"registeredTotal": 11271,
"femalePopulationRural": 0,
"malePopulationRural": 0,
"totalPopulationRural": 0,
"femalePopulationUrban": 270069,
"malePopulationUrban": 268604,
"totalPopulationUrban": 538673,
"totalPopulationFemale": 270069,
"totalPopulationMale": 268604,
"totalPopulation": 270069,
"womenCandidates": 28,
In the example above ☝️, `totalPopulation` and `totalPopulationFemale` have the same value. Could you check it?
cc. @karitotp @LanesGood
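To find every affected district rather than checking by eye, a small consistency check could flag records where `totalPopulation` does not equal `totalPopulationFemale + totalPopulationMale`. This is a sketch: the function name is hypothetical, and the rule that the total should be the sum of the female and male figures is an assumption, though it matches the urban figures in the Herat example.

```python
def find_population_mismatches(districts):
    """List districts whose totalPopulation is not female + male.

    Hypothetical helper. Returns (id, reported_total, expected_total)
    tuples for every record that fails the check.
    """
    mismatches = []
    for district in districts:
        expected = (district["totalPopulationFemale"]
                    + district["totalPopulationMale"])
        if district["totalPopulation"] != expected:
            mismatches.append(
                (district["id"], district["totalPopulation"], expected)
            )
    return mismatches


if __name__ == "__main__":
    # Values taken from the Herat record quoted in this issue.
    herat = {
        "id": 320101,
        "totalPopulationFemale": 270069,
        "totalPopulationMale": 268604,
        "totalPopulation": 270069,
    }
    print(find_population_mismatches([herat]))
```

For the Herat record, the check reports a total of 270069 where 538673 would be expected.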
Generate:
results.sqlite
candidates.csv
From this file: https://afghanistanelectiondata.org/sites/default/files/pollingcenters_2009.csv
generate:
provinces.csv
districts.csv
polling-centers.csv
Following this spec: https://github.com/developmentseed/af-elections-api
Data from http://www.iec.org.af/results/en/invalid/invalid_by_pc should be scraped and added to this repo for site usage.
NDI has requested that we make this repo private for now, as they review observer data and want to ensure that we don't make certain data available to the public.