Code Monkey home page Code Monkey logo

partyfactsdata's Introduction

Party Facts data import

logo

Authors

  • Paul Bederke — University of Konstanz — since 2019
  • Holger Döring — GESIS – Leibniz Institute for the Social Sciences — since 2012
  • Sven Regel — WZB Berlin Social Science Center — 2012–2023
  • [ contact information — paul.bederke uni-konstanz de ]

see full credits at Party Facts “about section”

References

Döring, Holger, and Sven Regel. 2019. “Party Facts: A Database of Political Parties Worldwide.” Party Politics 25(2): 97–109. doi: 10.1177/1354068818820671

Bederke, Paul, Holger Döring, and Sven Regel. 2021. “Party Facts Dataverse.” — dataverse.harvard.edu/dataverse/partyfacts

Summary

The Party Facts project aims to offer a gateway to empirical data about political parties and provides a modern online almanac about parties and their history as recorded in social science datasets. The many existing datasets with crucial information gained through expert or mass surveys, data handbooks, voting records, party positions or electoral results about political parties are difficult to link and there is the need for an infrastructure that helps combining existing sources. With Party Facts we want to establish an infrastructure that supports political scientists in linking parties across datasets. In the Party Facts project we link core datasets of political science and provide a platform for other scientists to add party keys from additional datasets.The project uses modern online technologies to offer an opportunity for collaborative data collection. Scientists can add missing links between parties, can validate links and are given the opportunity to provide additional information about parties and their history.

Import

The import folder contains all external datasets.

You may submit your own dataset there.

Party Facts world map


Party Facts linking activity

partyfactsdata's People

Contributors

fischerbit avatar hdigital avatar mrwunderbar666 avatar pbederke avatar polvis avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

partyfactsdata's Issues

harvard elections data is often wrong

image Vicente Fox won the 2000 election in Mexico, he is in the PAN party. But the elections global release data has the PRI winning 2000 with 35.1% of votes vs PAN's 34.6% of votes. image

In fact, if you merge their global leaders dataset with the elections dataset, very often does the President of a country in a year not line up with who they say won the election in a country, year pair. Unless someone can point out to me what is going on, this is seriously poor quality data and needs to be removed less people use it.

CLEA 2014 parties not in 2016 data

Some of the parties from CLEA 2014 are not in CLEA 2016.

These parties were not yet removed from Party Facts when the CLEA 2016 data was imported on 3 August 2016.

Double check with CLEA team before finally removing these parties from Party Facts.

Here is a country summary and see list of parties (clea-2014-remove.xlsx). We need some informaton about the high number of parties from India.

clea14 <- read.csv("clea-2014/clea.csv", fileEncoding = "utf-8", as.is=TRUE)
clea14 %>% filter( ! ctr_pty %in% party$ctr_pty) %>% .[['country']] %>% table

AUS AUT CUW DEU IDN IND LUX MDA NLD SLE TUR USA 
  1   1   7   1   1  26   1   1   2   2   1   4 

different data formats on dataverse

On Harvard Dataverse, where the stable versions of the Party Facts datasets are available, most data files have the .tab extension. There are three exceptions:

  1. https://dataverse.harvard.edu/file.xhtml?fileId=4274155&version=1.1
  2. https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/UUMC31/JEM0W5&version=1.0
  3. https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/FTQAYT/L8EGUM&version=1.0

These have the .csv extension.

If this is not on purpose, perhaps all files could be of the same format for consistency reasons. I realised this difference when I was trying to write a function that would get these files systematically, where the extension makes a difference.

Perhaps this is not possible and/or desirable. In any case, let me say thank you for this great resource.

partyfacts-clea.csv appears to be incomplete/outdated

The file at https://github.com/hdigital/partyfactsdata/blob/master/import/clea/partyfacts-clea.csv appears to be missing some parties.

For example, Demokratikus Koalíció with partyfacts_id = 469 (see: https://partyfacts.herokuapp.com/data/partycodes/469/) and dataset_party_id = 348000004 (see: https://partyfacts.herokuapp.com/data/partyall/51923/) does not appear in the file.

Interestingly, Demokratikus Koalíció is listed in clea.csv at https://github.com/hdigital/partyfactsdata/blob/master/import/clea/clea.csv with party_id = 348000004.

Where should I look if I want a more complete/up-to-date version of partyfacts-clea.csv?

Alternatively, if such a file does not already exist or cannot be provided, do you know how I can generate a complete version of partyfacts-clea.csv on my own using the data available online at https://partyfacts.herokuapp.com/ ?

run all script

Create R script to run all import R scripts. This would assure that all files can be run to create the data for the import. Its a lightweight alternative to unit testing.

add duplicate IDs

Datasets with cleaned-up party lists on Google drive ignore duplicates in Party Facts import.

Add these ignored party IDs in a new column and provide examples for dataset transformation.

keep Manifesto Project country names

Marpor country names are adjusted to Party Facts country names for different names used.

Keep Marpor country names and rename Party Facts country names used for merging.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.