This repository contains documentation for developers including:
- Writing Scrapers using Pupa
- Open Civic Data's Data Type Specifications
- Open Civic Data Proposals
Read these docs at https://open-civic-data.readthedocs.io/en/latest/
Canadian legislative scrapers
License: MIT License
This repository contains documentation for developers including:
Read these docs at https://open-civic-data.readthedocs.io/en/latest/
Still on ScraperWiki
Found a PDF - to determine whether easy enough to parse: http://www.community.gov.yk.ca/pdf/loc_govdir.pdf
Found a source: http://web5.gov.mb.ca/Public/municipalities.aspx
Scraperwiki:
New:
@todo
)@todo
)There's a broken scraper at https://classic.scraperwiki.com/scrapers/quebec_council/ I don't know if it ever worked, to be honest, so feel free to start from scratch.
All representatives should have a district_name
for CSV exports to work nicely. This will automatically be completed once all Scraperwiki scrapers are converted to Pupa.
In other words, export it as JSON that looks like the JSON produced by ScraperWiki.
A LaSalle scraper was written for LaSalle, Ontario by mistake. Write another for the borough of Montreal.
The scraper is broken, so may as well port it as part of the fix:
See update_from_data_source
in models.py
. We will need to move this code to transition to Imago.
https://github.com/rhymeswithcycle/represent-reps/blob/master/representatives/models.py#L144
treat the 'note' field for contact_details as the 'type' field for offices, eg: 'constituency', 'office', or 'home'
capture position of representative
Anything to do before merging?
Shapefile was recently added to Represent
Shapefile was recently added to Represent
Still on Scraperwiki. Might be fixed by #42.
Work through the broken scrapers at http://scrapers.herokuapp.com/
Ignore:
_municipalities
Previous issue content
Reserved for Matthew Leon:
Priority (already in Represent):
Shapefile was recently added to Represent
It's not raising an error, but it is only scraping the mayor currently:
https://classic.scraperwiki.com/scrapers/edmonton_city_councillors/
The government website simply links to municipal websites. AMO simply lists members on its website.
AMO charges $1800 for the contact information, and AMTCO charges $300 top-level contact information.
The scraper is broken, so may as well port it as part of the fix:
https://classic.scraperwiki.com/scrapers/windsor_on_council/
Compare the QC status to the census subdivision type to see if they match.
Once all the scrapers are in Pupa:
scraperwiki_spec.rb
from represent-canada-dashboard@todo
in represent-repsAdd a mapping. http://donnees.ville.montreal.qc.ca/fiche/arros-liste/
https://github.com/opencivicdata/imago
See also https://github.com/opencivicdata/opencivicdata-api that brings all API components together.
Probably just requires a quick fix. Up to you whether it's worth the effort to switch to Pupa now.
See http://scrapers.herokuapp.com/warnings/
Valid warnings have no checkbox:
boundary_url
)Resolved:
I have a very complex script that scrapes URLs of municipalities across Canada from the FCM's website, and matches them to a census subdivision according to their name and province. It produces this file.
There are other sources of URLs, such as CivicInfoBC. I wouldn't base myself off the script I'd written, and instead just start by trying to do straight name matching against the list of census subdivisions for BC (just limit yourself to those whose code starts with "59" to get only BC). There are only a few subdivisions with the same name in BC, in which case you need to match on name plus type.
If this works, we can add others for other provinces. You can write the script in a fork of ocd-division-ids.
Shapefile was recently added to Represent
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.