Scripts for running OpenAddresses on a complete data set and publishing the results. Uses OpenAddresses data sources to work.
This code is being used to process the complete OA dataset on an expected-weekly basis, with output visible at data.openaddresses.io.
Installation scripts for preparing a fresh install of Ubuntu 14.04 can be found
in chef
. Run them from a Git checkout like this:
sudo apt-get update
sudo chef/run.sh
Complete sources will be checked out to /var/opt/openaddresses/sources
.
Run a single source locally with openaddr-process-one
:
openaddr-process-one -l <log> <path to source JSON> <output directory>
For more than one source file, OpenAddresses requires Amazon S3 to work.
You can set the environment variables AWS_ACCESS_KEY_ID
and
AWS_SECRET_ACCESS_KEY
or provide values as arguments to openaddr-process
.
Run the complete process with openaddr-process
:
openaddr-process -a <AWS key> -s <AWS secret> -l <log> data.openaddresses.io
Run it on an Amazon EC2 spot instance with openaddr-ec2-run
:
openaddr-ec2-run -a <AWS key> -s <AWS secret> data.openaddresses.io
Modify the contents of openaddr/paths.py
with locations
of your local openaddresses.
Test the OpenAddresses machine with test.py
:
python test.py
Run the complete process from the openaddr
module:
python -m openaddr.process_all -a <AWS key> -s <AWS secret> -l <log> data.openaddresses.io
Convert remote ESRI feature services to GeoJSON with openaddr-esri2geojson
:
openaddr-esri2geojson <ESRI URL> <GeoJSON path>