heatmaps for big spatial data with ESRI

Following along with https://github.com/Esri/gis-tools-for-hadoop/wiki/Aggregating-CSV-Data-%28Spatial-Binning%29, I demonstrate how to create spatial heatmaps with Esri's tools and open source tools like QGIS.

dependencies

We need the Esri Hadoop tools from https://github.com/Esri/spatial-framework-for-hadoop

Build them with the commands below. This requires:

Java 8 (JDK)
Maven
Git
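Before building, it can help to confirm that the tools listed above are on the PATH. The following is a minimal sketch: check_tools is a hypothetical helper, not part of either repository, and it only checks that the executables exist, not which versions they are.

```shell
# check_tools is a hypothetical helper: it reports every given
# executable that cannot be found on PATH and returns non-zero
# if at least one is missing.
check_tools() {
  missing=0
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing: $tool"
      missing=1
    fi
  done
  return "$missing"
}

# The three build prerequisites named above.
check_tools java mvn git || echo "install the missing tools before building"
```

The Java version still has to be checked by hand, for example with java -version.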

git clone https://github.com/Esri/spatial-framework-for-hadoop
cd spatial-framework-for-hadoop
git checkout c50c02d8d94f99b77df34aa3d57498aa8e23571b
mvn clean install

If you are on a Mac and see a test failure like the one reported in Esri/spatial-framework-for-hadoop#141:

 expected:<1974-08-3[1]> but was:<1974-08-3[0]>

skip the tests and install with:

mvn clean install -DskipTests=True

NOTE: Spark's Hive support must be enabled in order to register Esri's Hive UDFs; see https://stackoverflow.com/questions/40369170/registering-hive-custom-udf-with-spark-spark-sql-2-0-0

Unlike Esri, which serializes to JSON using custom Hive SerDes, I will serialize to WKT (well-known text), which is probably easier to integrate into QGIS as well as the Hadoop world (partitioning, ORC/Parquet).

Run the job with the following command; optionally, you can pass additional Spark configuration.

spark-submit --verbose \
        --class at.geoheil.app.SparkJob \
        target/scala-2.11/sparkMiniSample-assembly-0.1-SNAPSHOT.jar

WARN: currently, this fails due to shading problems; see pureconfig/pureconfig#333

notes regarding spark

A mini project to show how Hive SQL can easily be executed on Spark.

use sbt console to interactively run queries

or ./sync.sh to run assembly

or sbt run, but make sure to set SBT_OPTS to -Xmx8G -XX:+UseConcMarkSweepGC -XX:+CMSClassUnloadingEnabled -Xss2M, as Spark will be launched inside sbt
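One way to set these options, assuming a POSIX-style shell, is to export SBT_OPTS before starting sbt. This is a sketch rather than a required setup; the flags are exactly the ones given above, and the CMS flags match the Java 8 requirement (newer JDKs no longer ship the CMS collector).

```shell
# Export the JVM options so the next sbt invocation picks them up.
# Spark runs inside the sbt JVM, hence the larger heap and stack.
export SBT_OPTS="-Xmx8G -XX:+UseConcMarkSweepGC -XX:+CMSClassUnloadingEnabled -Xss2M"

# Then start the job from the project root:
#   sbt run
```

The export only lasts for the current shell session; adding it to a shell profile is an option if the project is run often.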

for development, use ~reStart in the sbt shell

also sbt test is useful ;)

