parkapi's People

Contributors

augustqu, balzer82, defgsus, hbruch, jb3-2, jklmnn, justusadam, kawie, kiliankoe, leonardehrenfried, lucaswo, manuelciosici, mic92, mtrnord, nicomue7, robtranquillo, sibbl, stepan-romankov, ubahnverleih, ylabonte

parkapi's Issues

IDs?

@jklmnn suggested that we remove city IDs altogether and reference cities by their real names only. I'm not quite sure how we'd deal with duplicates, but given the current metadata format we'd run into that problem anyway. Otherwise it would make things much easier.

Any suggestions?
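
A minimal sketch of how name-based references could stay unique; the slug helper and the region suffix for duplicates are assumptions, not existing code:

import re

def city_slug(name, region=None):
    """Derive an identifier from the real name; an optional region
    suffix disambiguates duplicates (e.g. two cities named Neustadt)."""
    slug = re.sub(r"[^a-z0-9]", "", name.lower())
    if region:
        slug += "-" + re.sub(r"[^a-z0-9]", "", region.lower())
    return slug

print(city_slug("Dresden"))              # dresden
print(city_slug("Neustadt", "Sachsen"))  # neustadt-sachsen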

Verify integrity of scraped data

We should be notified as soon as possible when the format of a source page changes. A big plus would be Slack integration 🎉

It'd also be nice to periodically and automatically update the test fixtures in this repo.
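
A minimal sketch of such a check, assuming lot dicts with the field names that appear in the JSON examples further down this issue list:

LOT_KEYS = {"name", "free", "total", "coords"}

def missing_keys(lots):
    """Collect keys that disappeared from the scraped lots, so a watcher
    (e.g. a Slack bot) can alert us that a source page probably changed."""
    missing = set()
    for lot in lots:
        missing |= LOT_KEYS - lot.keys()
    return missing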

sample_city.py should run

The example script should be runnable by the server. It doesn't have to produce meaningful data, but the server shouldn't crash (as it does now); it should at least respond with a JSON error. The example should also contain all important objects and variables (possibly with dummy data or none at all, but syntactically correct).
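
A hedged sketch of the minimal shape that would keep the server alive; the function name and the return format are assumptions based on the other issues here:

def parse_html(html):
    """Parse the downloaded page. Dummy output is fine, as long as it is
    syntactically complete enough for the server to serve it."""
    return {"lots": [], "last_updated": None}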

<city>.py: badly defined variables

The variables

data_url = ""
data_source = ""
city_name = ""
file_name = ""
detail_url = ""

are somewhat redundant. Please make the purpose of each variable clear from its name, or add a comment; see the sketch below.
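
A possible commented version; the descriptions are assumptions about how each variable is currently used:

data_url = ""     # URL of the page the scraper downloads
data_source = ""  # attribution: who provides the data
city_name = ""    # human-readable city name
file_name = ""    # basename used for fixtures and output files
detail_url = ""   # per-lot detail page, if the source offers one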

Adjust for server timezone

Otherwise the last-downloaded and last-updated times can differ by several hours, which is kinda weird...
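
One way to avoid this (a sketch using zoneinfo from Python 3.9+; pytz would be the equivalent on older versions): store both timestamps timezone-aware and compare them in UTC.

from datetime import datetime, timezone
from zoneinfo import ZoneInfo

downloaded_at = datetime.now(timezone.utc)
# If a source page prints local time, attach its zone before comparing:
updated_at = datetime(2015, 11, 2, 14, 30, tzinfo=ZoneInfo("Europe/Berlin"))
print(updated_at.astimezone(timezone.utc))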

Database connection

Opened this issue to collect some stuff about the database.

I'm planning to write a database connector that connects to a PostgreSQL db. It will store rows with id, timestamp_updated, timestamp_downloaded, city and data attributes; data contains a JSON dump of whatever the scraper acquires (Postgres is pretty damn nice!).

The scraper is then run periodically and talks to the db connector. Before the connector saves anything to the database, it first verifies that the data looks OK (#6). If it doesn't, it'd probably be best to notify us about it. A Slack bot would be pretty damn sweet!
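
A hedged sketch of the connector; the column names come from this issue, while the JSONB type and the function shapes are assumptions:

import json
import psycopg2

SCHEMA = """
CREATE TABLE IF NOT EXISTS parkapi (
    id SERIAL PRIMARY KEY,
    timestamp_updated TIMESTAMP,
    timestamp_downloaded TIMESTAMP,
    city TEXT,
    data JSONB
)
"""

def ensure_schema(conn):
    with conn.cursor() as cur:
        cur.execute(SCHEMA)
    conn.commit()

def save(conn, city, updated, downloaded, data):
    with conn.cursor() as cur:
        cur.execute(
            "INSERT INTO parkapi"
            " (timestamp_updated, timestamp_downloaded, city, data)"
            " VALUES (%s, %s, %s, %s)",
            (updated, downloaded, city, json.dumps(data)),
        )
    conn.commit()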

Best practice for missing data

sample_city.py should document how to handle missing data fields: should the lines be commented out, deleted, or should the values be set to false, null, ...?
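
One possible convention, as an assumption pending a decision here: omit unknown fields entirely instead of inventing values, so consumers can tell "unknown" apart from "zero" or "false".

lot = {
    "name": "Altmarkt",
    "free": 42,
    # "total" omitted: the source doesn't publish a capacity
}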

Improve scraping 'logistics'

Specifically:

  • Specify data as generically as possible in the city files (config files alone would be perfect, but that's probably not feasible), so that cities can easily be added without writing a Python file that handles all the scraping and returns a finished dict
  • Write a separate scraper that handles all the fetching and brings the data into the correct format for every city, so that surprisingly divergent output is no longer possible
  • Dynamically import city files from the scraper (see the sketch after this section)
  • Handle a database that both scraper and server can talk to

For the time being the server will probably talk to the scraper directly, but the scraper should soon be able to run on its own and store its results in a database (I really like the idea of just throwing the JSON into a MongoDB instance; clear up #4 first though!), which the server then queries for current data. That way the scraper can run periodically (as easily as via cron) and the server only touches previously saved data (usually the most current) without the two getting in each other's way.
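
A minimal sketch of the dynamic import, assuming each city lives in cities/<name>.py and exposes the module-level variables and parse function discussed in the other issues:

import importlib
from pathlib import Path

def load_city_modules(package="cities"):
    """Import every city module so the scraper can treat them uniformly.
    Assumes the directory is a package (i.e. it has an __init__.py)."""
    modules = {}
    for path in Path(package).glob("*.py"):
        if path.stem != "__init__":
            modules[path.stem] = importlib.import_module(f"{package}.{path.stem}")
    return modules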

Separate data structure for geolocation

Instead of saving the geolocations inside the city scraper (<city>.py), they should be stored in a separate file. That makes manual updates easy, in addition to the automatic update that happens whenever a new parking lot is scraped.

The new file should live next to <city>.py in /cities and should have the suffix .geo, like this:

cities/
  dresden.py
  dresden.geo
  Luebeck.py
  Luebeck.geo

and the content should look like this:

{
  "Altmarkt": { "lat": 51.05031, "lon": 13.73754 },
  "An der Frauenkirche": { "lat": 51.05165, "lon": 13.7439 }
}

As a later improvement we could switch to GeoJSON, so GitHub can render the file on a map by itself:
https://help.github.com/articles/mapping-geojson-files-on-github/
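
A minimal sketch of merging such a .geo file into scraped lots; the lot structure follows the JSON examples in these issues:

import json

def attach_coords(lots, geo_path):
    with open(geo_path, encoding="utf-8") as f:
        geo = json.load(f)
    for lot in lots:
        if lot["name"] in geo:
            lot["coords"] = geo[lot["name"]]  # {"lat": ..., "lon": ...}
    return lots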

[Lübeck] Geodata

Geodata currently exists, but it's incomplete.

If anyone finds a way to gather the coordinates from here (there's some stuff happening in kwl_maps.js and elsewhere on that page, but I can't find the data), please add them to Luebeck.geojson.
Or from anywhere else, of course...

Add forecast data

Include @balzer82's forecast data. Maybe a route like /city/forecast/daterange (see the sketch below)? CSV would probably still be the best option for this, I guess.
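
A hedged sketch of the route, assuming a Flask-style server; the helper is hypothetical:

from flask import Flask, Response

app = Flask(__name__)

def load_forecast_csv(city, start, end):
    """Hypothetical helper that would read @balzer82's CSV for the range."""
    return "timestamp,free\n"

@app.route("/<city>/forecast/<start>/<end>")
def forecast(city, start, end):
    return Response(load_forecast_csv(city, start, end), mimetype="text/csv")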

Invalid free counts

{
  "address": "Ferdinandplatz",
  "coords": {
    "lat": 51.04645,
    "lng": 13.73988
  },
  "forecast": false,
  "free": 8259,
  "id": "dresdenferdinandplatz",
  "lot_type": "Parkplatz",
  "name": "Ferdinandplatz",
  "region": "Prager Strasse",
  "state": "open",
  "total": 140
}

Just got this in Dresden... 8259 free spaces would be nice, but that's probably not correct.
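
A minimal plausibility check for this class of bug (a sketch, not existing code):

def plausible(lot):
    return 0 <= lot["free"] <= lot["total"]

lot = {"name": "Ferdinandplatz", "free": 8259, "total": 140}
if not plausible(lot):
    print("implausible count for %s: %d/%d"
          % (lot["name"], lot["free"], lot["total"]))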

Python version

Shouldn't we just go with Python 3 and keep things clean there?

Legal stuff

It might not be a bad idea to check the imprints of all the sites we're scraping for clauses that might prohibit scraping, and to contact the relevant departments to make sure it's all good.

It might also be a good idea to include a link to the original page, a name, or a copyright notice (or all of the above?) in the JSON output; see the example below.
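
A hedged example of what such attribution could look like in the output; the field names and values are assumptions pending a decision in this issue:

{
  "last_updated": "2015-11-02T14:30:00",
  "data_source": {
    "url": "http://example.com/parken",
    "name": "Landeshauptstadt Beispielstadt"
  },
  "lots": []
}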
