okfn / dataportals.org Goto Github PK
View Code? Open in Web Editor NEWOpen Data Portals and Sites around the world
Home Page: http://dataportals.org/
Open Data Portals and Sites around the world
Home Page: http://dataportals.org/
Going for Node over pure browser JS because:
I'd like to add/update a few Irish catalogs. As an admin of the old CKAN-based site, I could have done that easily, so what's the workflow now? Should I still go through the Google Form, or submit a pull request with changes to /data/catalogs.csv
, or…?
At the moment to reload the database after a change requires doing heroku restart
. Provide a url like admin/reload/
that will reload the spreadsheet DB on demand.
We have been migrating over to new DB (google spreadsheet!)
Question is what fields should we have?
Outlining suggested fields in this spreadsheet template sheet.
maintainer
and maintainer_url
field for organization or person maintaining the portal.license
field - shall we normalize in some way, do we want a link to the license page or evidence on source websitemetadata_modified
and metadata_created
? Latter is easy to support but former is hard with google spreadsheet DBopen
boolean field to indicate whether the data in the portal is open or not
open_percent
for mixed portalsdate_launched
date the portal launchedmetadata_contributor
to credit folks contributing info (very spotty in current data and not very systematic).groups
and tags
- no reason to keep separate ...generator
- name of software that powers the portalSearch box is not available from
*http://datacatalogs.org/search ('browse'page)
*http://datacatalogs.org/catalog/[name] (individual record page)
To access the search box user must visit application root http://datacatalogs.org/
Consider moving search box into header (navbar navbar-fixed-top) - for consistent user experience and improved access to search.
Will attempt PR if agreed worthwhile.
Geocode all existing catalogs in the DB
Have done some but more to do.
Can generally do this by getting the place
field sorted and then using ImportXML plus nominatim - see http://schoolofdata.org/2013/02/19/geocoding-part-ii-geocoding-data-in-a-google-docs-spreadsheet/
The values displayed on the front-end for this record http://datacatalogs.org/catalog/alabama
do not correctly map for Issued & License values
Issued = Active, License = US/en
Couldn't find another record with a similar issue.
copy over
When you type a search query that has no results there's no explicit indication that there are no results.
Perhaps this issue could be included as part of #16?
Follow https://github.com/okfn/datasets.okfnlabs.org#deployment-to-heroku
Deploy at http://new.datacatalogs.org for the present
Open data portals is perhaps a better and more appropriate naming than open data catalogs
Site url would go from datacatalogs.org => dataportals.org
Note: the field 'license_open' is still up for discussion. We will now have scope to put a page explaining the situation for mixed licenses under license_url if 'license_id' cannot be filled in or is filled in with something ambiguous. The question is whether catalogs that don't have a single license ought to have license_open set to FALSE or empty.
A further discussion topic is whether we want to have scope for generating stats on the catalogs (e.g. number of datasets, or catalog last updated) and storing them in some way.
Consider making it easier for others to discover API of catalogue by placing a hyperlink to http://datacatalogs.org/api/data.json .
In the Google sheet (curated Tab) has 426 rows. As one row is a header, there should be 425 portals but the website shows 424.
Can be published CSV from Google, datapackage, or anything
A portal was reported as no longer being available. I changed the value in the Google sheet from active
to inactive
thinking this would hide the portal on DataPortals.org. This only changed the Status and the portal still appears in the website.
How should a portal be removed/hidden?
inactive
sitesWe can just copy and paste from the one in http://census.okfn.org
I'm not sure what the wrench is doing next to all the catalogs. I’d remove it and move the list of tags in its place, which will shorten the length of the page.
Do we want to do some pruning/structuring of the metadata inside the infoboxes on the map?
Suggest:
moderated
and moderatorcomments
columns to DBmoderated
= 1 means that item will not be shown on sitemoderatorcomments
are comments from moderator as to why it is moderatedExample: http://datacatalogs.org/catalog/eduinfo may be spam ...
How would one bulk submit data portals to this list?
Enable notifications for Editors on dataportal.org
Although the website footer currently features an Open Data badge and the submission form specifies that all entries are licensed under CCZero and/or PDDL, the license is not clearly evident for people interested in using the raw data. This could be taken care of together with #33.
Also create a form for submitting new items
Suggest we use google forms
Editors can enter direct into the spreadsheet db
After the merge of groups + tags (done quickly with GSheets formula), there are duplicates. This is probably more easily dealt with with a quick python script once we move to CSV (Issue #35). Or we could add it to the model code and do it at (re)load.
The used version includes an old Leaflet. Would be cool if you can upgrade to a commit after datopian/datahub#431
Hi there,
Is anybody updating and adding new open data portals on the map? Ones submitted via Google Forms on dataportals.org.
Cheers!
I notice that some of the info for the new metadata (Issue #34) was already being collected on the form (e.g. launch date and publisher type) whereas some of the old metadata (tags) was not being collected. What is the current workflow? Do we want a second sheet that draws on the responses sheet to at least reshuffle the form responses so its easier to copy/paste? Or do we rather want to encourage 90% of contributors to edit the DB directly and we can cope with the other 10% being a bit of manual work?
Brisbane City Council in Australia has updated its open data portal to https://www.data.brisbane.qld.gov.au. How do you edit an existing entry?
Where a record term (e.g. its Title) includes a special character (e.g. comma, exclamation, period ) the search will not return the record.
Test cases include:
record can be found with other terms (e.g. search for ontario city feedback )
search for falls, returns no results b/c 'Falls' always has a comma after it in a record
record can be found with other search terms (e.g. search for Niagara)
search for EDINA and search for geo, both do not return Go-Geo! University of Edinburgh b/c EDINA always followed by comma, and Geo is followed by exclamation mark
Not sure if this is an error in the spreadsheet data or the information box.
Spreadsheet description column has cells with markdown. The information box does not process the markdown.
Could be added to #37
As discussed on the OK Discussion Forum, I'm exploring using a custom map pin on the DataPortals.org map and the map pin image as a favourite icon and apple touch icon. It seems possible to use a custom map pin - do you think we should make this change?
If so, I'm happy to work to sort out the design and provide the image files.
I will follow these requirements unless told otherwise:
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.