Comments (3)
I'm pasting my comments from Comma Chameleon as I think they are still relevant here.
It would be good if the tools used by data publishers could be combined into the one application. Data Curator would be the most logical place as it would enable people like myself to;
- import an Excel spreadsheet containing raw data, (DC does this)
- convert the data to CSV using a standard dialect, (DC does this)
- check the data for quality issues (which could be highlighted so the user can see them and decide to correct them or not)
- correct those errors,
- build a JSON schema which complies to the expected open data standard,
- validate the data against the schema and,
- create the data package
- test publish it to Octopub and even get feedback using Github
- All in the one place.
I love all the tools but a one stop shop would be a publisher’s dream.
I should also add that DC needs some rudimentary Excel-style functionality, such as the ability to resize columns and rows (and automatic resize all), so you can make some edits in DC without having to go back to Excel.
Please feel free to provide this feedback to the developers. I’m loving their work so far.
from data-curator.
Hi @albreez - thanks for your suggestions.
-
should be done via Generate Data Quality Information that would produce a report
-
I'm not sure how these quality issues could be corrected - do you have an example?
-
Are you suggesting:
- An
Import a Schema
function and then let user enter data and validate against that schema? This would be an extension of Import Column Properties - or do you mean, use set column/table/data package properties to define an Open Standard that does not have a JSON representation?
- An
6. & 7. will be in release 1.0
-
Octopub does not support this yet theodi/octopub#477
-
I don't think we'll attempt to combine Data Curator and other desktop tools like Open Refine. Some of the schema definition and validation tools are being integrated into CKAN however that presumes you're happy to send your data to the web for validation. Data Curator's differentiator is that all work happens locally on your computer before publishing to the web.
Some column resizing is automatic but a little buggy on insert column. We'll look at the HandsonTable features to see what can be added easily. Feel free to add a feature request for this.
from data-curator.
@albreez just adding features that will behave like this example except for move columns.
Edit: However I've disabled Sorting until the Header Row feature is implemented to avoid header rows being sorted with values
from data-curator.
Related Issues (20)
- Opening Excel wordsheet, if cancelled leaves menu item disable
- Use latest version of tableschema.js HOT 1
- Open data package .zip from local drive opens additional tab HOT 1
- Overwriting data and resetting data
- Import Data Package Properties from file
- Import Table Properties from file
- Export Table Properties
- Hashing of properties or resources HOT 1
- Help -> Data Curator Help dead link HOT 1
- Can no longer export HOT 2
- Where does data package export file go? HOT 1
- Custom Properties Type checkboxes displaying over option text HOT 1
- Open Office Calc xls format, removes blank fields shifts left HOT 1
- Default Contributor Title in Preferences is misleading HOT 2
- Link to help broken HOT 2
- link to data packages on main page incorrect HOT 1
- query re iterative data HOT 4
- MacOS binaries can't run HOT 3
- Consider re-visiting display of frictionless errors on Import
- Unable to install Data Curator on my Windows HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from data-curator.