Code Monkey home page Code Monkey logo

Comments (3)

albreez avatar albreez commented on June 7, 2024

I'm pasting my comments from Comma Chameleon as I think they are still relevant here.

It would be good if the tools used by data publishers could be combined into the one application. Data Curator would be the most logical place as it would enable people like myself to;

  1. import an Excel spreadsheet containing raw data, (DC does this)
  2. convert the data to CSV using a standard dialect, (DC does this)
  3. check the data for quality issues (which could be highlighted so the user can see them and decide to correct them or not)
  4. correct those errors,
  5. build a JSON schema which complies to the expected open data standard,
  6. validate the data against the schema and,
  7. create the data package
  8. test publish it to Octopub and even get feedback using Github
  9. All in the one place.

I love all the tools but a one stop shop would be a publisher’s dream.
I should also add that DC needs some rudimentary Excel-style functionality, such as the ability to resize columns and rows (and automatic resize all), so you can make some edits in DC without having to go back to Excel.

Please feel free to provide this feedback to the developers. I’m loving their work so far.

from data-curator.

Stephen-Gates avatar Stephen-Gates commented on June 7, 2024

Hi @albreez - thanks for your suggestions.

  1. should be done via Generate Data Quality Information that would produce a report

  2. I'm not sure how these quality issues could be corrected - do you have an example?

  3. Are you suggesting:

    • An Import a Schema function and then let user enter data and validate against that schema? This would be an extension of Import Column Properties
    • or do you mean, use set column/table/data package properties to define an Open Standard that does not have a JSON representation?

6. & 7. will be in release 1.0

  1. Octopub does not support this yet theodi/octopub#477

  2. I don't think we'll attempt to combine Data Curator and other desktop tools like Open Refine. Some of the schema definition and validation tools are being integrated into CKAN however that presumes you're happy to send your data to the web for validation. Data Curator's differentiator is that all work happens locally on your computer before publishing to the web.

Some column resizing is automatic but a little buggy on insert column. We'll look at the HandsonTable features to see what can be added easily. Feel free to add a feature request for this.

from data-curator.

Stephen-Gates avatar Stephen-Gates commented on June 7, 2024

@albreez just adding features that will behave like this example except for move columns.

Edit: However I've disabled Sorting until the Header Row feature is implemented to avoid header rows being sorted with values

from data-curator.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.