Comments (4)
I do realise now that this has been mentioned in part in #79 and also relates to #56 and #38.
from openrefine-socialsci.
I think we should look at the Library Carpentry lesson on OpenRefine for clearer use cases in the introduction episode: splitting data elements into different columns, normalising date formats and maybe matching/enhancing. This would go instead of the Motivations section, which is currently written for potential instructors (I feel).
Let's replace the Features and Getting help sections with How is OR different from spreadsheet applications? and When would you write a script instead of using OR?.
Spreadsheets
- OR is not for creating data and doesn't handle colours/formulae/comments/... in cells
- it is easier to undo/redo actions, especially applying actions on different files
- find and replace works on everything or one thing at a time, OR allows row selection and works in one column at a time
- OR has clustering so that you don't have to be aware of which variations exist, unlike in spreadsheets.
- you can load spreadsheets, but they need to be tabular
Scripts
- OR is more for exploratory cleaning, scripts are more useful when you know what to fix
- use scripts when you have too many data for OR
- use scripts when OR cannot do what you want, like fixing data using a machine-learning model.
from openrefine-socialsci.
From #37:
- Change learning objective "describe use cases" to understand how OpenRefine compares to spreadsheet apps and scripting".
from openrefine-socialsci.
Perhaps it's also useful to distinguish OR from using SQL with a relational database. SQL also allows selection of rows and creating derivative columns. The cross
function allows to join data from different projects, like JOIN
in SQL. (cross
is not currently part of the lesson, but I have used it myself.)
from openrefine-socialsci.
Related Issues (20)
- OpenRefine community no longer uses Google Groups HOT 2
- Awkward wording Trim Leading and Trailing Whitespace HOT 2
- Scheduling early transition to Workbench HOT 16
- Replace section that only links to "more on clustering" HOT 1
- Update files after Workbench migration
- Section on contributing in the README needs to refer to the Workbench way
- Move Instructor notes to appropriate episodes as inline notes HOT 3
- Replace remaining `{{ page.root }}{% link setup.md %}` links
- `[repo]` link in CONTRIBUTING.md is incorrect
- Add content to learner profiles HOT 3
- Explain how to quit OpenRefine correctly HOT 1
- Set descriptive titles for exercises
- Add more explanation to Transform(ation)s HOT 1
- Links need to be fixed in CONTRIBUTING.md HOT 1
- Update alt texts to describe what the screenshot is showing
- Update glossary with better definitions and links to episode or manual HOT 1
- Update Other Resources episode with Instructor note, additional resources
- Introduce more GREL expressions
- Introduce reconciliation HOT 1
- Introduce OpenRefine extensions and alternative distributions
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from openrefine-socialsci.