Comments (8)
A nice overview of some common pitfalls of using Excel in bioinformatics can be found in "Mistaken Identifiers: Gene name errors can be introduced inadvertently when using Excel in bioinformatics", Zeeberg et al., BMC Bioinformatics 2004, 5:80. It contains an amusing example of the RIKEN identifier "2310009E13" being converted to a floating-point number, and various gene name -> date conversion horrors.
note added later: I now see that Mateusz' link mentions the same paper. Oh well
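To make the RIKEN example concrete, here is a minimal Python sketch of why "2310009E13" is at risk: the string happens to be valid scientific notation, so any parser that guesses types can read it as a number, and the original text cannot be recovered from the result. The helper name `parses_as_number` and the regular expression are illustrative assumptions, not Excel's actual parsing rules.

```python
import re

# Hypothetical helper: flags identifiers that a type-guessing parser could
# read as scientific notation (digits, an "E", then more digits).
SCI_NOTATION = re.compile(r"\d+(\.\d+)?[eE][+-]?\d+")


def parses_as_number(identifier: str) -> bool:
    """Return True if the whole identifier looks like scientific notation."""
    return SCI_NOTATION.fullmatch(identifier) is not None


riken_id = "2310009E13"
as_float = float(riken_id)    # Python's float() accepts this string as a number
round_trip = str(as_float)    # the text you get back after numeric conversion

print(parses_as_number(riken_id))   # the identifier is ambiguous
print(round_trip == riken_id)       # the original text is not what comes back
```

Keeping such columns as text on import (rather than letting the tool guess a type) avoids the problem in the first place.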
from spreadsheet-ecology-lesson.
Thanks, these are good pointers.
This issue might be good to move over to the organization-genomics lesson https://github.com/datacarpentry/organization-genomics
@mkuzak and @plijnzaad ok if we move it over there?
Agreed that it would be good to move this to the genomics lessons as it seems more relevant there. @mkuzak and @plijnzaad would this be ok?
I think it's relevant to both. It's an example from genomics but deals with what not to do in the spreadsheet.
Thanks @mkuzak. I think this is likely a change that won't happen before the release of this lesson at the end of the month. Just by way of record-keeping, I'm adding an "after-lesson-release" tag here. If you (or someone else) would like to incorporate this before lesson release, please let me know and I'll remove the tag!
@ErinBecker, I think @mkuzak is correct and this is relevant to both. OMG I had fits working with all the Septins (SEP1-13) and Membrane Associated Receptors (MAR) of the dog genome. The topic fits in well with the use of dates and proper formatting. It's also important because some of the changes Excel makes cannot be changed back to the original text!! I'm not saying I'll have time to write up a good exercise, but the sooner the better.
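As a starting point for an exercise, a rough Python sketch of a pre-flight check could flag gene symbols that look like a month abbreviation followed by a day number before the file is ever opened in a spreadsheet. The function name `at_risk_of_date_conversion`, the month list, and the pattern are assumptions made for illustration; this is a heuristic, not a description of what Excel actually does internally.

```python
import re

# Assumed list of month spellings a spreadsheet might recognise; illustrative only.
MONTHS = "JAN|FEB|MARCH|MAR|APR|MAY|JUN|JUL|AUG|SEPT|SEP|OCT|NOV|DEC"

# Month name, optional hyphen, then a one- or two-digit number (e.g. SEP1, MAR-3).
DATE_LIKE = re.compile(rf"(?:{MONTHS})-?\d{{1,2}}", re.IGNORECASE)


def at_risk_of_date_conversion(symbol: str) -> bool:
    """Heuristic: True if the symbol could be mistaken for a date."""
    return DATE_LIKE.fullmatch(symbol.strip()) is not None


symbols = ["SEP1", "SEP13", "MAR3", "BRCA1", "TP53"]
flagged = [s for s in symbols if at_risk_of_date_conversion(s)]
print(flagged)
```

Symbols that get flagged are the ones to import as text, or to check by hand after a round trip through the spreadsheet.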
More descriptive examples are required for data cleaning and formulae. I have found that the "Concatenate" formula really comes in handy in my field of work.
It looks like this was addressed in #231, so I'm going to close this issue. @mkuzak if you think more details are needed, please feel free to reopen. Thanks!