Code Monkey home page Code Monkey logo

sage-data's Introduction

A Discussion on Issues around Making Data Available Alongside Research

Netlify Status

Abstract

Data makes science possible. Sharing data improves visibility, and makes the research process transparent. This increases trust in the work, and allows for independent reproduction of results. However, a large proportion of data from published research is often only available to the original authors. Despite the obvious benefits of sharing data, and scientists' advocating for the importance of sharing data, most advice on sharing data discusses its broader benefits, rather than the practical considerations of sharing.

This talk is framed as a discussion, and it covers some of pain points of sharing data alongside research. The talk will be framed as a discussion. One of the key messages we are trying to impart is sharing data falls on a continuum, and entering it should come with minimal barriers.

Slide available here

Working paper available here

Take home messages

  • You don't have to do every single thing to publish your data
  • Take small steps - get the data somewhere first, add more detail as you go
  • Try and get a DOI from a service like Zenodo or Dryad

Thanks

  • Karthik Ram
  • Miles McBain
  • Anna Kystalli
  • Daniella Lowenberg
  • ACEMS International Mobility Programme
  • Helmsley Charitable Trust
  • Gordon and Betty Moore Foundation
  • Sloan Foundation

Resources

Colophon

Bio

Dr. Nicholas Tierney (PhD. Statistics, BPsySci (Honours)) is a Lecturer in Business Analytics and Statistics at Monash University, working with Professors Dianne Cook and Rob Hyndman. His research aims to improve data analysis workflow, and make data analysis more accessible. Crucial to this work is producing high quality software to accompany each research idea. Mostly recently, Nick's work is focussing on exploring longitudinal data (brolgar), and improving how we share data alongside research ( ddd). Other work has focussed on exploring data with the R package visdat, and on creating analysis principles and tools to simplify working with, exploring, and modelling missing data with the package naniar. Nick has experience working with decision trees (treezy), optimisation (maxcovr), Bayesian Data Analysis, and MCMC diagnostics (mmcc.

Nick is a member of the rOpenSci collective, which works to make science open using R, has been the lead organiser for the rOpenSci ozunconf events from 2016-2018 (2016, 2017, 2018), and co-hosts the rstats podcast "Credibly Curious" with Dr. Saskia Freytag. Outside of research, Nick likes to hike, rockclimb, make coffee, bake sourdough, (eventually) knit a hat, take photos, and explore new hobbies.

sage-data's People

Contributors

njtierney avatar

Watchers

James Cloos avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.