Code Monkey home page Code Monkey logo

bioinformatics-workflows's Introduction

Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers

GitHub release (latest SemVer)

Workflow managers provide an easy and intuitive way to simplify pipeline development. Here we provide basic proof-of-concept implementations for selected workflow managers. The analysis workflow is based on a small portion of an RNA-seq pipeline, using fastqc for quality controls and salmon for transcript quantification. These implementations are designed for basic illustrations. Workflow managers provide many more powerful features than what we use here, please visit the official documentations to explore those in detail.

The RNA-Seq workflow

The RNA-Seq analysis workflow performs quality controls with fastqc and quantifies transcripts expression using Salmon. Here we will use local installation (see documentation for salmon and fastqc). For the local installations you can add a symbolic link to the executables to your $PATH:

sudo ln -s /absolute/path/salmon/bin/salmon /usr/local/bin/salmon

sudo ln -s /absolute/path/FastQC/fastqc /usr/local/bin/fastqc

You can test the installation using the help function of the two tools (i.e. salmon -h and fastqc -h).

Test Data

This repository contains a simulated test data set which can be used to run the example implementations. The test data contains RNA-Seq reads (reads_1.fq.gz and reads_2.fq.gz), a transcriptome reference file (transcriptome.fa) and the true counts from the simulation experiments (truth.tsv)

Basic proof-of-concept implementations

Each workflow manager folder in this repository has a README detailing how to run the proof-of-concept pipeline. Implementations that are contributed and reviewed by developers from these workflow management systems are marked with ⭐ (see Acknowledgements):

Online Documentation for Workflow managers

Workflow managers have many more features which are not used in these implementations, and there are many additional workflow managers. You can read more about each workflow manager in their official documentation:

About this repository

The repository was created to illustrate features of workflow managers that are discussed in detail in this manuscript:

Wratten, L., Wilm, A. & Göke, J. Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers. Nat Methods (2021). https://doi.org/10.1038/s41592-021-01254-9 (Full text link: https://rdcu.be/cyjRN)

Contact and Call for Contribution

This repository was created by Laura Wratten. We very much encourage contributions by users of these workflows. If you would like to add an implementation for any of these workflow managers you can follow the template. If you would like to suggest changes to any of the existing implementations, please raise an issue and submit a pull request.

Acknowledgements

We would like to thank the following people for their contribution to this repository:

bioinformatics-workflows's People

Contributors

aurelg avatar bounlu avatar cclauss avatar dailydreaming avatar hexotical avatar johanneskoester avatar jonathangoeke avatar lwratten avatar mvdbeek avatar pditommaso avatar samuell avatar ssadedin avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

bioinformatics-workflows's Issues

Make the different implementations more accessible via a web interface

We should create a github pages based view on this repo which allows to switch between the description of the example analysis and the different implementations from the different systems in a visually pleasant way.

I personally don't have the time at the moment, and @jonathangoeke already mentioned that while it would be very interesting for them, his team does not have the experience necessary for that. Hence, we thought it make sense to ask for help from the community via this issue. Is there anybody interested in giving it a try?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.