Code Monkey home page Code Monkey logo

haplotyped-methylome's Introduction

Haplotyped Methylome

bioRxiv Preprint

Reproducibility instructions for Gigante et al., 2018.

Note: this repository is still being tested! If you find a bug, please file an issue.

Data available at ENA Accession PRJEB27157.

Directed Acyclic Dependency Graph

System requirements

  • R
  • Python>=3.5
  • Lots of RAM (min 256GB)
  • Lots of disk space (estimate: 2TB)

Dependencies

To install with conda, run the following command.

conda env create -f environment.yml
source activate haplotyped_methylome

You will then need to install Albacore: it is available on the Nanopore Community.

To install without conda, see the list of dependencies at the bottom of this README.

Required data

For the standard workflow, snakemake will download all the necessary files.

If you wish to avoid running albacore, bwa and nanopolish on the raw nanopore data, you can run the following command, which downloads the output of these programs and tricks snakemake into thinking you have run the pipeline from the beginning:

snakemake intermediate_download

Note: the download link for these files is not yet available, but will be posted to ENA shortly.

If you wish to rerun from the beginning after running this command, you can revert to the original download with snakemake --forceall.

Running the workflow

To generate all plots, tables and notebooks, simply run from the root directory:

snakemake --cores 16

If you don't wish to run the full analysis, you can run specific rules from the Snakefile by running, for example:

snakemake --cores 16 rnaseq_analysis
snakemake --cores 16 haplotype_analysis
snakemake --cores 16 methylation_analysis

Installation without conda

Software dependencies:

Python package dependencies:

pip install --user -r requirements.txt

R package dependencies:

Rscript install_R_deps.R

Known Issues

  • rnaseq_analys.Rmd failed: there is no package called 'ggrastr'

If devtools doesn't play nicely with conda, sometimes the automatic GitHub installation of ggrastr fails. You can resolve if as follows:

git clone --depth 1 https://github.com/VPetukhov/ggrastr.git
cd ggrastr
R -e 'devtools::install()'

Directed Acyclic Dependency Graph: Methylation

Directed Acyclic Dependency Graph: Haplotyping

Directed Acyclic Dependency Graph: RNA-seq

haplotyped-methylome's People

Contributors

scottgigante avatar

Watchers

James Cloos avatar Wtong avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.