Code Monkey home page Code Monkey logo

cut_and_run's Introduction

Overview

These are collected scripts that constitute a pipeline to analyse Cut and Run / Cut and Tag datasets. The scripts have been taken from the tutorial put together by Ye Zheng, Kami Ahmad and Steven Henikoff. The scripts have been modified to facilitate processing - see below for further details.

Pre-pipeline steps

QC

Use FASTQC and FastQ Screen.

Trimming

Use trim galore if reads >25bp

Pipeline

Pipeline details: https://yezhengstat.github.io/CUTTag_tutorial/

Prep for pipeline

Filenames

Input fastq files need to be of the format when passing to the pipeline

[sample_name]_rep[1,2,3,...n].fastq.gz

No underscores are allowed in the sample_name.

Create a chromosome sizes file for the genome (not the spike-in genome)

wget https://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/faSize

./faSize -detailed -tab file.fasta

Mapping

Place FASTQ files in a folder named 'fastq'

nohup bash ./mapping.sh > log.mapping.sh.log &

Spike in

nohup bash ./spike_in.sh > log.spike_in.sh.log &

Summarise mapping

Rscript alignment_summary.R

Identify / remove duplicates

nohup bash analyse_duplicates.sh > log.analyse_duplicates.sh.log &

Rscript summarise_duplicates.R

Assess mapped fragment size distribution

nohup bash analyse_frag_size.sh > log.analyse_frag_size.out &

Rscript summarise_frag_length.R

Filter by read quality

nohup bash filter_quality.sh > log_filter_quality.sh.log &

Convert to different file formats

nohup bash file_format_conversion.sh > log.file_format_conversion.sh.out &

Assess replicate reproducibility

nohup bash correlation_matrix.sh > log.correlation_matrix.sh.out &

Rscript correlation_matrix.R

Spike-in calibration

nohup bash spike_in_calibration.sh > log.spike_in_calibration.sh.out &

Rscript spike_in_calibration.R

Peak calling

nohup bash peak_calling.sh > log.peak_calling.sh.out &

Rscript summarise_peaks.R

Visualisation

nohup bash visualisation.sh > log.visualisation.sh.out &

cut_and_run's People

Contributors

stevenwingett avatar

Watchers

 avatar

Forkers

lmb-seq

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.