Code Monkey home page Code Monkey logo

gv_oocyte_paisoseqanalysis's Introduction

Scripts used for analyzing PAIso-Seq data

Welcome to PAIso-Seq.

Software Dependencies

This analysis pipeline is implemented in Perl and R. Packages and software Versions used are listed below:

Perl v5.26.2
R 3.5.1
minimap2 2.15-r905

Analysis of PAIso-Seq data

Step 1. Known the library structure of PAIso-Seq data

Library structure:
    Forward style: 5'-TSO--->cDNA--->AAAAA...AAAAA->Barcode->Adapter-3'
    Reverse style: 5'-Adapter-Barcode->TTTTT...TTTTT->cDNA->TSO-3'
5'-TSO Sequence: 5'-AAGCAGTGGTATCAACGCAGAGTACATGGG-3' (30 nt)
3'-Adapter Sequence: 5'-GTACTCTGCGTTGATACCACTGCTT-3' (25 nt)
Barcode Sequence: 5'-GAGTGCTACTCTAGTA-3' (16 nt)

Step 2. Circular Consensus Sequence calling

ccs movieX.subreads.bam movieX.ccs.bam --noPolish --minPasses 1 &>ccs.log

However, We now suggest the following command:

ccs movieX.subreads.bam movieX.ccs.bam --richQVs &>ccs.log

The output file is movieX.ccs.bam
Reference: https://github.com/PacificBiosciences/IsoSeq3/blob/master/README_v3.2.md

Step 3. Demultiplex and trim adapters


Convert movieX.ccs.bam to fasta:

bam2fasta -u -o ccs movieX.ccs.bam

The output file is movieX.ccs.fasta

Extract data and trim adapters:

./scripts/trim.py movieX.ccs.fasta sample GAGTGCTACTCTAGTAGTACTCTGCGTTGATACCACTGCTT 22 2 1>sample.out.fasta 2>sample.err.fasta

Step 4. Calculate poly(A) tail length and call non-A residues

Create minimap2 index :

minimap2 -x splice -t 20 -d Mus_musculus.mmi Mus_musculus.GRCm38.dna.toplevel.chromosome.fa &>index.log

Collect CCS passes :

./scripts/GetCCSpass.pl movieX.ccs.bam >ccs.pass.txt

Run the pipeline:

perl run.pl --ccs sample.out.fasta --sample sampleName --species mm10 --minimap2_index Mus_musculus.mmi --minimap2_thread  10 --pass ccs.pass.txt &>run.log

gv_oocyte_paisoseqanalysis's People

Contributors

niehu2018 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.