Transonerate

A Ruby gem that generates a per-contig score using exonerate and a genome.

Usage

transonerate --assembly <your_assembly> --genome <genome> --annotation <existing_annotation> --left <fastq_files> --right <fastq_files>

Musings

Idea for getting a per-contig score using exonerate: we want to show that the transrate score, which is computed using reads only, is highly correlated with this exonerate score, which requires a genome and a gtf.

Align the reads to the genome using tophat/cufflinks and create a new gtf file that contains all the previously annotated regions and also any new ones that it thinks exist based on the read evidence.

In exonerate we can set the --ryo ("roll your own" output) option to

--ryo "%qi\t%ti\t%pi\t%qab\t%qae\t%tab\t%tae\t%ql\n"

This outputs a tab-separated, blast6-like format. For example:

query               target  seqid  qs  qe    target_s   target_e     qlen
comp111716_c0_seq1    Chr1  95.24  0   253   33596559   33596305     253
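
Reading these lines back into Ruby could look like the sketch below. It assumes the eight tab-separated fields arrive in the order given by the --ryo string above. The `Hit` struct and the `parse_hit` helper are hypothetical names used for illustration, not part of the gem.

```ruby
# Hypothetical container for one line of --ryo output; field order follows
# "%qi\t%ti\t%pi\t%qab\t%qae\t%tab\t%tae\t%ql\n".
Hit = Struct.new(:query, :target, :identity,
                 :q_start, :q_end, :t_start, :t_end, :q_len)

# Parse a single tab-separated line into a Hit (assumed helper, not gem API).
def parse_hit(line)
  f = line.chomp.split("\t")
  Hit.new(f[0], f[1], f[2].to_f,
          f[3].to_i, f[4].to_i, f[5].to_i, f[6].to_i, f[7].to_i)
end

example = "comp111716_c0_seq1\tChr1\t95.24\t0\t253\t33596559\t33596305\t253\n"
hit = parse_hit(example)

# In the example line target_s is larger than target_e, so any downstream
# code should not assume t_start <= t_end.
reversed = hit.t_start > hit.t_end
puts "#{hit.query} on #{hit.target}, reversed target coordinates: #{reversed}"
```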

Then we can find the fraction of the contig that aligned by computing (qe - qs) / qlen, and multiply it by the %seqid. Next, look up the target region in the gtf: sum the lengths of all exons that are wholly or partially covered by the exonerate hit between target_s and target_e, and express this as a percentage coverage of the transcript. So the score has to take into account the %seqid, the percentage of the contig that aligned to the genome, and the percentage of the aligned contig that aligned to a transcript in the gtf.
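
One way the three components might be combined is sketched below. Several things here are assumptions rather than anything the notes above settle: the three factors are simply multiplied together, exons are passed as [start, end] pairs for a single transcript in 1-based inclusive gtf coordinates, the exonerate target coordinates are treated as 0-based in-between positions, and "coverage" is read as the exonic length of exons touched by the hit divided by the transcript's total exonic length. All method names and the exon coordinates are made up for illustration.

```ruby
# Fraction of the contig that aligned to the genome: (qe - qs) / qlen.
def aligned_fraction(q_start, q_end, q_len)
  (q_end - q_start).abs.to_f / q_len
end

# Fraction of a transcript's exonic bases that lie in exons wholly or
# partially covered by the hit.
# exons: [[start, end], ...] for ONE transcript, 1-based inclusive (gtf style).
# t_start/t_end: exonerate target coordinates, assumed 0-based in-between,
# possibly given in descending order.
def transcript_coverage(exons, t_start, t_end)
  lo, hi = [t_start, t_end].minmax
  total = exons.sum { |s, e| e - s + 1 }
  return 0.0 if total.zero?
  # The hit covers 1-based positions lo+1..hi under the assumption above.
  touched = exons.select { |s, e| s <= hi && e > lo }
  touched.sum { |s, e| e - s + 1 }.to_f / total
end

# Assumed combination: plain product of the three factors, giving 0.0..1.0.
def contig_score(identity, q_start, q_end, q_len, exons, t_start, t_end)
  (identity / 100.0) *
    aligned_fraction(q_start, q_end, q_len) *
    transcript_coverage(exons, t_start, t_end)
end

# Hypothetical exons for illustration only; the hit is the example line above.
exons = [[33596000, 33596400], [33597000, 33597100]]
score = contig_score(95.24, 0, 253, 253, exons, 33596559, 33596305)
puts format("score = %.4f", score)
```

A product keeps the score between 0 and 1 and drives it to zero when any one component is zero. Whether that is the right behaviour, or whether a weighted combination would correlate better with the transrate score, is left open here.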
