Code Monkey home page Code Monkey logo

mlift's Introduction

mlift

A mummer based genomic coordinate liftover tool

This is a working but prototype version so use this with caution. If you provide a genome coordinate bed file (or simply a tab delimited file), mlift will lift the coordinates of ref genome to query genome (ref and query genome assignments are arbitrary and follow the MUMmer convention). If you use it, and has a feedback, please email me at [email protected].

  1. Install
   Make
  1. Run nucmer

The parameters of nucmer depends on how you want to lift your coordinates. If you want to ignore duplicates, use

  nucmer --prefix ref2q ref.fasta query.fasta

  delta-filter -r -q re2q.delta >ref2q.rq.delta

If you want to get all copies, use

   nucmer --prefix ref2q.mm ref.fasta query.fasta
  1. Run mlift
   mlift ref2q.delta foo.bed s

Here the delta file is the delta alignment from the nucmer run in step 2. You can use 's' or 'l' as the mode. When your complete interval is not present in the alignment, 's' will contract the interval to fit it inside the alignment, whreas 'l' will write 'NA' for your coordinate that is not covered in the alignment. I use 's' when I have to find partial of full gene duplications using mlift. The bed file should have the ref genome coordinates in the following tab separated format -

Chromosome_name	Start	End

Output: The output will be in a file named 'lifted.txt'. It has 5 columns, the last two columns give you the sequences.

mlift's People

Contributors

mahulchak avatar

Stargazers

 avatar Ethan Shealy avatar  avatar  avatar Nick Cauldron avatar  avatar Songtao Gui avatar peterdfields avatar

Watchers

James Cloos avatar  avatar

mlift's Issues

Empty query coordinates in lifted.txt? (request example .bed file)

I'm seeming to run into trouble with the bed file. Could we get an example for what is expected in the bed file? Is it simply the chromosome name the start BP (1) and then the length of said chromosome?

For example my bed starts like this
1 1 178720847
2 1 109423767
3 1 121101644
4 1 101460260

Am I interpreting this correctly?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.