Code Monkey home page Code Monkey logo

fastretriever's Introduction

FastRetriever

A python script used to extract specific sequences from a fasta file based on ID or keyword.

Background

Next Generation Sequences produce large fasta files that are too large to be handled in graphical user interface (GUI) text editors. Quite often these files need to be filtered based on ID list produced by other tools' output (gene expression results, anootation results, etc.). This script provides a simple GUI to allow researchers to retrieve specific sequences from big fasta files (genome/transcriptome assemblies) without the hassle of using Unix command line scripts and tools.

Usage

Run the script get_from_fasta_gui.py from command line or your favourite Python IDE.

Requirements

  1. List of sequence IDs, one in a line, in a csv file
  2. Fasta database to look for the sequences
  3. Python 2.7 with easygui and BioPython packages installed
  4. Graphical terminal access (windows on linux machines).

TODO

  1. Add exception handling for long fasta headers
  2. Add exception handling for program abort (no input files selection)
  3. Tidy up functions
  4. Keyword list support
  5. Command line version

fastretriever's People

Contributors

idobar avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.