vcf2DIYABC.py

Purpose

A tool that converts a VCF file into a DIYABC input file for SNP data.

Requirements

  • Python 2.x
  • A version compatible with Python 3.x is available on the branch python3.x
  • Windows users: This script cannot run unless Python is installed on your machine. I strongly recommend that you install it by following this link, since it will probably be useful for many other bioinformatics purposes.

TODO: I am currently trying to produce a self-contained executable with py2exe, so that the script can run without a Python installation.

Download/Installation

Unix users can clone this repository if git is installed on their system with:

git clone https://github.com/loire/vcf2DIYABC.snp.git

A link to a zip archive of this repository is on the bottom right of this page.

To install, just unzip the archive in a local directory.

Usage

  • Unix and Mac users can open a terminal and type
python vcf2DIYABC.py
  • On Windows, navigate in the file manager to the folder where you downloaded and uncompressed the repository, then right-click the script file to execute it.

Once launched, the program will ask you to enter the path and name of your input files.

Input files

  • A VCF file (tested on version 4.0 of this file format)
  • A text file that specifies the sex and population of origin of each individual. This information is not present in a typical VCF file, but DIYABC analysis needs it. The file contains one line per individual. The first column is the individual ID (the same name as in your VCF file), the second is the sex ("M" or "F" in uppercase, "9" if the data is missing), and the third is the population name.
  • Example:
NA00001	M pop1
NA00002 9 pop2
NA00003 F pop2
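
The three-column layout described above can be checked before running the conversion. The following is a minimal sketch, not the script's own parser: the function name `parse_individuals` is hypothetical, and it assumes that columns are separated by any run of spaces or tabs.

```python
def parse_individuals(lines):
    """Parse 'ID SEX POPULATION' lines into a dict {id: (sex, population)}.

    Hypothetical helper for illustration; assumes whitespace-separated columns.
    """
    valid_sexes = {"M", "F", "9"}
    individuals = {}
    for lineno, line in enumerate(lines, 1):
        fields = line.split()  # splits on any run of spaces or tabs
        if not fields:
            continue  # skip blank lines
        if len(fields) != 3:
            raise ValueError("line %d: expected 3 columns, found %d"
                             % (lineno, len(fields)))
        ind_id, sex, population = fields
        if sex not in valid_sexes:
            raise ValueError("line %d: sex must be 'M', 'F' or '9', found %r"
                             % (lineno, sex))
        individuals[ind_id] = (sex, population)
    return individuals


# The example lines from this README.
example = ["NA00001\tM pop1", "NA00002 9 pop2", "NA00003 F pop2"]
individuals = parse_individuals(example)
```

To check a real file, pass an open file object instead of the list, for example `parse_individuals(open("individuals.txt"))`, where the file name is a placeholder.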

You can find an example of each file in the example directory of this repo. The names of individuals in the VCF file must match exactly those in the individuals information file.
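
Because the names must match exactly, it can help to compare them before launching the conversion. The sketch below is an illustration only and is not taken from the script: the helper names `vcf_sample_names` and `compare_names` are hypothetical. It relies on the VCF convention that the tab-separated `#CHROM` header line lists the sample names after nine fixed columns (CHROM through FORMAT).

```python
def vcf_sample_names(vcf_lines):
    """Return the sample names listed on the '#CHROM' header line of a VCF."""
    for line in vcf_lines:
        if line.startswith("#CHROM"):
            fields = line.rstrip("\r\n").split("\t")
            return fields[9:]  # the first 9 columns are CHROM..FORMAT
    raise ValueError("no #CHROM header line found")


def compare_names(vcf_names, individual_ids):
    """Return (names only in the VCF, names only in the individuals file)."""
    vcf_set, ind_set = set(vcf_names), set(individual_ids)
    return sorted(vcf_set - ind_set), sorted(ind_set - vcf_set)


# Made-up header for illustration; note the lowercase typo in the last ID.
vcf_lines = [
    "##fileformat=VCFv4.0\n",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT"
    "\tNA00001\tNA00002\tNA00003\n",
]
names = vcf_sample_names(vcf_lines)
only_in_vcf, only_in_file = compare_names(names, ["NA00001", "NA00002", "na00003"])
```

Both returned lists are empty when the two files agree. The comparison is case-sensitive, so in this example the mistyped ID shows up on both sides.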

Acknowledgment

This script was written by Etienne Loire in the CBGP lab of the French institute for agronomical research. It is free to use or modify under the GPL license.
