Code Monkey home page Code Monkey logo

pangenome_analysis's Introduction

Pangenome graph analysis

Parse a pangenome graph and overlap given annotations of genes and TEs for every sample in the graph. For all genes, define orthogroups based on their synteny on the pangenome graph. Plot the variation for the whole pangenome and for repeats at the nucleotide level, and for all genes and a given subset of genes at the orthogroup level.

Pan_orthogroups-01

Installation

git clone https://github.com/Umbel89/pangenome_analysis.git

Usage

The whole pipeline is executed by running the main.py script:

python main.py --help

usage: main.py [-h] -i FILE -g STR -r STR -e STR -o STR [-c STR] [-t FLOAT]

Parse and annotate a pangenome graph, and plot its variation.

  -h, --help            show this help message and exit

required arguments:
  -i FILE, --input_gfa FILE
                        File location of pangenome graph gfa.gz file.
  -g STR, --gene_gff_dir STR
                        Directory with the gene gff files for all samples in the graph. dir/[sample]*.gff3
  -r STR, --repeat_gff_dir STR
                        Directory with the repeat gff files for all samples in the graph. dir/[sample]*.gff3
  -e STR, --effector_dir STR
                        Directory with txt files of a subgroup of genes, one gene_id per line, for all samples in the
                        graph. dir/[sample]*.txt
  -o STR, --output_dir STR
                        Directory where output will be written.

optional arguments:
  -c STR, --input_chrom STR
                        Specify a chromosome to be parsed. [default=all]
  -t FLOAT, --cluster_threshold FLOAT
                        Define orthogroups of genes that their distance is bellow this threshold. [default=0.6]

Cite

Skiadas, P. et al. (2024), Unpublished: Pangenome graph analysis reveals extensive effector copy-number variation in spinach downy mildew

pangenome_analysis's People

Contributors

umbel89 avatar

Watchers

 avatar

Forkers

teammge

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.