Code Monkey home page Code Monkey logo

pathifier-tool-kit's Introduction

Pathifier Tool Kit

DOI

Citing Information

If you use this code, please consider citing it as: M.A. García-Campos. (2018, March 17). AngelCampos/Pathifier-Tool-Kit: v1.0 (Version pathifierTools_v1.0). Zenodo. http://doi.org/10.5281/zenodo.1202186

Description

This repository has been developed to help users to run Pathifier and visualize its results in an interactive manner. The code may need modifications according to user's needs.

Pathifier is an algorithm designed to model the heterogeneity in samples based on their genomic data, e.g. gene expression levels, in a pathway centered manner (Drier et al., 2013). It does it by calculating the level of deregulation compared to a control group taking into account all the genes involved in a specific pathway. In this manner it is possible to represent samples with fewer, but more informative variables, based on biological knowledge of the biomolecules that work togheter in a pathway (e.g. apoptosis).

The following scripts may be used for easier loading of data, and calculation of default arguments not implemented in the native functions. As well as attractive plotting of the results of the deregulation analysis.

1_Run_Pathifier.R

An R script for running Pathifier (Drier et al., 2013) in R.

First file required is TAB separated file of the gene expression matrix in the following format:

NAME Sample1 Sample2 Sample3 Sample n...
NORMALS 1 1 0 ...
Gene1 1.25 1.56 2.56 ...
Gene2 4.56 0.25 1.75 ...
Gene3 1.56 3.21 1.36 ...
GeneN ... ... ... ...

1st row = Sample names 2nd row = Normal status of samples: 1 = Healthy, 0 = Tumor 1st col = Gene names from third row on (Gene symbol format is recommended)

Second file required by the script is a TAB separated file of the pathway information in GMT format:

PATHWAY1 Blank cell GeneA GeneB GeneC etc...
PATHWAY2 Blank cell GeneD GeneA GeneX etc...

Notes:

  • NO HEADER ROW!
  • Gene names used must coincide between expression matrix and pathway annotation
  • GMT format: Each row represents a pathway. 1st column is pathway name or id, 2nd column it's discarded BUT needed, from the 3rd column on starts the genes belonging to that pathway.
  • The PDS stability procedure is set to 10 for illustrative purposes (default should be 100),

Test-data is provided on TEST-DATA directory

Powered by package(s): "pathifier"


2_Heatmap_4_PDS.R

An R script for showing results of Pathifier (Drier et al., 2013) in a heatmap, showing the deregulation of each sample by pathway.

This script works directly with the results of the "1_Run_Pathifier.R" script and the stand-alone version of Pathifier, adding a vector for depicting normal samples (See additional notes).

The last update of the heatmap adds sample labeling for normal (healthy) vs tumoral samples

Output preview (using EXAMPLE_DATA) alt text

Powered by package(s): "gplots", "RColorBrewer"


3_Curves_4_PDS.R

An R script with 2 functions for plotting the principal curves derived of the pathway deregulation analysis using Pathifier (Drier et al., 2013)

This script works directly with the results of "1_Run_Pathifier.R" script and the stand-alone version of Pathifier, adding a vector for depicting normal samples (See additional notes).

Output examples (using EXAMPLE_DATA)

alt text

alt text

Powered by package(s): "plotly", "RColorBrewer"


Additional notes:

Pathifier Stand-alone version: http://www.weizmann.ac.il/complex/compphys/software/yotam/pathifier/installation.html

R-version of Pathifier is available via Bioconductor: http://bioconductor.org/packages/pathifier/

For Pathifier support, please contact the maintainer of the package Assif Yitzhaki

pathifier-tool-kit's People

Contributors

angelcampos avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

pathifier-tool-kit's Issues

Using RNA-seq

Sorry can I use your software Pathifier for RNA-seq? If yes, should I use raw read counts, normalized or RPKM values?

Thank you much for any information

I have used normalised values from DESeq2 on my RNA-seq data but I get this error

> min_exp <- quantile(as.vector(exp.matrix), 0.1) # Percentile 10 of data
Error in quantile.default(as.vector(exp.matrix), 0.1) : 
  missing values and NaN's not allowed if 'na.rm' is FALSE

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.