Code Monkey home page Code Monkey logo

alt-in-tool's Introduction

ALT-IN


ALT-IN is the instrument for predicting disruptions in protein-protein interactions induced by alternative splicing. We recommend usage of Docker version of our tool which is available on DockerHub repository (https://hub.docker.com/r/narykov/alt-in) and has a convenient interface (altin_docker.py)

DOI

INSTALLATION


DOCKER INTERFACE

The ONLY script file you would need to run Docker image is altin_docker.py. The prerequisite is existing Docker installation (the image would be downloaded automatically), and example data files. You would be able to run example just by executing the following commands:

python3 altin_docker.py -i test/interactors.tsv -o res.txt -f test/diabetes_ensembl_protein.fa test/string_protein.fa

Download would take approximately 10 min on stable internet connection. This is an interface script for Docker container. The ONLY dependency for it is preinstalled Docker with corresponding permissions.

OPTIONS
  • -h, --help

    Prints brief usage information.

  • -i, --interactors

    A tsv file containing triplets in format "reference_isoform", "interactor", "alternative_isoform". You can use test/interactors.tsv file for the reference.

  • -d, --output-dir

    Output directory where output would be stored. Permissions should allow for Docker to bind to this location. Default value is current directory

  • -o, --output-file

    Name of the output file. Would be stored in directory specified by "--output-dir" option.

  • -f, --fastafiles

    A list of fasta files, which should contain sequences for all IDs occuring in interactors file. Number of files you may specify is not limited.

STANDALONE

Read the following information if you wish to install ALT-IN Tool without Docker as a standalone package. Standalone version of ALT-IN Tool depends on 2 software packages - EMBOSS and INTERPRO. Installation was tested on Ubuntu 16.04 with Python 2.7 version of Anaconda.

EMBOSS

Direct download - ftp://emboss.open-bio.org/pub/EMBOSS/EMBOSS-6.6.0.tar.gz

Could be downloaded from the EMBOSS website. In our package version 6.6.0 was used.

InterPro

Direct download - ftp://ftp.ebi.ac.uk/pub/software/unix/iprscan/5/5.27-66.0/interproscan-5.27-66.0-64-bit.tar.gz

Could be downloaded from the InterPro website. In our package version 5.27-66.0 was used.

CONFIGURATION

You would need this step only if you want to change sequence alignment (currently EMBOSS implementation is used) or domain detection (currently performed by SUPERFAMILY).

We require two files to be accessible - pepstat from EMBOSS and ass3.pl from SUPERFAMILY. If they are in the system path (i.e., accessible from the command line), no further actions required.

For each package that you cannot access from command line fill in corresponding line in 'config' file. E.g., to provide path for INTERPRO fill in empty space on the second line of the file: INTERPRO_PATH='~/Downloads/interproscan-5.27-66.0'

RUNNING

To run our tool please use command in format

python altintool.py [interactors_file.tsv] [fasta_file_1, fasta_file_2, ...] [output_file]

E.g., to run our test data you can use command

python altintool.py test/interactors.tsv test/diabetes_ensembl_protein.fa test/string_protein.fa test/results.txt

INPUT


To make predictions you would need at least 2 files:

  • Triplets consisting of the IDs of (Main isoform, Interacting partner, Alternatively spliced isoform) in tab separated format (.tsv)
  • Fasta file(s) with protein sequences for the aforementioned protein IDs

SUPPORT


For questions, please email to the [email protected]

alt-in-tool's People

Contributors

alexandrnp avatar dbogatov avatar

Watchers

 avatar  avatar  avatar

alt-in-tool's Issues

Runtime &results on example

Hello,

I tried your tool on the example dataset and it died tue to timeout in the pairwise alignment phase.
I extended the value to 300 but still it died for this line :

"ENSMUSP00000009538 ENSMUSP00000009538 ENSMUSP00000016664"

In this other line an error pops up due to the alignment did not output any value ('[]'),
Is it expected ?

ENSMUSP00000100212 ENSMUSP00000100212 ENSMUSP00000137424

Should I switch to the commented code in the alignment part and use localmd vs globalmd? would it affect the performances ?
alignments = align(isoform_seq, main_iso_seq, ratio)#pairwise2.align.localmd(isoform_seq, main_iso_seq, 5, -4, -100, -10, -10 * ratio, -1.5)

Then I just wanted to ask you what is the expected classification for the example cases [1,0] so that I know that that the output I am getting is the correct one.

One final question, looking in the extract_features.py
n_termini = 1 if sum(s2[:20] == '-') == 0 else (-1 if sum(s2[:20] == '-') > 12 else 0)
This would output 1 if she sum is '0', 0 if it is >0 and <= 12 :, and 2 if the sum is > 12.
Is this the expected behavior?

Thanks for your help!

Mattia

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.