Code Monkey home page Code Monkey logo

dimig2's Introduction

DimiG2 for inferring disease-associated miRNAs

The microRNAs (miRNAs) play crucial roles in many biological processes involved in diseases and miRNAs function with protein coding genes (PCGs). In this study, we present a semi-supervised multi-label framework to integrate PCG-PCG interactions, PCG-miRNA interactions, PCG-disease associations by integrating disease hierarchy into graph convolutional network (GCN). DimiG is then trained on a graph, which is further used to score associations between diseases and miRNAs.

software dependency

Installation of GCN

Here we modified the orginal GCN (https://github.com/tkipf/pygcn) to support multi-label learning.
python setup.py install

Data depedency:

We aleardy uploaded some data used in this study to the repository under the directory data/, and other big files can be accessed as belows:

  • PCG-PCG interaction file "9606.protein.links.v10.txt.gz" can be downloaded from STRING v10 database.
  • Disease-PCG assications file "human_disease_integrated_full.tsv" can be downloaded from DISEASES database. We also upload the file human_disease_integrated_full.zip in this repository, please decompress it at directory data/.
  • PCG-miRNA interaction file "9606.v1.combined.tsv.gz" can be downloaded from RAIN v1.0 database.
  • GTEx_Analysis_2016-01-15_v7_RNASeQCv1.1.8_gene_median_tpm.gct.gz from GTEx website
  • gencode.v19.genes.v7.patched_contigs.gtf.gz from GTEx website
  • The above five files need be saved at dir "data/".

dimig2's People

Contributors

xypan1232 avatar

Stargazers

PhuPing avatar

Watchers

 avatar

dimig2's Issues

reproduce results

Hey,

I'm currently trying to reproduce your results using DimiG2 but it seems there are some files missing.
It would be nice if you could confirm or correct my assumptions.
Am I right that:

  • loss: the loss.ClassificationLoss is from loss.py from your repository NeuralNLP-NeuralClassifier with loss_type=LossType.BCE_WITH_LOGITS? So that LossType.BCE_WITH_LOGITS needs to be added to the list in line 125 as use_hierar=True
  • pygcn: should be the modified version from DimiG 1.0 (without any other changes)?

I run the model like that and got an AUC of 0.753 which does not match the reported semi-supervised results with an AUC of 0.765, which is why I'm wondering if anything is missing or not set up right.

Please correct me if any of my assumptions are wrong or I'm missing anything else.

Best regards and thanks in advance!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.