Code Monkey home page Code Monkey logo

ecmp-rf's Introduction

##ECMP-RF

Prediction of extracellular matrix proteins by fusing multiple feature information, elastic net and Random Forest algorithm.

The experimental environment is: Windows Server 2012R2 Intel (R) Xeon (TM) CPU E5-2650 @ 2.30GHz 2.30GHz with 32.0GB of RAM, MATLAB2014a and Python3.6 programming implementation.

###ECMP-RF uses the following dependencies:

  • MATLAB2014a
  • python 3.6
  • numpy
  • scipy
  • scikit-learn

###Guiding principles:

**The data contains training dataset and testing dataset.

**Feature extraction: AD is the implementation of autocorrelation descriptor. EBGW is the implementation of encoding based on grouped weight. PsePSSM is the implementation of pseudo position specific scoring matrix. PseAAC is the implementation of pseudo amino acid composition. LD is the implementation of local descriptor.

**SMOTE: SMOTE_R_train_test.R is the implementation of SMOTE

** Feature selection: LOOCV_EN.py represents the elastic. LOOCV_KPCA.py represents KPCA. LOOCV_LASSO.py represents LASSO. LOOCV_PCA.py represents PCA. LOOCV_MI.py represents MI. LOOCV_LR_selection.py represents LR. LOOCV_FA.py represents FA.

** Classifier: LOOCV_AdaBoost.py is the implementation of AdaBoost. LOOCV_GBDT.py is the implementation of GBDT. LOOCV_LR.py is the implementation of LR. LOOCV_Naïve Bayes.py is the implementation of Naïve Bayes. LOOCV_RF.py is the implementation of RF. LOOCV_SVM.py is the implementation of SVM.

You can download the datasets from the corresponding website. After that, you should prepare the data used in the training code according to the steps below. Firstly, feature vectors are obtained by feature extraction. Then, the fused feature vector is processed by SMOTE. We use the problem as an example. Problem is the balance data after training data set feature fusion and SMOTE processing.

For LOOCV_RF.py

Input: problem.csv

Output: result_problem.csv, ytest_problem.csv, yscore_problem.csv

The result_problem.csv contains seven evaluation indexes: acc, precision, npv, sensitivity, specificity, mcc, roc_auc

ecmp-rf's People

Contributors

qust-aibbdrc avatar

Stargazers

 avatar

Watchers

James Cloos avatar  avatar

Forkers

harel-coffee

ecmp-rf's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.