Code Monkey home page Code Monkey logo

deepace's Introduction

DeepACE

Angiotensin-I converting enzyme (ACE) plays an important role in maintaining the balance of the angiotensin system and regulating blood pressure in the body. Due to its importance, ACE has also become the target of many drugs to treat hypertension and cardiovascular diseases. Among them, the strength of ACE inhibitory activity will affect the effect of the drug. How to accurately identify peptides with strong ACE inhibitory activity from ACE inhibitory peptides is of great significance. this study proposes a novel and high-accuracy prediction model (DeepACE) developed using generative adversarial capsule network (CapsuleGAN), to accurately identify peptides with strong ACE inhibitory activity. For DeepACE, we first utilized a wide range of feature encoding schemes derived with two main types of feature encoding strategies, including hand-crafted feature encodings and pre-trained protein language models, to extract the hidden interrelated information embedding in ACE inhibitory peptides. Subsequently, these extracted features were fused and used to develop a predictor based on the CapsuleGAN model for identifying peptides with strong ACE inhibitory activity. Finally, to enhance the model performance we performed the dimensionality reduction on the fused features using the principal component analysis (PCA) method. Comparative analysis results show that in independent tests, DeepACE's prediction performance is significant, with a balanced accuracy of 0.984 ± 0.039, an MCC of 0.968 ± 0.079, and an AUC of 0.995 ± 0.014, which are significantly better than existing methods. In addition, DeepACE was able to attain a superior prediction performance compared with well-known machine learning-based and deep learning-based classifiers.

Requirements

The majoy dependencies used in this project are as following:

Python 3.9.18
fair-esm 2.0.0
pandas 1.3.5
numpy 1.21.6
scikit-learn 1.0.2
torch 1.13.0+cu116
tensorflow 2.6.0
keras 2.6.0

If other libraries are missing, you can install them yourself using the command pip install package_name==2.0.0

Model performance

ROC curve PR curve

Usage

Note: The dataset uses 0 and 1 to represent high activity and low/no activity respectively.

Extract features

In the iFeature_exteactor file, traditional feature extraction methods are provided, which can be modified according to your own needs. The ESM and PortT5 file provides extraction methods for ESM-2 and PortT5 features, and you can extract them according to your needs.

Feature processing

The feature processing method is in the extra_features folder, which includes feature fusion and feature selection. You can process the extracted features according to your own needs.

Model prediction

In the fusion_features folder, there are a variety of machine learning models and deep learning models. Such as: LR, SVM, MLP, CNN, DNN, etc. Models can be trained and tested by uploading your own data sets.

Further adjustments and modifications

Feel free to personalize it. Just scroll down to the model architecture section and modify it to suit your expectations.

deepace's People

Contributors

zhou-jianren avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.