Code Monkey home page Code Monkey logo

molecule_structure_generation's Introduction

molecule_structure_generation

Target specific de novo drug design using Transformer neural network.

Folders:

/data The folder contains datasets used for training and evaluation of the model. To obtain these datasets we downoaded full version of BindingDB and ran BindingDB_dataset_preparation.ipynb (see scripts). Datasets contain only human proteins. Vocabulary file was used to encode data.

/data_4_organisms The folder contains extended datasets with proteins from human, bovine, rat and mouse and corresponding vocabulary file.

/scripts The folder contains IPython notebooks with raw data preparation, model training and decoding from it, model evaluation and results visualization.

molecule_structure_generation's People

Contributors

dariagrechishnikova avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

molecule_structure_generation's Issues

Request to add the script for ROC-AUC curve calculation

Hi Daria. Thanks for sharing the code with the drug design community!
In your associated article from Scientific Reports, there are a few ROC-AUC curves provided for evaluating the discriminating ability of SMINA docking software.
I had a few questions regarding how the thresholding works and what are the class labels for the datasets compared in the figures. It would be really useful if you can share the scripts for the curve calculation from docking scores or if you can explain how the curves are plotted from the docking scores along with the associated hypotheses.

Can't open the codes

Hello, I read your paper and want to check your model structures.

However, the model_training_and_decoding.ipynb file returns "reach display cannot be rendered" error message.

Could you check the file please?

Fix dependency versions

I tried to replicate the code, but with the current versions of the used libraries, there are a lot of incompatibility issues.
Could you please fix the version of the dependencies in the code, such that it is possible to run it.

Eg.
"pip install tensor2tensor==1.15.7" instead of "pip install tensor2tensor"

train_4_org_1042_104 shows 1514 unique proteins instead of 1042

Hello @dariagrechishnikova, in data_4_organisms , the number of unique train proteins in dataset 1, i.e., train_4_org_1042_104/proteins_train_4_org_corrected , is 1514 and, the number of unique test proteins in dataset 1, i.e., test_4_org_1042_104/proteins_test_4_org_corrected , is 99, instead of 1042 and 104 respectively. Requesting for the correct dataset. Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.