Comments (4)
I have another question.
In the vec5_CTC.txt file, the dim of amino acid is 13. However, in 'x_list.pt', I found that the dim of amino acid is 7.
There is a mismatch!
from high-ppi.
Sorry to bother the authors! I found that author do the feature selection which is written in Supplementary Method 1.
from high-ppi.
In the vec5_CTC.txt file, each amino acid have a corresponding vector. What is the basis for determining these vectors? Or is it according to the conventions of a previous project?
Actually according to the conventions of a previous project. I tried to reproduce the baseline results, so I read the PIPR paper and find that.
The amino acid embeddings are composed of two parts. One part is obtained by pretraining the skip-gram model on the SHS148k protein sequences, which is 5 dimensional vector. The other part is obtained by a categorization of electrostaticity and hydrophobicity for the amino acid, which is 8 dimensional vector.(but the original paper says 7 dimensional vector, doesn't matter)
All the details can be found in this baseline model paper PIPR, section4.5 Amino acid embeddings.
Paper name: Multifaceted protein–protein interaction prediction based on Siamese residual RCNN
from high-ppi.
Hello, Haitao, thanks for your interest. Yes, your interpretation is consistent with our approach. To conveniently introduce chemical information, we used seq2vec pre-trained embeddings that can be directly assigned to various amino acids. An advanced approach is to consider the pre-trained representations of ESM-2 in the pre-processing stage to increase the model generalization. We just noticed that you have raised some other questions. Please give us some time to check and reply to you. Thanks!
Best regards
from high-ppi.
Related Issues (13)
- Questions about the Metrictor_PPI function HOT 2
- Questions about the gnn_models_sag.py HOT 4
- Questions about the Data Processing for New Datasets HOT 1
- Could you provide PDBs of SHS27k dataset?
- Question of ppi label HOT 1
- Questions about the importance of residues HOT 1
- Environment not working
- environment.yml HOT 8
- F1 HOT 4
- How to generate the Fig.2a? HOT 1
- Question about how to obtain the PDB files.
- error of generate_adj.py and generate_feat.py
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from high-ppi.