Skip to content

Graph Neural Network (Convolution , Node Attention , Node+Edge Attention) to assess quality of protein-protein complexes.

Notifications You must be signed in to change notification settings

yanistazi/Graph_Neural_Net_Protein-Protein-Complexes

Repository files navigation

Graph Neural Network + Attention mechanism to predict scoring functions (i-RMSD) for protein complexes and decoys.

Installation :

Make sure to create a dedicated environment as follow :

conda env create --name <YOUR_ENV_NAME> --file=environment_graph_predictions.yml

Tutorial for the data preparation, gridsearch training , testing and inference are available in this repository

alt text

1. Fully Automated data preparation pipeline that creates balanced graph datasets from PDB protein complexes and decoys files

alt text

2. Automated gridsearch for graph neural net architecture selection (Convolution, Node Attention, Edge Attention, Node+Edge Attention, customizable);

optimizer selection; possibility to train from scratch/resume training/transfer learning; feature selection alt text alt text

3. Automated testing pipeline that returns summary of the output, predictions and metrics.

alt text

4. Inference / Scoring pipeline returning the prediction on raw pdb files.

alt text

Conclusion : Precision within 2 A is reached using attention at both the node and the edge level to leverage complex interaction patterns between the nodes. Further training and architectures/hyperparameter exploration are required and might lead to performance improvement.

alt text alt text

Future work :

  • augmented training dataset
  • introduction of more features (pssm,depth,hse)
  • hyperparameter gridsearch exploration
  • use pretrained model as a feature embedding
  • deeper version of Edge + Node attention network
  • energy scoring functions

About

Graph Neural Network (Convolution , Node Attention , Node+Edge Attention) to assess quality of protein-protein complexes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published