Skip to content

Latest commit

 

History

History
106 lines (56 loc) · 7.87 KB

README.md

File metadata and controls

106 lines (56 loc) · 7.87 KB

Dock-MD-FEP

################ PLEASE NOTE ################################################################

The current workflow may be unable to deal with multiple chained PDB bank protein structures

  • This Colab version notebook is currently experiencing some instability due to dependencies and incompatible issues. This will be solved later, for now, you are advised to do this Dock-MD-FEP separately with three independent notebooks and

  • The third stage is Binder (This is only a test to assure yourself that the FEP is good to go, to run the actual job please use a GPU platform. Remember to change the platform to CPU during the test and back to CUDA in the real job).

  • The second stage is a free MD (https://github.com/pablo-arantes/making-it-rain/blob/main/Protein_ligand.ipynb) or at least an MD input generation process that you can finish from here Binder ,

  • The very first stage you can finish from here badge ############################################################################################

Installing the workflow on a Linux computer

Using the code locally, you only need to do the installation once so that you can skip the installation cell afterwards.

RTX30 series or a better GPU is recommended for the local computer.

This workflow requires a dedicated Python 3.8 environment before installation.

conda create --name Dock-MD-FEP python=3.8
conda activate Dock-MD-FEP
conda install jupyter --yes
(Dock-MD-FEP) jupyter lab Dock-MD-FEP_local_installation.ipynb

Several little things should be modified compared to using this online, like the path name etc. Smina instead of Gnia is used because of possible GPU incompatibilities. The rest of the code remains the same.

There might be an error from the first import cell, but it doesn't matter for the dock and MD simulation, it may only affect analysis later which you could do locally alternatively.

############################## Updated on 24/12/2022####################################

To use the workflow online with Colab

$Google_Drive_Path is the path in your Google Drive where you want to store the simulation data, and you should provide it. image

You can use this workflow purely for docking purposes, docking plus MD simulation, or docking, MD and absolute binding free energy calculation.

image

After you've imported Google Drive, just fill in the four lines that define your working path, providing your PDB ID and small molecule structure, and the rest will run automatically with just one click.

image


default parameters (Change simulation time to be longer is necessary for a proper simulation)

-- MD , all MD input use Amber Gaff2 force field to deal with small molecule.

Up until before FEP simulation, the workflow procedes with one MD (MD0), one docking(dock1), MD(1), then the second docking (docking2), and a final MD(MD2), it will cost a total 25 ns of MD simulation.

MD 0 , equilibration 1 ns, production 2 ns. OpenMM as simulation engine.

MD0 result in $Google_Drive_Path (PDB bank structures MD result)

1st docking result in $Google_Drive_Path and $Google_Drive_Path/MD1

MD 1,  equilibration 2 ns, production 5.0 ns. OpenMM as simulation engine.

MD1 result in $Google_Drive_Path/MD1 (Docked small molecule MD result)

2nd docking result in $Google_Drive_Path/MD1 and $Google_Drive_Path/MD1/MD2

MD2, equilibration 5 ns, production 10 ns. OpenMM as a simulation engine.

MD2 result in $Google_Drive_Path/MD1/MD2 (Docked small molecule MD result)


Dock, gnina, --exaustiveness=200. The best pose with docking score and best CNN score was carried on to FEP 

FEP, stop the simulation when reaching error < 0.1 KT, Simulation engine OpenMM with Yank python library.

image

Free energy of binding(benzene) : -11.229 +- 0.352 kT (-6.694 +- 0.210 kcal/mol)

Restart the simulation

Open the Dock-MD-FEP-restart.ipynb, after pasting the working directory path of last simulation. Import the drive manually, then run all the other cells at one time.

Trouble shooting

A CUDA version related error CUDA_ERROR_UNSUPPORTED_PTX_VERSION (222) Please refer to #2 or the openMM issue link openmm/openmm#3585

Those with access to AI studio can try this workflow with automatic analysis. (Extend the simulation steps if you have enough GPU time)

The user is expected to upload the ligand.mol2 + starting_end.pdb to the default path, then submit the job to the queue.

https://aistudio.baidu.com/clusterprojectdetail/7677668

The FEP section of this workflow relies on Yank, which built on OpenMMM with customirised potential, for a general OpenMM free energy calculation see https://openmm.github.io/openmm-cookbook/latest/notebooks/tutorials/Alchemical_free_energy_calculations.html