The server/software of the molecular docking analysis.
Abstract
Molecular docking is a bioinformatics-based theoretical simulation strategy. It is employed to study ligand-protein interaction profiles and predict their binding conformers and affinity through computational tools. Since the 1980s, computational tools have been used in the drug discovery process. The initial molecular modeling approaches available at the time focused on a rigid view of the ligand-protein interaction due to the limited computational capabilities. The advancement of hardware technology has made it possible to simulate the dynamic character of the ligand-protein interactions throughout time. The current chapter deals with an outline of the progression of structure-based drug discovery methodologies in the investigation of the ligand-protein interaction profiles from static to improved molecular docking strategies.
Keywords
- Molecular docking
- AutoDock
- Vina
- AutoDockFR
- iGEMDOCK
- Drug discovery process
- Virtual screening
1. Introduction
Docking tools have simplified the study of interactions between drug molecules and receptor proteins, DNA, or biological molecules [1]. These interactions take place covalently. Furthermore, critical molecular mechanisms, ligand binding approaches, and factors influencing the ligand-protein interaction profile can be estimated with the help of the docking results [2, 3]. Docking suites can be used to calculate the binding energies associated with the most stable conformation of drug-receptor interactions (Figure 1) [4, 5].
2. Types of docking
In 1982, Kuntz et al. developed the first molecular docking algorithm through the estimation of the released binding energy [6, 7]. Docking evaluations are performed to regulate the interaction profile between the ligand and target and to search for the most suitable conformation of the ligand in the complex. Empirical scoring functions are also explored, which transform binding energy into the docking score [8]. There are numerous free online tools available to generate 3D ligand and target interaction profiles, such as Biovia DSV, Pymol, Chimera, Rasmol, SwissPDB viewer, etc. Docking is broadly classified into three classes, discussed below:
2.1 Flexible docking
In flexible docking, the side chains of the protein and ligand are kept flexible. The general principle of flexible docking is based on the induced-fit hypothesis offered by Daniel Koshland in 1958 [9]. As a result, it is also known as “induced-fit docking,” in which the binding energies of various conformations of the proposed ligand are calculated at protein or receptor pockets [10, 11]. Furthermore, the target chain should be flexible enough to combine with the conformational modifications of the receptor and ligand. Various altered possible conformations of the ligand can be predicted, which makes it the most accepted and accurate technique, but it is time-consuming and costly at the same time [12].
2.2 Semi-flexible docking
In this approach, the ligand molecule is the only flexible element while the protein is rigid [13]. In addition to the six translational and rotational degrees of freedom, the conformational degrees of freedom of the ligand are also tested [14]. These approaches assume that a protein’s fixed conformation is capable of recognizing the ligands to be docked. As previously stated, this assumption is not always validated [15].
2.3 Rigid docking
In rigid docking, the main geometry of the target and ligand is retained and frozen during docking analysis [16]. The basis of this type of docking analysis is the ‘Lock and Key’ hypothesis, proposed by Emil Fischer in 1894 [17]. Thus, it is defined as lock and key docking, which also leads to several problems. The analysis of ligand-target docking is very significant for observing drug-target interaction, but a problem is associated with it when the ligand is docked at the pocket site of a receptor protein. Due to the rigid structure of both, observation of interactions becomes very challenging and the most suitable confirmation of ligand is not easily obtained [18]. Sometimes ligands do not enter the pocket site of a protein, leading to weak interactions that are not enough to show satisfactory results. Internal flexibility is necessary for good docking interaction. In various cases, the structural modifications that are essential for binding are negligible in rigid docking. Rigid docking is only enough to observe the interaction [11]. Some other benefits of rigid docking are its simplicity and a short period of run time.
3. Docking interactions
Docking is performed to establish the most suitable interaction profile for a ligand inside the target protein. It is also employed to estimate the energy evolved during the interaction between the ligand and protein [19]. Various forces influence docking interactions. The total energy released during these interactions is calculated through the empirical formula and displayed in the form of total binding energy [11, 18]. Based on the different forces, docking interactions are categorized as electro-dynamic forces (like van der Waals), electrostatic forces (charge-charge, dipole-dipole, and charge-dipole), steric forces (observed between closer molecules and influence the reactivity as well as the chemical reactivity), solvent-related forces (occurring due to interaction among the solvent and protein/ligand) and conformational modifications in the ligand) [20].
4. Types of energies
The preliminary objective of docking analysis is to obtain the best conformation of the drug during the drug-receptor interactions in support of the lowest binding free energy [21]. Molecular docking tools frequently calculate the scoring functions to evaluate the binding energies of drug-receptor interactions [11]. The resultant binding energy (ΔG bind) is calculated in the form of a combination of different energies such as H-bond, torsional free, electrostatic, unbound system’s desolation, total internal, dispersion, and repulsion, etc. The dissociation constant (K
An example of a scoring function is as follows:
The empirical scoring function of any docking program.
Binding Energy.
5. Docking algorithms
The docking algorithms display a new dimension to evaluating the interaction profile of the ligand-receptor complex [24]. It calculates all possible conformations of the ligand under investigation during the interaction with the receptor. It also delivers the most suitable conformational pose with minimum binding energy [24, 25]. The most common algorithms apply for various docking evaluations (Flexible, Semi-flexible, and Rigid Docking) are (Figure 2):
5.1 Flexible docking with single protein conformation
5.1.1 Side-chain flexibility docking
The side-chain flexibility docking approach introduces different conformations for various protein side-chains [26]. This is usually accomplished by utilizing rotamer library databases. Various docking approaches like GOLD use their search engine to sample some degrees of freedom. Large conformational fluctuations of the protein are ignored by these approaches due to side-chain flexibility [27].
5.2 Soft docking
In 1991, Jiang and Kim first described the soft docking strategy, which is based on the understanding of protein flexibility [28]. The VdW revulsion is also working in force field scoring functions because it reduces collisions and allows for more compact ligand-protein packing. In this method, an induced fit is recreated. As a drawback, this method can only simulate faint protein motions, which can lead to erroneous poses [24].
5.3 Flexible docking with multiple protein conformations
For the same target, multiple experimental structures may be offered [29]. Furthermore, computational approaches such as Monte Carlo or Molecular Dynamics simulations can be used to obtain an ensemble of protein conformations [30]. The goal behind multiple protein conformation docking is to consider all of the potential configurations by employing various strategies:
5.3.1 Individual conformations
The target structures are viewed as conformations that could be attached to the ligand. Therefore, several docking scores are undertaken, assessing the ligands on all of the target conformations [31]. Furthermore, to filter the structures, an initial standard to evaluate the presentation of distinct target structures in a docking investigation was also performed in individual conformations [32, 33].
5.3.2 United description of the protein
The structures are utilized to build the best-performing “chimaeras” protein instead of collapsing into an average grid [34]. Like FlexE, it selects structurally conserved areas from the ensemble’s structures to build a rigid configuration. This section is attached to the ensemble’s flexible portions in a combinatorial method, resulting in a pool of “chimaeras” that can be docked [35].
5.3.3 Average grid
The ensemble’s structures are combined to form a typical solitary grid [36].
5.4 Semi-flexible docking algorithm with simulation approaches
A well-known model of this class is molecular dynamics. This approach defines a system’s temporal evolution [37]. The molecular dynamics unit provides a more detailed explanation [38]. Energy-saving strategies are also included in this category, but these strategies are rarely utilized as standalone search engines [39]. Energy minimization is a local optimization approach for obtaining a system with certain potential energy [40].
5.5 Semi-flexible docking algorithm with stochastic methods
In this approach, the values of the degrees of freedom of a system are changed randomly rather than systematically like in stochastic algorithms [41]. The speed of these procedures is beneficial, as they might potentially locate the best answer very quickly. The main disadvantage of this approach is that it does not confirm a comprehensive investigation of the conformational space, which denotes the actual solution, which may be overlooked. Increase the number of iterations of the method to partially solve the lack of convergence. The following are the most well-known stochastic algorithms [42]:
5.5.1 Swarm optimization (SO) methods
Several swarm optimization approaches are based on the behavior of swarms [43]. The knowledge supplied by previously sampling good poses guides the sample of a ligand’s degrees of freedom. PLANTS use an Ant Colony Optimization (ACO) algorithm, which simulates the behavior of ants, and uses pheromones to find the quickest way to a food position [44]. Each degree of freedom is coupled with a pheromone in this system. Successful ants contribute to pheromone deposition, while virtual ants choose conformations based on pheromone values.
5.5.2 Evolutionary algorithms (EA)
The most prominent evolutionary algorithms are genetic algorithms (GAs), which are based on the idea of biological evolution [45]. The genes, chromosomes, mutations, and crossover concepts are all taken from biology. Genes are represented in the form of the degrees of freedom as well as ligand conformation, which is defined by a chromosome that is awarded a fitness score [46]. Within a population of chromosomes, mutations and crossovers occur, and the chromosomes with greater fitness survive and replace the ones with lower fitness. rDock, PSI-DOCk, AutoDock, and GOLD are the most well-known instances [46, 47, 48, 49, 50].
5.5.3 Tabu search methods
Tabu search strategies are used to avoid exploring zones of the conformational/positional space that have already been explored. At each cycle, random alterations are made to the ligand’s degrees of freedom. The previously sampled conformations are recorded, and a new stance is allowed only if it is distinct from any previously investigated pose. This category includes programs like PRO LEADS and PSI-DOCK [47, 51, 52, 53, 54].
5.5.4 Monte Carlo (MC) methods
The Metropolis Monte Carlo algorithm, which presents a recognized measure in the development of docking exploration, is the basis for Monte Carlo approaches [55]. Each repetition of the algorithm involves a casual adjustment of the degrees of freedom of the ligand. The Metropolis algorithm in its basic form, although it is implemented in a variety of ways in docking software, AutoDock Vina, MCDOCK, QXP, ICM, and AutoDock [30, 42, 44, 56].
5.6 Semi-flexible docking algorithm with efficient exploration techniques
In an efficient exploration, a collection of findings is associated with each degree of freedom, and all the values of each coordinate are examined in a combinatorial manner [56]. These approaches are classified into the following categories:
5.6.1 Conformational ensemble
Rigid docking approaches can easily be supplemented with a certain amount of flexibility. If an ensemble of previously produced ligand conformers is docked to the target using a conformational variation approach on the ligand complement, an example is MS-DOCK [57].
5.6.2 Fragmentation
DesJarlais et al. in 1986 described an approach to fragmentation of the ligand. The first application of ligand flexibility in docking was the hard docking of fragments into the reaction site and the subsequent connecting of the fragments [58]. Partial flexibility is achieved at the junctions among the fragments in this manner. Additional approaches, known as incremental building, initially dock one fragment and then add the rest, one by one. FlexX [59] and Hammerhead [60] are two approaches that use fragmentation [61].
5.6.3 Exhaustive search
Exhaustive exploration is an efficient method in austere intelligence, as it examines all of the ligands’ rotatable bonds systematically. To limit the search space and avoid a combinatorial explosion, several limitations and termination criteria are usually defined. The software Glide’s docking pipeline [62, 63] includes an exhaustive search stage.
6. Some common docking software
6.1 AutoDock
AutoDock is an open-source and automated docking package introduced by the Molecular Graphics Lab, Scripps Research Institute, La Jolla, CA 92037, USA. It is effectively applied to the calculation of the binding sphere of biological macromolecules like proteins and enzymes, as well as ligands (small molecules) [25]. The AutoDock docking suite offers the minimum binding energy of interaction obtained between the ligand and the receptor protein. The binding energy calculation is based on the formula offered in the form of the scoring function. Using the Lamarckian genetic algorithm (LGA), the AutoDock scoring function is established on the AMBER force field as well as through linear regression analysis [64]. It deals with reinforcing docking evaluation for ligands through almost zero to ten flexible bonds. The default settings of AutoDock are tremendously effective and are commonly applied to search for the interaction profile of a drug candidate. Furthermore, it is also extensively used for virtual screening. For each docking, the AutoDock is performed for a considerable duration to provide frequently docked conformations of the ligand concerning a receptor protein [65]. Examples: drug-receptor docking; protein-protein docking; molecule optimization; analysis oscillating from structure-based drug design; validation of the action mechanism of drug molecules; etc.
6.2 Handling tips of AutoDock
AutoDock tools offer multiple approaches for docking simulation, such as alternating from simple docking to advanced docking procedures [66]. The successful run of AutoDock requires four different files, such as ligand coordinates, target coordinates, grid parameters, and docking parameters [67, 68]. These files are prepared with the help of AutoDock Tools (ADT)/MGL Tools and their preparatory procedures are as follows:
6.3 Preparation ligand coordinate file
AutoDock accepts PDB or mol2 files as an input. In the novel compound, the first three-dimensional (3D) structure of the compound is prepared. The two-dimensional (2D) structure of the proposed compound can be prepared with the help of ChemDraw or ChemDoodle (https://web.chemdoodle.com/demos/sketcher/) and saved as a SMILES file. The SMILES file is pasted into the online CORINA Classic service (https://www.mn-am.com/online_demos/corina_demo) to prepare meals or.pdb files, but it needs further structural optimization through a suitable method such as Merck Molecular Force Field (MMFF). On the other hand, for simple preparation to optimize 3D structures, the online molsoft (https://www.molsoft.com/2dto3d.html) is recommended. It can prepare 2D as well as 3D structures in a single place. During the conversion of a 2D structure into 3D, it automatically optimizes the structure through MMFF. It has been found that the most accurate, optimized structure can be offered by DFT, but MMFF is still useful for an organic molecule. If the proposed compound has a known structure, then its crystalline 3D structure can be obtained from PubChem (https://pubchem.ncbi.nlm.nih.gov)and ChemSpider (http://www.chemspider.com/), etc. The coordinate setting of proposed compounds needs the addition of hydrogen atoms that are included in the 3D structure [69]. The proposed compound’s open 3D structure is selected as a ligand in ADT, and the ‘edit’ button is clicked to add polar hydrogens, Gasteiger charge, number of torsions, and detect root. At this moment, the ligand will be visible on the screen in which aromatic carbons appear green and another fragment looks red. Now click ‘ok’ and save it as a ligand pdbqt file.
6.4 Preparation of target coordinate file
ADT also requires preparing the coordinates of a biological macromolecule such as a protein or enzyme. The PDB file of the receptor can be downloaded from the Protein Data Bank (www.pdb.org), the Cambridge Crystallographic Database (ccdc.cam.ac.uk), etc. To generate the target coordinate file, all hydrogen atoms, need to be added. The 3D coordinates of the target can be taken from the PDB, and it requires the removal of water, ligands, cofactors, ions, etc. Click on ‘Edit’ to incorporate polar hydrogen, Kollman charge, Marge nonpolar hydrogen, and macromolecules are saved as target pdbqt.
6.5 Preparation grid parameter file
ADT needs a pdbqt file to prepare the grid parameter file (gpf). In a new window to set the grid, click on Grid > Macromolecule > Open and open the target pdbqt file by macromolecule. Similarly, click on Grid > Set map type > Open and open the ligand pdbqt file of the proposed small molecule or ligand, and then set the grid map, grid size, as well as grid center in x, y, and z-direction by clicking on “grid > Grid box”. After that, the output file can be saved as a gpf file.
6.6 Preparation docking parameter file
For the preparation of the docking parameter file (dpf), click on Docking > Macromolecule > Set rigid filament > Open in the ADT window to open the target PDBQT. Similarly, ligand pdbqt can also be opened by clicking on Docking > Ligand > Open. Then, set the algorithm by clicking on Docking > Search Parameters > Genetic algorithm and setting docking parameters. Finally, click on Docking > Output > Lararckian GA and save it as a dpf file. Then ADT is ready to run. Firstly, it runs. It required a proper time and needed the grid parameter file as well as a docking parameter file.
6.7 Analysis of docking result
ADT also offers to evaluate docking interactions and binding energies of a minimum of ten conformations along with a docking inhibition constant (K
The AutoDock scoring function can be calculated based on the following formula:
Where, total energy of van der Waal energy, hydrogen bond energy, electrostatic energy and dissolved energy equals to final intermolecular energy.
7. AutoDock Vina
AutoDock Vina was established by Oleg Trott in the Molecular Graphics Lab at the Scripps Research Institute in 2010 [70]. It is a relatively new, freely available tool for molecular docking, drug discovery, and virtual screening. It also offers high performance, multi-core proficiency, greater accuracy, and a simple handling protocol. Vina itself predicts the grid maps and clusters. Vina considerably enhances the accuracy of the interaction mode calculations as associated with AutoDock. Vina has been found to predict more accurate results as compared to other tools [71].
7.1 Handling tips of AutoDock Vina
The input and output files of Vina are pdbqt. It is essential to prepare the ligand as well as the target coordinate file in pdbqt format. Both coordinate files are prepared similarly as in AutoDock. Vina does not require a grid parameter file and a docking parameter file [72]. Additionally, it requires a text configuration file. Complete handling of AutoDock Vina is discussed below.
7.2 Preparation of configuration file
A new window of ADT is opened after the preparation of the ligand and target coordinate file. Click on Grid > Macromolecule > Open and open the target pdbqt file. Click “YES” to save the present modifications in the folder, and then press “OK” to receive them. Sometimes a warning window is also opened if there are minor indiscretions in charge. Ignore it by pressing “OK.”
Then, click Grid > Set map types > Open and open the ligand pdbqt file. The grid map, grid size, and grid center of the analysis space are then described in a new window that is opened by selecting Grid > Grid box. To begin the box built on the ligand, click Center > Center on the ligand. Here, thumbnails are available for the manual changes in the values of grid size and center, along with other options. Press file > close to save the current after adjusting the grid’s size and center. To complete the setup, select Docking > Output > Vina Configuration and click “SAVE” to provide a text configuration file with the default name of config.txt.
7.3 Run AutoDock Vina
The default setting of AutoDock Vina is not enough to accurately evaluate the interaction profile and binding energies. Vina offers a factor known as exhaustiveness to adjust the computer-aided strength utilized during a docking analysis. In Vina, the default value of exhaustiveness is 8. For greater accuracy, the default value of exhaustiveness is changed and set to about 24. It will provide more accurate docking findings. The most well-known way to run Vina is via ADT. ADT offers to click “run” to run AutoDock Vina. Open a window to start the route of the Vina executable file by pressing the browse option, and then press the launch button to operate the Vina. The second path is through the command line. Open a terminal window and modify the directory that encompasses the coordinate files as well as the configuration file. The command line is edited to adjust the values of exhaustiveness (like: /Vina--config config.txt-exhaustiveness = 24). This command accepts that the AutoDock Vina executable Vina is also found in a similar directory.
7.4 Analysis of Vina docking result
ADT also offers the ability to visualize the outcomes of docking from AutoDock Vina. Open a new ADT window and select the working directory. Analyze > Docking > Open the AutoDock Vina result and select the output file obtained from step II. Then select the default single molecule with numerous conformations followed by pressing “OK” to visualize the coordinates for all docked outcomes through arrow keys. To visualize the target coordinate file, select Analyze > Macromolecule > Open and open the target pdbqt file. Similarly, open the ligand coordinate file by clicking on File > Read molecule > Open and open the ligand pdbqt file to read the crystallographic location of the ligand. It offers the ability to evaluate the ligand as well as docked conformation. Select Analyze > Docking > Show interactions to examine the ligand-target complex’s interaction profile.
The estimated scoring function of AutoDock Vina is based on the following formula:
Where ∆G denotes Gibbs’ free energy, ∆G (vdW) denotes van der Waal’s free energy, and ∆G (H bond) denotes hydrogen bond free energy. ∆G (Elec.) stands for electrostatic free energy; ∆G (E dissolv.) stands for dissolved free energy. Torsional free energy is denoted by the symbol ∆G (tors).
8. AutoDock FR
AutoDock FR (ADFR: AutoDock for Flexible Receptors) was developed by Dr. Pradeep Anand Ravindranath in the Integrative Structural and Computational Biology Lab at the Scripps Research Institute in 2015. The ADFR is a newly designed docking tool built for the AutoDock scoring function. The ADFR was deliberately designed to study the interaction of small flexible ligands with the target protein [69, 73]. It offers preparation of side-chains of target proteins flexibly to simulate induced-fit without the knowledge of the side-chain conformational alterations [73]. The ADFR regulates up to 14 targets with side-chain flexibility. The proficient growth rate of docking realization is more than 50%. On the cross, docking is investigated along with up to 12 flexible receptor side-chains. The ADFR displays superior results as compared to AutoDock Vina. Vina requires uncontrolled run time for docking by increasing the number of flexible receptor side chains. On the other hand, ADFR requires linear run time [73].
8.1 Handling tips of AutoDockFR
The input format of ADFR is pdbqt format. ADFR requires the preparation of coordinate files of ligand and target. Coordinate files are prepared with the help of ADT. To perform docking through ADFR also requires the generation of affinity maps and translational points that are probable ligand binding areas. The step-by-step handling protocol of ADFR is discussed below.
8.2 Prepare affinity maps and translational points
Open a new ADFR window, select the receptor PDBQT > Open, and upload the target coordinate file in pdbqt format to run the docking analysis. Similarly, the ligand pdbqt file is uploaded by selecting Open under ligand PDBQT. Then press the box entire ligand button to surround the ligand with a docking box or grid box, followed by clicking on the center view of the docking box to center the docking position. In the docking box, along with ligand, amino acid residues can also be labeled by clicking on “show receptor residue labels.” ADFR is the only tool to select the amino acid residue up to 14 at a time with a single click. To select the amino acid residues for docking investigation, click on flexible residues and select the amino acids from the list. The selected side chains of the amino acid are presented as orange balls-sticks and the other portions remain the same. Then click the green checkmark.
For the prediction of binding pockets, click on the ‘compute pockets’ button. Auto Site recognizes multiple pockets in the docking box and selects those at which the actual ligand is found in higher volume. These binding pocket fill-points appear as a green mesh, denoted as translational points. If the binding pocket fills-points button is green, then generating maps is supported. To generate affinity maps, press the Generate maps button and save the maps as a zip file in the working folder.
8.3 Run ADFR
Open the command window, adjust the working directory, and type the following windows command to run the ADFR: “c:\Program Files\MGL Tools 2-latest\adfr.bat” random pdbqt -m generate.zip -r ligand pdbqt -job Name Result --seed −1. To visualize the docking result, a visualization tool like Biovia DSV is used to generate the interaction profile of the ligand-target complex.
9. iGEMDOCK
The iGEMDOCK tool was established by the Institute of Bioinformatics at National Chiao Tung University, Taiwan for docking, drug design, screening, and post-screening analysis. It is an automatic multipurpose graphical package [74]. For docking evaluation on the iGEMDOCK, initially prepare the coordinate files of the ligand as well as the target. Coordinate files are prepared similarly as in AutoDock by adding torsions, bond orders, hydrogen atoms, and charges. These parameters are assigned to both the ligand and the target. The input and output files of the iGEMDOCK are PDB and Mol. IGEMDOCK automatically selects the most suitable conformation of the ligand and gives the total binding energy [74]. The iGEMDOCK scores are calculated using an empirical formula or fitness score, denoted as.
Van der Waal energy + Hydrogen bond energy + Electro-statistic energy equals fitness score.
During the docking evaluation, the estimation of target binding sites and structure optimization are very significant. The hydrogen bonds found in the docked complex strongly impact the scoring function. This possibility reduces the number of suspected H bonds significantly. Additionally, internal H bonds, as well as internal electrostatic interaction, are predicted as sp2-sp2 torsions from the interaction complex. The iGEMDOCK works, since the generic evolutionary method (GA), provides three effective docking methods,
9.1 Handling tips of iGEMDOCK
The iGEMDOCK is a complete package of automated docking and screening. It is a combination of two main parts; the first part predicts the interaction profile among the ligand-target complex in the 3D structure, while the second part predicts the suitable pose of the ligand-target complex along with post-analysis. The docking evaluation with the iGEMDOCK begins with the preparation of ligand and target protein coordinate files. Both coordinate files are prepared like AutoDock. The iGEMDOCK input and output file formats are mol, mol2, and PDB.
9.2 Target binding site preparation
In the iGEMDOCK operator, a distinct binding site of the target protein/enzyme or complete target structure is selected. If the target’s input file contains a natural physiological ligand, it will automatically determine the target’s binding site. To begin docking, upload the target’s coordinate file (PDB) by clicking “Prepare binding site > Browse > Open” in the “Protein-ligand docking/screening” window. To select the binding site of the target, click on “By bounded ligand” and then define the binding site center by selecting the available ligand which you want to study. It also offers to set the binding site radius; by default, its value is 8.0 Å. Uncheck the “Retain reference ligand” box, and then click “OK” to save the defined parameter to the chosen binding site. This will delete the physiological ligand. Select “by a current file” to specify the binding sites of the new target protein.
9.3 Ligand preparation
The iGEMDOCK provides two methods for ligand preparation. To begin, for “single ligand,” upload the ligand coordinate file (single/many) directly by clicking “Prepare compounds > Ligands > Open” and pressing “OK” at the “docking/screening” window. The iGEMDOCK recommends preparing the ligand coordinate file in mol. It does not assign charges and hydrogen to all of the ligand’s atoms. For the “ligand database”, the ligand library is also prepared as mol. To upload the list of compounds, click “Prepare compounds,” then “import list,” “Open,” and “OK.”
9.4 Run iGEMDOCK
Set the output path before the start of docking evaluation. Set the output path by clicking on the “Set output path”. Then choose the desired file and press “OK.”
10. Use of molecular docking
In the last decade, technologies like high-throughput sequencing and X-ray crystallography have been regularly updated. The crystal structures of large numbers of proteins have been defined. Consequently, the structural and functional significance of biological macromolecules (like proteins and enzymes) has been expanded and many novel drug targets also have been identified [75]. Due to the revolution of computational science in various fields of research, the utilization of virtual screening and molecular docking in DDD has been significantly stimulated. The development of a novel drug is time-consuming, costly, and needs more manpower [76]. Currently, computer-aided technology has become a key tool in DDD. Through molecular docking simulation, the analysis of the mutual interaction of drug and receptor becomes very easy along with high accuracy and boosts the drug development procedure by reducing the time [77].
Reverse molecular docking is a particularly fresh and innovative significant of molecular docking. It precedes the library of small molecules as a key structure to execute molecular docking in the spatial or 3D target database and evaluate the conceivable larger entities to conclude the three-dimensional structure and energy of identical assessment. That is to say, it identifies the most suitable target with minimum binding energy. For that reason, the development of reverse molecular docking provides a new route to discover the suitable target of a drug compound and reveal the drug action mechanism [78].
11. Conclusion
The findings of this chapter demonstrate that docking programs are highly focused on the development of new pharmaceutical compounds using molecular modeling. In this decade, new docking software designs are emphasized. These trends are focused on improving docking accuracy by using more accurate molecular energy calculations without any fitting parameters, such as quantum-chemical methods, implicit solvent models, and new global optimization algorithms that can treat ligand flexibility and protein atom mobility at the same time. Current docking applications are not reliable enough to estimate binding affinity due to the insufficient molecular structure and the inadequacies of the scoring algorithm. However, by including a huge amount of biological data into the scoring function, the present molecular docking technique can be improved. Finally, it is demonstrated that all of the conditions for improving docking accuracy may be met in practice. Furthermore, some expanded sampling strategies are no longer an exclusive methodological exercise but have become accessible to a wide range of research organizations, with real-world applications in drug discovery. Molecular docking, technological advancements, and novel MD computational approaches have all made it possible to simulate increasingly large conformational shifts. By providing a mechanical understanding of binding pathways, the ability to recreate present folding and binding processes can be used to address the long-standing argument regarding “induced-fit” and “conformational selection” binding theories.
Acknowledgments
The authors gratefully acknowledge the R&D wing of Integral University, Lucknow, for their support and guidance.
Conflict of interest
The authors declare no conflict of interest.
References
- 1.
Chen G, Seukep AJ, Guo M. Recent advances in molecular docking for the research and discovery of potential marine drugs. Marine Drugs. 2020; 18 (11):545-566 - 2.
Guedes IA, de Magalhães CS, Dardenne LE. Receptor–ligand molecular docking. Biophysical Reviews. 2014; 6 (1):75 - 3.
Pantsar T, Poso A. Binding affinity via docking: Fact and fiction. Mol A J Synth Chem Nat Prod Chem. 2018; 23 (8):1899-1909 - 4.
Mazumder M, Ponnan P, Das U, Gourinath S, Khan HA, Yang J, et al. Investigations on binding pattern of kinase inhibitors with PPAR γ: Molecular docking, molecular dynamic simulations, and free energy calculation studies. PPAR Research. 2017; 2017 :6397836 - 5.
Ramírez D, Caballero J. Is it reliable to use common molecular docking methods for comparing the binding affinities of enantiomer pairs for their protein target? International Journal of Molecular Sciences. 2016; 17 (4):525-539 - 6.
Kuntz ID, Blaney JM, Oatley SJ, Langridge R, Ferrin TE. A geometric approach to macromolecule-ligand interactions. Journal of Molecular Biology. 1982; 161 (2):269-288 - 7.
Ahmed A, Mam B, Sowdhamini R. DEELIG: A deep learning approach to predict protein-ligand binding affinity. Bioinformatics and Biology Insights. 2021; 15 :1-9 - 8.
Guedes IA, Pereira FSS, Dardenne LE. Empirical scoring functions for structure-based virtual screening: Applications, critical aspects, and challenges. Frontiers in Pharmacology. 2018; 9 :1089 - 9.
Koshland DE. The key–lock theory and the induced fit theory. Angew Chemie Int Ed English. 1995; 33 (23–24):2375-2378 - 10.
Lexa KW, Carlson HA. Protein flexibility in docking and surface mapping. Quarterly Reviews of Biophysics. 2012; 45 (3):301-343 - 11.
Tripathi A, Bankaitis VA. Molecular docking: From lock and key to combination lock. J Mol Med Clin Appl. 2017; 2 (1). DOI: 10.16966/2575-0305.106 - 12.
Andrusier N, Mashiach E, Nussinov R, Wolfson HJ. Principles of flexible protein-protein docking. Proteins: Structure, Function and Genetics. 2008; 73 (2):271-289 - 13.
Anderson AC, O’Neil RH, Surti TS, Stroud RM. Approaches to solving the rigid receptor problem by identifying a minimal set of flexible residues during ligand docking. Chemistry & Biology. 2001; 8 (5):445-457 - 14.
Meng X-Y, Zhang H-X, Mezei M, Cui M. Molecular docking: A powerful approach for structure-based drug discovery. Current Computer-Aided Drug Design. 2011; 7 (2):146-157 - 15.
Du X, Li Y, Xia YL, Ai SM, Liang J, Sang P, et al. Insights into protein–ligand interactions: Mechanisms, models, and methods. International Journal of Molecular Sciences. 2016; 17 (2):144-177 - 16.
Cramer F. Biochemical correctness: Emil Fischer’s lock and key hypothesis, a hundred years after — An essay. Pharmaceutica Acta Helvetiae. 1995; 69 (4):193-203 - 17.
Salmaso V, Moro S. Bridging molecular docking to molecular dynamics in exploring ligand-protein recognition process: An overview. Frontiers in Pharmacology. 2018; 9 :923 - 18.
Meng X-Y, Zhang H-X, Mezei M, Cui M. Molecular docking: A powerful approach for structure-based drug discovery. Curr Comput Aided-Drug Des. 2012; 7 (2):146-157 - 19.
Pinzi L, Rastelli G. Molecular docking: Shifting paradigms in drug discovery. International Journal of Molecular Sciences. 2019; 20 (18):4331 - 20.
Hetényi C, Van Der Spoel D. Blind docking of drug-sized compounds to proteins with up to a thousand residues. FEBS Letters. 2006; 580 (5):1447-1450 - 21.
Ferreira LG, Dos Santos RN, Oliva G, Andricopulo AD. Molecular docking and structure-based drug design strategies. Molecules. 2015; 20 (7):13384 - 22.
Hall R, Dixon T, Dickson A. On calculating free energy differences using ensembles of transition paths. Frontiers in Molecular Biosciences. 2020; 7 :106 - 23.
Agrawal P, Singh H, Srivastava HK, Singh S, Kishore G, Raghava GPS. Benchmarking of different molecular docking methods for protein-peptide docking. BMC Bioinformatics. 2019; 19 :426 - 24.
Torres PHM, Sodero ACR, Jofily P, Silva-Jr FP. Key topics in molecular docking for drug design. International Journal of Molecular Sciences. 2019; 20 (18):4574 - 25.
Morris GM, Goodsell DS, Halliday RS, Huey R, Hart WE, Belew RK, et al. Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. Journal of Computational Chemistry. 1998; 19 (14):1639-1662 - 26.
Zavodszky MI, Kuhn LA. Side-chain flexibility in protein–ligand binding: The minimal rotation hypothesis. Protein Science. 2005; 14 (4):1104 - 27.
Miao Z, Cao Y. Quantifying side-chain conformational variations in protein structure. Sci Reports. 2016; 6 (1):1-10 - 28.
Jiang F, Kim SH. “Soft docking”: Matching of molecular surface cubes. Journal of Molecular Biology. 1991; 219 (1):79-102 - 29.
Lepore R, Kryshtafovych A, Alahuhta M, Veraszto HA, Bomble YJ, Bufton JC, et al. Target highlights in CASP13: Experimental target structures through the eyes of their authors. Proteins Struct Funct Bioinforma. 2019; 87 (12):1037-1057 - 30.
Hospital A, Goñi JR, Orozco M, Gelpí JL. Molecular dynamics simulations: Advances and applications. Adv Appl Bioinform Chem. 2015; 8 (1):37 - 31.
Huang SY, Zou X. Ensemble docking of multiple protein structures: Considering protein structural variations in molecular docking. Proteins. 2007; 66 (2):399-421 - 32.
Salmaso N, Stevens HE, McNeill J, ElSayed M, Ren Q, Maragnoli ME, et al. Fibroblast growth factor 2 modulates hypothalamic pituitary Axis activity and anxiety behavior through glucocorticoid receptors. Biological Psychiatry. 2016; 80 (6):479-489 - 33.
Al-Karmalawy AA, Dahab MA, Metwaly AM, Elhady SS, Elkaeed EB, Eissa IH, et al. Molecular docking and dynamics simulation revealed the potential inhibitory activity of ACEIs against SARS-CoV-2 targeting the hACE2 receptor. Frontiers in Chemistry. 2021; 9 :661230 - 34.
Seidel R, Blumer M, Chaumel J, Amini S, Dean MN. Endoskeletal mineralization in chimaera and a comparative guide to tessellated cartilage in chondrichthyan fishes (sharks, rays and chimaera). J R Soc Interface. 2020; 17 :20200474 - 35.
Gallego-Yerga L, Ochoa R, Lans I, Peña-Varas C, Alegría-Arcos M, Cossio P, et al. Application of ensemble pharmacophore-based virtual screening to the discovery of novel antimitotic tubulin inhibitors. Computational and Structural Biotechnology Journal. 2021; 19 :4360-4372 - 36.
Knegtel RMA, Kuntz ID, Oshiro CM. Molecular docking to ensembles of protein structures. Journal of Molecular Biology. 1997; 266 (2):424-440 - 37.
Adcock SA, McCammon JA. Molecular dynamics: Survey of methods for simulating the activity of proteins. Chemical Reviews. 2006; 106 (5):1589 - 38.
Hollingsworth SA, Dror RO. Molecular dynamics simulation for all. Neuron. 2018; 99 (6):1129 - 39.
Kitchen DB, Decornez H, Furr JR, Bajorath J. Docking and scoring in virtual screening for drug discovery: Methods and applications. Nature Reviews. Drug Discovery. 2004; 3 (11):935-949 - 40.
Gautam B. Energy Minimization. In: Maia RT, de Moraes Filho Ra, Campos M. editors. Homology Molecular Modeling - Perspectives and Applications. London: IntechOpen; 2020. DOI: 10.5772/intechopen.94809 - 41.
Araki M, Matsumoto S, Bekker GJ, et al. Exploring ligand binding pathways on proteins using hypersound-accelerated molecular dynamics. Nature Communications. 2021; 12 :2793 - 42.
Huang SY, Zou X. Advances and challenges in protein-ligand docking. International Journal of Molecular Sciences. 2010; 11 (8):3016-3034 - 43.
Korb O, Stützle T, Exner TE. PLANTS: Application of ant Colony optimization to structure-based drug design. Lecture Notes in Computer Science. 2006; 4150 :247-258 - 44.
Katoch S, Chauhan SS, Kumar V. A review on genetic algorithm: Past, present, and future. Multimedia Tools and Applications. 2021; 80 (5):8091-8126 - 45.
Tessaro F, Tessaro F, Scapozza L, Scapozza L. How ‘protein-docking’ translates into the new emerging field of docking small molecules to nucleic acids? Molecules. 2020; 25 (12):2749 - 46.
Xu J, Zhang L, Ye Y, Shan Y, Wan C, Wang J, et al. SNX16 regulates the recycling of E-cadherin through a unique mechanism of coordinated membrane and cargo binding. Structure. 2017; 25 (8):1251-1263.e5 - 47.
Pei J, Wang Q, Liu Z, Li Q, Yang K, Lai L. PSI-DOCK: Towards highly efficient and accurate flexible ligand docking. Proteins Struct Funct Bioinforma. 2006; 62 (4):934-946 - 48.
Jones S, Thornton JM. Prediction of protein-protein interaction sites using patch analysis. Journal of Molecular Biology. 1997; 272 (1):133-143 - 49.
Yan Y, He J, Feng Y, Lin P, Tao H, Huang SY. Challenges and opportunities of automated protein-protein docking: HDOCK server vs human predictions in CAPRI rounds 38-46. Proteins Struct Funct Bioinforma. 2020; 88 (8):1055-1069 - 50.
Ruiz-Carmona S, Alvarez-Garcia D, Foloppe N, Garmendia-Doval AB, Juhos S, Schmidtke P, et al. rDock: A fast, versatile and open source program for docking ligands to proteins and nucleic acids. PLoS Computational Biology. 2014; 10 (4):e1003571 - 51.
Verdonk ML, Cole JC, Hartshorn MJ, Murray CW, Taylor RD. Improved protein-ligand docking using GOLD. Proteins: Structure, Function, and Genetics. 2003; 52 (4):609-623 - 52.
Trott O, Olson AJ. AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of Computational Chemistry. 2010; 31 (2):455-461 - 53.
Kozakov D, Hall DR, Xia B, Porter KA, Padhorny D, Yueh C, et al. The ClusPro web server for protein-protein docking. Nature Protocols. 2017; 12 (2):255 - 54.
del Alamo D, Sala D, Mchaourab HS, Meiler J. Sampling alternative conformational states of transporters and receptors with AlphaFold2. eLife (2022); 11 :e75751. DOI.org/10.7554/eLife.75751 - 55.
Pagadala NS, Syed K, Tuszynski J. Software for molecular docking: A review. Biophysical Reviews. 2017; 9 (2):91 - 56.
Brooijmans N, Kuntz ID. Molecular recognition and docking algorithms. Annual Review of Biophysics and Biomolecular Structure. 2003; 32 :335-373 - 57.
Sauton N, Lagorce D, Villoutreix BO, Miteva MA. MS-DOCK: Accurate multiple conformation generator and rigid docking protocol for multi-step virtual ligand screening. BMC Bioinformatics. 2008; 9 (1):1-12 - 58.
DesJarlais RL, Kuntz ID, Sheridan RP, Venkataraghavan R, Dixon JS. Docking flexible ligands to macromolecular receptors by molecular shape. Journal of Medicinal Chemistry. 1986; 29 (11):2149-2153 - 59.
Rarey M, Kramer B, Lengauer T, Klebe G. A fast flexible docking method using an incremental construction algorithm. Journal of Molecular Biology. 1996; 261 (3):470-489 - 60.
Lengauer T, Rarey M. Computational methods for biomolecular docking. Current Opinion in Structural Biology. 1996; 6 (3):402-406 - 61.
Welch WJ, Brown CR. Influence of molecular and chemical chaperones on protein folding. Cell Stress & Chaperones. 1996; 1 (2):109 - 62.
Halgren TA, Murphy RB, Friesner RA, Beard HS, Frye LL, Pollard WT, Banks JL. Glide: A new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening. Journal of Medicinal Chemistry 2004;47(7):1750–1759. - 63.
Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, Mainz DT, et al. Glide: A new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. Journal of Medicinal Chemistry. 2004; 47 (7):1739-1749 - 64.
Morris GM, Ruth H, Lindstrom W, Sanner MF, Belew RK, Goodsell DS, et al. Software news and updates AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. Journal of Computational Chemistry. 2009; 30 (16):2785-2791 - 65.
Dar AM, Mir S. Molecular docking: Approaches, types, applications and basic challenges. J Anal Bioanal Tech. 2017; 8 (2):1-3 - 66.
Hernández-Santoyo A, Tenorio-Barajas AY, VictorAltuzar V, Vivanco-Cid H, Mendoza-Barrera C. Protein-Protein and Protein-Ligand Docking. In: Ogawa T. editor. Protein Engineering - Technology and Application. London: IntechOpen; 2013. DOI: 10.5772/56376 - 67.
Cosconati S, Forli S, Perryman AL, Harris R, Goodsell DS, Olson AJ. Virtual screening with AutoDock: Theory and practice. Expert Opinion on Drug Discovery. 2010; 5 (6):597-607 - 68.
Rizvi SMD, Shakil S, Haneef M. A simple click by click protocol to perform docking: AutoDock 4.2 made easy for non-bioinformaticians. EXCLI Journal. 2013; 12 :831-857 - 69.
Forli S, Huey R, Pique ME, Sanner MF, Goodsell DS, Olson AJ. Computational protein-ligand docking and virtual drug screening with the AutoDock suite. Nature Protocols. 2016; 11 (5):905-919 - 70.
Trott O, Olson AJ. AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of Computational Chemistry. 2009; 31 (2):NA-NA - 71.
Sandeep G, Nagasree KP, Hanisha M, Kumar MMK. AUDocker LE: A GUI for virtual screening with AUTODOCK Vina. BMC Research Notes. 2011; 4 (1):445 - 72.
Seeliger D, De Groot BL. Ligand docking and binding site analysis with PyMOL and Autodock/Vina. Journal of Computer-Aided Molecular Design. 2010; 24 (5):417-422 - 73.
Ravindranath PA, Forli S, Goodsell DS, Olson AJ, Sanner MF. AutoDockFR: Advances in protein-ligand docking with explicitly specified binding site flexibility. PLoS Computational Biology. 2015; 11 (12):e1004586 - 74.
Hsu KC, Chen YF, Lin SR, Yang JM. Igemdock: A graphical environment of enhancing gemdock using pharmacological interactions and post-screening analysis. BMC Bioinformatics. 2011; 12 (Suppl. 1):S33 - 75.
Jamkhande PG, Ghante MH, Ajgunde BR. Software based approaches for drug designing and development: A systematic review on commonly used software and its applications. Bull Fac Pharmacy, Cairo Univ. 2017; 55 (2):203-210 - 76.
Talluri S. Molecular docking and virtual screening based prediction of drugs for COVID-19. Combinatorial Chemistry & High Throughput Screening. 2021; 24 (5):716-728 - 77.
Oferkin IV, Katkova EV, Sulimov AV, Kutov DC, Sobolev SI, Voevodin VV, et al. Evaluation of docking target functions by the comprehensive investigation of protein-ligand energy minima. Advances in Bioinformatics. 2015; 2015 :20151-20112 - 78.
Malmstrom RD, Watowich SJ. Using free energy of binding calculations to improve the accuracy of virtual screening predictions. Journal of Chemical Information and Modeling. 2011; 51 (7):1648-1655