- •Foreword
- •Preface
- •Contents
- •Introduction
- •Oren M. Becker
- •Alexander D. MacKerell, Jr.
- •Masakatsu Watanabe*
- •III. SCOPE OF THE BOOK
- •IV. TOWARD A NEW ERA
- •REFERENCES
- •Atomistic Models and Force Fields
- •Alexander D. MacKerell, Jr.
- •II. POTENTIAL ENERGY FUNCTIONS
- •D. Alternatives to the Potential Energy Function
- •III. EMPIRICAL FORCE FIELDS
- •A. From Potential Energy Functions to Force Fields
- •B. Overview of Available Force Fields
- •C. Free Energy Force Fields
- •D. Applicability of Force Fields
- •IV. DEVELOPMENT OF EMPIRICAL FORCE FIELDS
- •B. Optimization Procedures Used in Empirical Force Fields
- •D. Use of Quantum Mechanical Results as Target Data
- •VI. CONCLUSION
- •REFERENCES
- •Dynamics Methods
- •Oren M. Becker
- •Masakatsu Watanabe*
- •II. TYPES OF MOTIONS
- •IV. NEWTONIAN MOLECULAR DYNAMICS
- •A. Newton’s Equation of Motion
- •C. Molecular Dynamics: Computational Algorithms
- •A. Assigning Initial Values
- •B. Selecting the Integration Time Step
- •C. Stability of Integration
- •VI. ANALYSIS OF DYNAMIC TRAJECTORIES
- •B. Averages and Fluctuations
- •C. Correlation Functions
- •D. Potential of Mean Force
- •VII. OTHER MD SIMULATION APPROACHES
- •A. Stochastic Dynamics
- •B. Brownian Dynamics
- •VIII. ADVANCED SIMULATION TECHNIQUES
- •A. Constrained Dynamics
- •C. Other Approaches and Future Direction
- •REFERENCES
- •Conformational Analysis
- •Oren M. Becker
- •II. CONFORMATION SAMPLING
- •A. High Temperature Molecular Dynamics
- •B. Monte Carlo Simulations
- •C. Genetic Algorithms
- •D. Other Search Methods
- •III. CONFORMATION OPTIMIZATION
- •A. Minimization
- •B. Simulated Annealing
- •IV. CONFORMATIONAL ANALYSIS
- •A. Similarity Measures
- •B. Cluster Analysis
- •C. Principal Component Analysis
- •REFERENCES
- •Thomas A. Darden
- •II. CONTINUUM BOUNDARY CONDITIONS
- •III. FINITE BOUNDARY CONDITIONS
- •IV. PERIODIC BOUNDARY CONDITIONS
- •REFERENCES
- •Internal Coordinate Simulation Method
- •Alexey K. Mazur
- •II. INTERNAL AND CARTESIAN COORDINATES
- •III. PRINCIPLES OF MODELING WITH INTERNAL COORDINATES
- •B. Energy Gradients
- •IV. INTERNAL COORDINATE MOLECULAR DYNAMICS
- •A. Main Problems and Historical Perspective
- •B. Dynamics of Molecular Trees
- •C. Simulation of Flexible Rings
- •A. Time Step Limitations
- •B. Standard Geometry Versus Unconstrained Simulations
- •VI. CONCLUDING REMARKS
- •REFERENCES
- •Implicit Solvent Models
- •II. BASIC FORMULATION OF IMPLICIT SOLVENT
- •A. The Potential of Mean Force
- •III. DECOMPOSITION OF THE FREE ENERGY
- •A. Nonpolar Free Energy Contribution
- •B. Electrostatic Free Energy Contribution
- •IV. CLASSICAL CONTINUUM ELECTROSTATICS
- •A. The Poisson Equation for Macroscopic Media
- •B. Electrostatic Forces and Analytic Gradients
- •C. Treatment of Ionic Strength
- •A. Statistical Mechanical Integral Equations
- •VI. SUMMARY
- •REFERENCES
- •Steven Hayward
- •II. NORMAL MODE ANALYSIS IN CARTESIAN COORDINATE SPACE
- •B. Normal Mode Analysis in Dihedral Angle Space
- •C. Approximate Methods
- •IV. NORMAL MODE REFINEMENT
- •C. Validity of the Concept of a Normal Mode Important Subspace
- •A. The Solvent Effect
- •B. Anharmonicity and Normal Mode Analysis
- •VI. CONCLUSIONS
- •ACKNOWLEDGMENT
- •REFERENCES
- •Free Energy Calculations
- •Thomas Simonson
- •II. GENERAL BACKGROUND
- •A. Thermodynamic Cycles for Solvation and Binding
- •B. Thermodynamic Perturbation Theory
- •D. Other Thermodynamic Functions
- •E. Free Energy Component Analysis
- •III. STANDARD BINDING FREE ENERGIES
- •IV. CONFORMATIONAL FREE ENERGIES
- •A. Conformational Restraints or Umbrella Sampling
- •B. Weighted Histogram Analysis Method
- •C. Conformational Constraints
- •A. Dielectric Reaction Field Approaches
- •B. Lattice Summation Methods
- •VI. IMPROVING SAMPLING
- •A. Multisubstate Approaches
- •B. Umbrella Sampling
- •C. Moving Along
- •VII. PERSPECTIVES
- •REFERENCES
- •John E. Straub
- •B. Phenomenological Rate Equations
- •II. TRANSITION STATE THEORY
- •A. Building the TST Rate Constant
- •B. Some Details
- •C. Computing the TST Rate Constant
- •III. CORRECTIONS TO TRANSITION STATE THEORY
- •A. Computing Using the Reactive Flux Method
- •B. How Dynamic Recrossings Lower the Rate Constant
- •IV. FINDING GOOD REACTION COORDINATES
- •A. Variational Methods for Computing Reaction Paths
- •B. Choice of a Differential Cost Function
- •C. Diffusional Paths
- •VI. HOW TO CONSTRUCT A REACTION PATH
- •A. The Use of Constraints and Restraints
- •B. Variationally Optimizing the Cost Function
- •VII. FOCAL METHODS FOR REFINING TRANSITION STATES
- •VIII. HEURISTIC METHODS
- •IX. SUMMARY
- •ACKNOWLEDGMENT
- •REFERENCES
- •Paul D. Lyne
- •Owen A. Walsh
- •II. BACKGROUND
- •III. APPLICATIONS
- •A. Triosephosphate Isomerase
- •B. Bovine Protein Tyrosine Phosphate
- •C. Citrate Synthase
- •IV. CONCLUSIONS
- •ACKNOWLEDGMENT
- •REFERENCES
- •Jeremy C. Smith
- •III. SCATTERING BY CRYSTALS
- •IV. NEUTRON SCATTERING
- •A. Coherent Inelastic Neutron Scattering
- •B. Incoherent Neutron Scattering
- •REFERENCES
- •Michael Nilges
- •II. EXPERIMENTAL DATA
- •A. Deriving Conformational Restraints from NMR Data
- •B. Distance Restraints
- •C. The Hybrid Energy Approach
- •III. MINIMIZATION PROCEDURES
- •A. Metric Matrix Distance Geometry
- •B. Molecular Dynamics Simulated Annealing
- •C. Folding Random Structures by Simulated Annealing
- •IV. AUTOMATED INTERPRETATION OF NOE SPECTRA
- •B. Automated Assignment of Ambiguities in the NOE Data
- •C. Iterative Explicit NOE Assignment
- •D. Symmetrical Oligomers
- •VI. INFLUENCE OF INTERNAL DYNAMICS ON THE
- •EXPERIMENTAL DATA
- •VII. STRUCTURE QUALITY AND ENERGY PARAMETERS
- •VIII. RECENT APPLICATIONS
- •REFERENCES
- •II. STEPS IN COMPARATIVE MODELING
- •C. Model Building
- •D. Loop Modeling
- •E. Side Chain Modeling
- •III. AB INITIO PROTEIN STRUCTURE MODELING METHODS
- •IV. ERRORS IN COMPARATIVE MODELS
- •VI. APPLICATIONS OF COMPARATIVE MODELING
- •VII. COMPARATIVE MODELING IN STRUCTURAL GENOMICS
- •VIII. CONCLUSION
- •ACKNOWLEDGMENTS
- •REFERENCES
- •Roland L. Dunbrack, Jr.
- •II. BAYESIAN STATISTICS
- •A. Bayesian Probability Theory
- •B. Bayesian Parameter Estimation
- •C. Frequentist Probability Theory
- •D. Bayesian Methods Are Superior to Frequentist Methods
- •F. Simulation via Markov Chain Monte Carlo Methods
- •III. APPLICATIONS IN MOLECULAR BIOLOGY
- •B. Bayesian Sequence Alignment
- •IV. APPLICATIONS IN STRUCTURAL BIOLOGY
- •A. Secondary Structure and Surface Accessibility
- •ACKNOWLEDGMENTS
- •REFERENCES
- •Computer Aided Drug Design
- •Alexander Tropsha and Weifan Zheng
- •IV. SUMMARY AND CONCLUSIONS
- •REFERENCES
- •Oren M. Becker
- •II. SIMPLE MODELS
- •III. LATTICE MODELS
- •B. Mapping Atomistic Energy Landscapes
- •C. Mapping Atomistic Free Energy Landscapes
- •VI. SUMMARY
- •REFERENCES
- •Toshiko Ichiye
- •II. ELECTRON TRANSFER PROPERTIES
- •B. Potential Energy Parameters
- •IV. REDOX POTENTIALS
- •A. Calculation of the Energy Change of the Redox Site
- •B. Calculation of the Energy Changes of the Protein
- •B. Calculation of Differences in the Energy Change of the Protein
- •VI. ELECTRON TRANSFER RATES
- •A. Theory
- •B. Application
- •REFERENCES
- •Fumio Hirata and Hirofumi Sato
- •Shigeki Kato
- •A. Continuum Model
- •B. Simulations
- •C. Reference Interaction Site Model
- •A. Molecular Polarization in Neat Water*
- •B. Autoionization of Water*
- •C. Solvatochromism*
- •F. Tautomerization in Formamide*
- •IV. SUMMARY AND PROSPECTS
- •ACKNOWLEDGMENTS
- •REFERENCES
- •Nucleic Acid Simulations
- •Alexander D. MacKerell, Jr.
- •Lennart Nilsson
- •D. DNA Phase Transitions
- •III. METHODOLOGICAL CONSIDERATIONS
- •A. Atomistic Models
- •B. Alternative Models
- •IV. PRACTICAL CONSIDERATIONS
- •A. Starting Structures
- •C. Production MD Simulation
- •D. Convergence of MD Simulations
- •WEB SITES OF INTEREST
- •REFERENCES
- •Membrane Simulations
- •Douglas J. Tobias
- •II. MOLECULAR DYNAMICS SIMULATIONS OF MEMBRANES
- •B. Force Fields
- •C. Ensembles
- •D. Time Scales
- •III. LIPID BILAYER STRUCTURE
- •A. Overall Bilayer Structure
- •C. Solvation of the Lipid Polar Groups
- •IV. MOLECULAR DYNAMICS IN MEMBRANES
- •A. Overview of Dynamic Processes in Membranes
- •B. Qualitative Picture on the 100 ps Time Scale
- •C. Incoherent Neutron Scattering Measurements of Lipid Dynamics
- •F. Hydrocarbon Chain Dynamics
- •ACKNOWLEDGMENTS
- •REFERENCES
- •Appendix: Useful Internet Resources
- •B. Molecular Modeling and Simulation Packages
- •Index
Computer Simulation with QM–MM Methods |
227 |
semiempirical Hamiltonian. The protocol used there for evaluating the QM–MM interaction at the quantum/classical boundary is to perform a quantum calculation, with a link atom, using the equations shown in the previous section. Once a stable electron density has been determined, the contribution from the link atom to the QM–MM energy is removed. In the case of ab initio or density functional implementations the link atom is treated as a proper quantum atom and, as such, feels the charges from atoms in the classical region. [10]. The charges on the classical functional group or atom that is replaced by the link atom do not contribute to the one-electron integrals in the self-consistent calculation for the quantum region. For CH3CH2OH with OH treated by quantum mechanics and CH3CH2 treated by molecular mechanics as an example, the charges on the CH2 group are not included in the QM Hamiltonian; all other partial charges on classical atoms (i.e.,
˚
from the CH3 group) are included. The link atom is initially positioned 1 A along the original bond but is not constrained during the simulations.
The location of the quantum/classical boundary across a covalent bond also has implications for the energy terms evaluated in the EMM term. Classical energy terms that involve only quantum atoms are not evaluated. These are accounted for by the quantum Hamiltonian. Classical energy terms that include at least one classical atom are evaluated. Referring to Figure 2, the Cα ECβ bond term; the NECα ECβ, CECα ECβ, Hα E Cα ECβ, Cα ECβ EHβ1, and Cα ECβ EHβ2 angle terms; and the proper dihedral terms involving a classical atom are all included.
An alternative approach to the link atom method is to use the frozen orbital approach developed by Rivail and coworkers [56] (the local self-consistent field, LSCF). The continuity of the electron density at the boundary region is maintained by a frozen orbital along the bond between the quantum and classical atoms. This frozen orbital is derived from calculaitons on model compounds, with the assumption that the orbitals from model compounds are transferable to the enzymatic system. A more generalized form of this implementation was presented by Gao et al. [57], in which a set of hybrid orbitals are used at the boundary region [this method is termed the generalized hybrid orbital (GHO) method by the authors]. The set is divided into auxiliary and active orbitals and acts as a basis set for the boundary atoms of the MM fragment. The active orbitals are optimized along with other orbitals on the QM atoms in the SCF cycle. In essence this method is an expansion of the approach of Rivail but has the advantage that the oribitals do not need to be parametrized for each specific problem.
There is concern in the field about current approaches for treating the boundary region in QM–MM calculations. The original implementation of the link atom aproach [9] results in arbitrary charge polarization in the QM region and the development of unrealistic partial charges on the link atom. The semiempirical implementation of the LSCF and GHO methods appears to improve the treatment of the boundary region compared to similar calculations using the link atom approach. However, the implementation of the link atom method for ab initio and density functional based QM–MM methods does not appear to introduce unphysical electrostatic perturbations into the QM region, and in a limited number of test cases it has been found to give very reasonable results for geometries, energetics, and charge distributions compared with pure quantum calculations [10].
III. APPLICATIONS
In this section the applications of QM–MM methods to three enzymes are presented with the intention of illustrating the techniques that are employed to simulate enzyme catalysis.
228 |
Lyne and Walsh |
The specific enzymes chosen cover applications involving both semiempirical and ab initio methods in the quantum region.
A. Triosephosphate Isomerase
The chemical reaction catalyzed by triosephosphate isomerase (TIM) was the first application of the QM–MM method in CHARMM to the study of enzyme catalysis [26]. The study calculated an energy pathway for the reaction in the enzyme and decomposed the energetics into specific contributions from each of the residues of the enzyme. TIM catalyzes the interconversion of dihydroxyacetone phosphate (DHAP) and D-glyceraldehyde 3-phosphate (GAP) as part of the glycolytic pathway. Extensive experimental studies have been performed on TIM, and it has been proposed that Glu-165 acts as a base for deprotonation of DHAP and that His-95 acts as an acid to protonate the carbonyl oxygen of DHAP, forming an enediolate (see Fig. 3) [58].
Bash and coworkers [59] used QM–MM methods to follow the reaction path in TIM using a simulation protocol that has been employed in many subsequent simulations of enzyme catalysis with QM–MM methods. First, the initial coordinates used for the calculations came from a high resolution X-ray crystal structure [59]. This is an important point, because the validity of the results of a simulation of this nature on an enzyme is questionable in the absence of an accurate structure from crystallographic studies. The
˚
authors then chose to include only residues with a 16 A radius sphere of the active site, with the remainder of the enzyme deleted. This saves on computation time while maintaining the bulk of the protein around the active site. In any given study of enzyme catalysis it is important to evaluate the effect on the results of different sized model systems (i.e., determine the radius of the sphere to include the study) in order to prevent any size dependence from being introduced to the study. Subsequently, Bash and coworkers surrounded the truncated protein with a sphere of preequilibrated water molecules and applied stochastic boundary conditions to the system to mimic the effects of bulk water. The reaction pathway was mapped adiabatically by fixing one degree of freedom corresponding to the reaction coordinate and fully minimizing all other degrees of freedom. The quantum region was chosen to include the substrate and the side chains of Glu-165 and His-95, with link atoms used at the quantum/classical boundary. The AM1 Hamiltonian was used for the quantum region, and the CHARMM force field for the classical region.
The results supported the proposal of Glu-165 as the general base and suggested the novel possibility of neutral histidine acting as an acid, contrary to the expectation that His-95 was protonated [26,58]. The conclusion that the catalytic His-95 is neutral has been confirmed by NMR spectroscopy [60]. The selection of neutral imidazole as the general acid catalyst has been discussed in terms of achieving a pKa balance with the weakly acidic intermediate. This avoids the ‘‘thermodynamic trap’’ that would result from a too stable enediol intermediate, produced by reaction with the more acidic imidazolium [58].
Using the QM–MM method it is also possible to quantify the effects of individual residues on the reaction occurring at the active site. By performing a perturbative analysis involving sequential deletion of residues followed by a QM–MM interaction energy calculation, starting with the residue farthest from the active site and working in, an energy decomposition profile is generated [26]. This illustrates the effect that long-range electro-
˚
statics can have on the active site, with residues up to 14 A away having an effect on the energy of the system in the active site. The most important contribution was found for
Methods MM–QM with Simulation Computer
Figure 3 |
A possible mechanism for the isomerization of dihydroxyacetone phosphate (DHAP) to D-glyceraldehyde 3-phosphate (GAP) by the enzyme |
triosephosphate isomerase (TIM). The general acid (Glu-165) and general base (His-95) are shown. |
229
230 |
Lyne and Walsh |
Lys-12, which lies close to the active site, and the role of this residue is the stabilization of the enediolate. This explicitly explains the inactivity of a Lys-12 to Met mutation [61].
The QM–MM study of TIM was the first illustration of the potential of these methods for studying enzyme catalysis and has served as a reference for the protocol needed for subsequent studies of enzyme reactions.
B. Bovine Protein Tyrosine Phosphate
Recently, the study of the reaction catalyzed by bovine protein tyrosine phosphate (BPTP) using QM–MM methods was reported [30]. This study represents a progression from the techniques applied to the study of TIM and other enzymatic systems, because the reaction was followed by using molecular dynamics techniques and the QM–MM potential (QM– MD). QM–MD studies are a more powerful technique for studying chemical reactions in condensed phases because they allow for sampling of configuration space as the reaction pathway is followed and the generation of statistics that can be used to calculate reaction activation parameters such as the enthalpy and entropy of activation of various steps along the reaction pathway.
The use of QM–MD as opposed to QM–MM minimization techniques is computationally intensive and thus precluded the use of an ab initio or density functional method for the quantum region. This study was performed with an AM1 Hamiltonian, and the first step of the dephosphorylation reaction was studied (see Fig. 4). Because of the important role that phosphorus has in biological systems [62], phosphatase reactions have been studied extensively [63]. From experimental data it is believed that Cys-12 and Asp-129 residues are involved in the first step of the dephosphorylation reaction of BPTP [64,65]. Alahambra et al. [30] included the side chains of the phosphorylated tyrosine, Cys-12, and Asp-129 in the quantum region, with link atoms used at the quantum/classical bound-
˚
aries. In this study the protein was not truncated and was surrounded with a 24 A radius sphere of water molecules. Stochastic boundary methods were applied [66].
The reaction pathway in the enzyme was calculated by using the method of umbrella sampling, which has been widely used in the study of chemical reactions in solution [7,19]. The simulations were performed in 10–12 overlapping regions to cover the entire reaction coordinate, with 25 ps equilibration and 20 ps sampling in each window. A sampling time of 20 ps is rather short for fully sampling conformational space at each stage on the reaction coordinate. Nonetheless it represents a considerable improvement in simulation compared to simply minimizing the system at each point on the reaction pathway.
The study found that the transition state structure for the initial step of the dephosphorylation step is preferentially stabilized over the ground state through a Walden-inver- sion-enforced hydrogen-bonding mechanism at the active site [30]. It also suggested that a dianionic substrate is preferred in the reaction over a monoanionic mechanism, because the latter involves the breakage of a hydrogen bond between the nucleophile and the phosphoryl group, which causes the overall energy barrier to be raised.
This study is particularly noteworthy in the evolution of QM–MM studies of enzyme reactions in that a number of technical features have enhanced the accuracy of the technique. First, the authors explicitly optimized the semiempirical parameters for this specific reaction based on extensive studies of model reactions. This approach had also been used with considerable success in QM–MM simultation of the proton transfer between methanol and imidazole in solution.
Second, molecular dynamics techniques were employed that allowed the accurate