Hydrogen bonding properties of water molecules, which are confined in microcavities of biological interfaces, are significantly different from those of bulk water and drive most of the complex biological processes. While NMR, X-ray and UV–vis-IR spectroscopic techniques have been found inadequate for describing the dynamics of the thick (20–40 Å) sheath of hydration layer around biomolecules, recently developed THz spectroscopy has emerged as a powerful technique to directly probe the collective dynamics of hydrogen bonds in the hydration layer, which control all important functions of the biomolecules in life. Both laser and accelerator-based THz sources are intense enough to penetrate up to about 100 μm thick water samples, which makes THz transmission and/or dielectric relaxation measurements possible in aqueous solutions. These measurements provide valuable information about the rattling and rotational motions of hydrated ions, making, breaking and rearrangement of hydrogen bonds in hydration layer as well as hydrophilic and hydrophobic interactions between biomolecule and water. THz spectroscopy has also been successfully applied to study the effect of modulation of the physical conditions, like temperature, pH, concentration of proteins and chemical additives, on the structure and dynamics of hydration layer. THz spectroscopy has also been applied to study the processes of denaturation, unfolding and aggregation of biomolecules.
- Hydration Layer
- Hydrogen bond reorganization dynamics
- THz spectroscopy
- Protein aggregation
Water molecules, which reside on the surfaces of proteins or lipid bilayers or in tissues and cells, exhibit properties that are significantly different from those found in pure or bulk water as water molecules in such biological systems face additional interactions [1, 2, 3, 4, 5, 6, 7]. Unique properties of such water molecules, which are confined in microenvironments of biological surfaces or interfaces are popularly termed as ‘biological water’. This water drives many biological processes, in which it plays wide varieties of roles at different levels of complexity making its participation increasingly evident as an active agent and not simply as the spectator solvent [8, 9, 10, 11, 12]. Biological water differs from bulk water in a number of ways. First, clustering of the water molecules at the surface of a protein increases the local density by as much as 25% compared to that in bulk water [13, 14]. MD simulation by Smith et al. has revealed that about half of this density increase arises from the shortening of the average water – water distances and the other half from an increase in the coordination number . Second, replacement of the water–water hydrogen bonds by the water–protein hydrogen bonds in the protein hydration layer lowers the freezing point of the hydration water (i.e. prevents formation of ice). The hydration layer of many proteins does not freeze even at sub-zero temperatures and thus life may sustain even at low temperatures [16, 17, 18]. Third, in bulk water, mutual polarization of the hydrogen bonded water molecules increases the dipole moment and dielectric constant. Such polarization is absent for water molecules in hydration layers of biological molecules, i.e. biological water is less polar than bulk water . Further, measurements of water dynamics suggest that around 10–25% of water molecules in cells have slower reorientation dynamics, by around an order of magnitude, than those in the bulk [2, 20]. Centrality of liquid water to life can be easily understood from the fact that condition for the search for the possibility of life elsewhere is the presence of trace of water . In spite of this status, role of water in sustaining life is still not understood perfectly . It is now, however, clear that water plays an active role in the life of the cell over many scales of time and distance and exhibits diverse structural and dynamical roles in molecular cell biology [3, 22].
In liquid (or bulk) water, the molecules form a tetrahedrally coordinated motif, which is the building block of ephemeral six-membered (ice-like) or five-membered (clathrate-like) ring structure, consisting of fluctuating network of hydrogen bonds, but each bond has an average lifetime of about a picosecond . Thermodynamics of hydration in water are generally governed by a balance between the enthalpic and entropic consequences, namely the enthalpies of water–water and water–solute interactions (hydrogen bonding, electrostatic, and van der Waals) and the entropies of disrupting the relatively ordered hydrogen-bonded networks of bulk water and forming new hydrogen bonds to suit the geometric factors of the biological interfaces [15, 24, 25].
Similarities in water dynamics in hydration shells of various proteins  suggest that the dynamics are determined by rather general features of surface chemistry and topology, which induce excluded volume effects and hinder the approach of new hydrogen bond acceptors within the hydration network. There is now universal recognition that the dynamical behavior of biological macromolecules cannot be decoupled from that of water and the dynamical behavior of the hydration shell largely controls the chemical function of the biological molecules [27, 28].
Since the development of this understanding that biomolecules are surrounded by a sheath of hydration water, which takes active part in all of their normal activities, extensive efforts have been made to perceive the detailed structure and dynamics of the hydration layer using a variety of spectroscopic methods, such as X-ray and neutron scattering [14, 29, 30], NMR [31, 32, 33], second harmonic generation [34, 35] and ultrafast fluorescence and IR spectroscopies [1, 2, 22, 26, 36, 37], assisted by ab initio and molecular dynamics simulations [24, 25, 38, 39]. However, these measurements could provide information about the dynamics of water and the biomolecules at a low-hydration level corresponding to the first layer of hydration water only. Since fluctuations of protein and solvent dynamics take place over a wide range of length scales (in the range of a few nm) and timescales (from milliseconds to picoseconds) influencing several aspects of protein function, no single technique can span so many spatial and temporal orders of magnitude. Therefore, there has obviously been a debate about how to reconcile the results of different experimental methods that explore dynamics [1, 2, 22, 40, 41].
Recent realization that low-frequency and large-amplitude modes of water molecules in the hydration shell are particularly important in controlling the conformational changes that dominate protein function, has led to using the terahertz (THz) spectroscopy to observe water dynamics around biological molecules [42, 43]. THz radiation (1 THz = 1012 Hz = 33.33 cm−1) excites the low frequency vibrations of the solvated protein and directly probes the collective hydrogen bond dynamics of the coupled protein-water system in (sub-) ps time domain. Librational, translational and intermolecular, collective motions of hydration water, as well as large amplitude motions of biomolecules, also occur in similar time scale (see Figure 1). In frequency domain, this corresponds to low-frequency modes in the 1–10 THz frequency region. In spite of the fact that water molecules strongly attenuate the THz radiation in this frequency region, laser or accelerator-based THz sources (vide Section 2) are now powerful enough to penetrate water layers. Modern THz spectroscopic techniques can provide valuable information about water dynamics in the frequency region of the electromagnetic spectrum (0.3–20 THz or 10 to 600 cm−1, the so-called “terahertz gap” between the dielectric and the infrared regimes). This has offered a unique view of the hydration water in fully solvated biomolecules.
2. THz transmission or absorption spectroscopy
Two kinds of THz spectroscopy techniques, namely THz transmission and THz time-domain spectroscopy (THz-TDS), have extensively been used to record the THz absorption spectra of biological materials. THz transmission spectroscopy is a simple single or dual beam steady state spectrometer using THz radiation beam to estimate the THz absorption coefficient. THz transmission or absorption spectroscopy of fully solvated biomolecules in water yields direct information on the global dynamical correlations among solvent (water) molecules. However, application of THz spectroscopy to study solvated biomolecules was not possible because of huge absorption of water in the THz frequency range. This problem was solved by the development of the p-Ge laser, which was a strong the THz emitter [44, 45, 46, 47]. Prior to this development, weak radiation power of standard sources like a globar or an arc lamp in the far IR and THz region was the cause for poor signal to noise ratio and the measurements were limited to powders or hydrated films of biomolecules, but not in their native aqueous environment. Free-electron lasers [48, 49] and synchrotrons , which are also sources of high power far-IR and THz pulses, have also been reported to be used for spectroscopy studies of biomolecules but are not easily accessible. THz transmission spectroscopy of water in biological systems received a momentum after the discovery of the p-Ge laser. This laser is a powerful table-top THz source (up to about 400 W) and has the tunability in the 80–100 cm−1 (2.4-3.3 THz) frequency region. Development of a table top dual beam (reference and sample arms) THz transmission spectrometer ensured accurate measurements of absorption of water of thickness as large as 100 μm .
Accelerators delivering ultrashort pulses of relativistic electrons have been widely used as a source of high intensity THz radiation. Free electron lasers and synchrotrons are actually seeded by short duration electron pulses. An easier method of generating high intensity and tuneable THz radiation using an ultrafast electron accelerator is the coherent transition radiation (CTR). CTR occurs when a charged particle passes through an interface between media with different dielectric constants [52, 53, 54, 55]. Sudden change in the dielectric constants along the electron’s path causes a discontinuity in the electric field at the interface and this discontinuity readjusts itself as radiation spreading out from the point where the electron passes through the discontinuity. The angular spectral energy density of the CTR depends on the dielectric constants of the two media [56, 57]. Since the dielectric constants of aluminum and gold are much more than that of vacuum in far-IR region, these two metals are frequently used as perfect conductors. Direction of propagation and the intensity distribution of the CTR with respect to the angle of incidence of the electron pulse on the target has been discussed in detail in Ref. .
At wavelengths shorter than the bunch length, the emitted radiation field is incoherent and total intensity is proportional to the number of electrons (N). But at wavelengths longer than the electron bunch length, the radiation emitted from the bunch is coherent. With a typical number of electrons per bunch on the order of 108–109, the coherent radiation intensity greatly exceeds that of incoherent radiation. Especially, at long wavelengths compared to the bunch length, the radiation intensity is proportional to the square of the number of electrons in the bunch. Therefore, it is possible to generate coherent radiation in the far-IR or THz spectral range from short electron bunches with bunch lengths of hundred micron or less. Moreover, the shorter the bunch length, the broader is the radiation spectrum that can be generated. With larger number of electrons in the bunch, the total intensity (both coherent and incoherent components) of the CTR at frequency ω is given by,
In this equation, contribution of the coherent component, Icoherent(ω), is given by,
and Ie(ω) is the radiation intensity generated from one electron in the bunch and Eqs. (1) and (2) have been derived assuming that all electrons in the bunch have the same energy.
In THz transmission measurements at AIST, Tsukuba, Japan, a femtosecond linear accelerator system, which delivered electron pulses of about 300 fs duration, and electron energy of 42 MeV and ∼ 100 pC charge (∼108 electrons per one macro-pulse) , was used. Electron beam was directed to hit a thin gold foil (thickness is about 500 μm) at an incidence angle of 45°. CTR, which contained the component of THz radiation, was collected in the direction perpendicular to the electron beam using a gold coated parabolic mirror and the intensity of the THz radiation was measured using a Schottky diode detector . Intensity of the THz pulse generated by this method was of high intensity (about 100 nJ/micropulse and covered a wide frequency range (0.3–2.5 THz), which was practically determined by the frequency response of the Schottky diode detector. Band pass filters (Tydex, Russia) were used to select a band of THz frequencies for estimation of absorption of protein solutions. Absorbance of the solutions of different path lengths were measured. Samples were taken in Bruker liquid sample cells, path lengths of which were varied using Teflon spacers of required thicknesses. Fused silica windows having thickness of 1.5 mm were used in the Bruker sample cell. Thicknesses of the Teflon spacers used were in the range of 20 to 150 μm. Absorption coefficient value (in cm−1) of the solution was estimated from the slope of the linear plot of absorbance vs. path length of the cell (Eq. (4)) . Experimental arrangement used here corresponded to that of a single beam absorption spectrometer.
Here, I0 and Id represent the beam intensities in the absence (i.e. using the blank cell) and presence of the sample solution, respectively. αsol represents the absorption coefficient of the sample solution, d is the path length or thickness of the sample.
The absorption coefficient of the sample, α, is defined by,
(cm−1) is the frequency of THz radiation,
The real part describes the refractive index of the sample, while the imaginary part
Presently, the THz-TDS in the 0.1–10 THz (3.33 cm−1– 333 cm−1) region is the most popular technique, which has been extensively applied in the fields of research in chemistry, materials science, physics, engineering, medicine as well as in industry [60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74]. THz -TDS has also been proved to be a valuable technique for investigation of low frequency dielectric relaxation and vibrational spectroscopy of hydrogen bonded liquids, such as water, alcohols, and others [75, 76, 77]. While various configurations of the THz time-domain spectrometer have been used depending on the kind of applications, we will describe here the general principle of the technique and the spectrometer configuration used for investigation of biomolecules.
The core principle of the THz-TDS technique, which uses a short duration (say a few tens of femtosecond) pulses of THz radiation, is measurement of the transient electric field associated with the THz pulse rather than variation of intensity of the frequency components of the THz pulse. However, since the response or the rise times of the electronic components and detectors are slower than a few ps, it is the normal practice to use the pump-and-probe kind of optical configurations in ultrafast spectrometers in order to overcome the slow response of the detectors. To achieve sub-picosecond time resolution in THz-TDS, the ultrashort NIR optical pulse (typically shorter than 100 fs) is beam-split along two paths to generate pump and probe pulses and to detect the time dependent THz field using a time-delay stage. Schematic diagram presented in Figure 2 represents the principle of a THz- time-domain spectrometer.
The pump pulse, which contains the major amount of energy (more than 70% of the total energy of the laser beam), is used for generating broadband pulses of THz radiation using a photoconductive antenna consisting of a low temperature grown GaAs or InGaAs film covered with metallic contacts for application of bias voltage, or by optical rectification in ZnTe(110) crystal. The pulsed beam of THz radiation is collimated and focused onto the sample using gold coated parabolic mirrors. After transmission through the sample, the THz beam is recollimated and refocused on another photoconductive antenna or ZnTe(110) crystal, which works as the THz detector. Recently, ZnTe (110) crystal are being used extensively both as THz emitter as well as in detection because of possibility of its application in much wider THz frequency region as compared to that has been possible using photoconductive antenna.
The probe beam is used to gate the detector and measure the instantaneous THz electric field using the method of electro-optic sampling. A high precision linear motion stage is used to delay the probe pulses to arrive at detector crystal with respect to the pump pulses. The THz pulse and the gate (NIR) pulse are propagated collinearly through the ZnTe crystal. The THz pulse induces a birefringence in ZnTe crystal, which is read out by a linearly polarized gate pulse. When both the gate pulse and the THz pulse are present in the crystal at the same time, the polarization of the gate pulse will be rotated by the THz pulse depending on the strength of the electric field of the THz pulse. Using a λ/4 waveplate and a beam splitting polarizer together with a set of balanced photodiodes, the THz pulse amplitude is mapped by monitoring the gate pulse polarization rotation after the ZnTe crystal at various delay times with respect to the THz pulse (Figure 2(a)). A Fourier transform is then used to convert this time domain electrical signal to a frequency domain spectrum (Figure 2(b)). The optical path from THz generation to THz detection is purged with dry nitrogen gas to avoid the effect of the humidity.
Dielectric relaxation measurement is a sound method to investigate intermolecular interactions and is capable of monitoring cooperative processes at the molecular level [78, 79, 80]. This method is appropriate to monitor molecular motions in widely varied time scales and has been used extensively to study the structure, dynamics, and macroscopic behavior of complex systems . A brief discussion (more details may be found elsewhere ) on the basic principle of dielectric polarization is presented below.
If a static electric field of strength, is applied to an isotropic and uniform dielectric material of dielectric susceptibility χ, then the macroscopic polarization can be expressed as,
where ε0 is the dielectric permittivity of the vacuum. Applying the macroscopic Maxwell theory, the electric displacement (electric induction) vector, can be written as
Now, in the linear regime, the relative dielectric permittivity (or
There are different types of polarization mechanisms by an applied electric field in different frequency regions: i)
The orientation polarization is given by the dipole density due to the dipoles d. If we consider a sphere with volume V comprised of dipoles, we can write,
where is the dielectric response function, . εs and ε∞ are the dielectric permittivity at low and high frequency, respectively. The frequency dependent complex permittivity is connected to the above relaxation function through Laplace transformation [82, 83, 84],
where is the operator of the Laplace transformation, which is defined for the arbitrary time dependent function
where and .
Eq. (17) gives equivalent information on dielectric relaxation properties of the sample both in frequency domain and time domain measurements. Therefore, the dielectric response might be measured experimentally as a function of either frequency or time, providing data in the form of a dielectric spectrum ε*(ω) or the macroscopic relaxation function
This is known as the Debye model for frequency dependent dielectric permittivity. According to the Debye model, the complex frequency dependent dielectric response  can be described as:
where, is the permittivity in free space (= 8.854 x 10−12 F/m), is the angular frequency, is the relaxation time for the
3. Structure and dynamics of hydration layer in interfaces of small molecules, proteins, peptides, nucleic acids and membranes
The dynamics of water around small as well as complex molecules changes owing to their specific interaction with the solute surface; the specific nature of the interaction could mostly be electrostatic or hydrogen bonding. Such interaction expectedly ruptures/modifies the tetrahedral water network structure as well as its dynamics, a clear imprint of that gets reflected in the THz frequency window. Measurement of optical parameters α(ν) and n(ν) enables one to determine the change in water dynamics at the vicinity of (bio)surface and consequently one can infer on the changes of the corresponding solute also.
Metal ions are perhaps the simplest solutes that can induce perturbation in water dynamics, the interaction being mostly electrostatic in nature. A systematic investigation using alkali monovalent cations  and alkaline earth bivalent cations  in aqueous solutions have been put forward by the group of Martina Havenith using far-IR and THz FTIR measurements coupled with classical MD simulation studies. Their study concluded that ion rattling motions can account for the observed changes in the THz absorption. They have shown that the spectrum of the salt solutions can be approximated by a linear superposition of concentration weighted neat water and ion contributions. Ab initio MD simulation study by Marx et al.  has successfully reproduced the spectral responses of the solvation shell around the ions in infrared and THz frequency range. Their study has shown that the solute-solvent dipolar couplings and the dipole–dipole correlations are the important factors that govern the absorption features in this frequency regions. The group of Martina Havenith has subsequently put forward a series of studies on the hydration dynamics of heavy metal ions, e.g. lithium , manganese , iron  and ytterbium . An elaborate description of the process involved has recently been put forward by this group . The overall THz effect of the ions could be apprehended in terms of the contributions from various modes as depicted in Figure 4.
In a systematic study using THz-TDS in the frequency region (0.3–2.1 THz; 10–70 cm−1), experimental evidences for the ultrafast collective hydrogen bond dynamics of water in the extended hydration layers of alkali metal chlorides have been obtained . The real and imaginary part of the permittivity (ε), as obtained from the THz-TDS measurements was fitted using a triple Debye relaxation model. The time scales obtained for bulk water are of the order of ∼8–9 ps (τ1), 200 fs (τ2) and 80 fs (τ3). The ∼9 ps and ∼ 200 fs timescales are due to the well-known cooperative rearrangement of the H-bonded network structure and the small angular rotational modes of individual polar water molecules, respectively. It has been reported that (τ1) decreases with increasing salt concentration, which identifies an acceleration of the cooperative hydrogen bond dynamics affirming a positive support towards the most debated notion of these ions to act as water structure breakers (see Figure 5(a)). The extent of this effect has been found to be mostly ion specific, K+ being the most effective ion and a simple consideration of ionic charge density is insufficient to account for the observed changes. Very recently, Havenith and Marx  have put forward a combined experimental and simulation investigation to provide a detailed mechanistic analyses based on cross-correlation analysis (CCA) technique to understand the minute details of water-solute interactions.
3.2 Complex ions and small molecules
While the interaction of metal ions with water is more straight-forward, complexity arises when an ion is associated with a hydrophobic moiety. Such molecules are often of biological importance and therefore their interaction with water is essential to establish. There have been several reports on the THz studies of sucrose and saccharides [95, 96, 97, 98, 99]. In a pioneering work Heyden et al.  have described the nature of hydration around small carbohydrate molecules and a detailed simulation study has revealed that the extent of the hydration shell and their absorption co-efficient change with the solute specificity. One such molecule is urea, which is well known to be a protein denaturant, however, the exact molecular mechanism of the processes involved still remains a strongly debatable issue, specially the argument of whether the interaction is water mediated (the water structure breakers notion of urea) or a direct interaction of urea with protein surface.
An elaborate attempt has been made to understand the effect of urea (and its derivatives) on the ultrafast solvation dynamics of water using a Debye type dielectric relaxation model in the 0.3–2.0 THz frequency region using THz-TDS technique . The relaxation dynamics shows considerable acceleration beyond a threshold concentration. Such acceleration is possibly associated with a disruption of the tetrahedral water network structure. It seems also intriguing that the observed collapse occurs at a certain urea concentration (see Figure 5(b)), which strikingly coincides with the denaturation concentration of urea for many proteins. Another molecule of such biological interest is guanidinium chloride (GdmCl), which offers remarkable protein denaturation ability beyond a certain threshold concentration (∼2–3 M) [102, 103]. Like in the case of urea, protein denaturation mechanism of GdmCl has also been debatable concerning a direct or indirect interaction. We have investigated the collective hydrogen bond dynamics around GdmCl in aqueous solutions as well as in the presence of a model globular protein human serum albumin (HSA) using the THz-TDS technique. It is found that the relaxation dynamics gets faster which renders support to the previously speculated notion that GdmCl acts as a water structure breaker. A similar but more prominent trend is observed in case of NaCl, which, however, does not interact with proteins.
The change in hydration dynamics in presence of HSA has been found to be in complete contrast in these two salts unambiguously pointing out towards a possibility of the collective hydration dynamics to share a pivotal contribution in the protein unfolding phenomenon. These studies seem to conjecture that ions with complex hydrophobic moiety acts as water structure breaker, and this very nature of such molecules makes them protein denaturant. To investigate the validity of such conjecture we investigated the ultrafast (sub-ps to ps) collective hydrogen bond dynamics of water in the extended hydration layers in a series of alkylammonium chloride salts using THz -TDS technique . We found these salts to transform from being a water ‘structure breaker’ to ‘structure maker’ with increasing carbon content. For example, the THz-TDS measurements reveal that ammonium chloride (AC) acts as a water network structure breaker while tri-ethyl ammonium chloride (EAC) and higher carbon containing salts are distinct water structure makers. The change in protein hydration is also found to trace its secondary structure rupture and the exposure of hydrophobic moieties accordingly changes the protein hydration. Our study strongly concludes that it is the hydrophobic effect, at least in the case of this type of salts, that plays the decisive role in determining their interaction with biomolecules.
3.3 Amino acids
Amino acids, which are the building blocks of proteins, often act as intermediates in metabolism as well as osmolytes that can stabilize proteins. Depending upon the ‘side group’, the amino acids are classified as hydrophilic or hydrophobic. Hydrophobicity of amino acids is believed to be a key parameter that regulates phenomena like protein folding - unfolding, aggregation, activity, protein-ligand binding and protein hydration in aqueous environments. While to analyze the solvation of hydrophobic and hydrophilic parts of a protein separately, one needs to take into consideration that the environment of amino acid residues is heterogeneous in nature as they are often composed of hydrophobic alkyl chains and hydrophilic groups. It is therefore essential to study the hydration of amino acids of various side chains in the exposure to solvents. Niehues et al.  have studied the hydration of a series of amino acids in ∼2.4 THz window and observed that the THz absorption coefficient, α(νTHz), of hydrated amino acids can be correlated with the hydrophobicity and the fraction of polar volume of amino acids. They have shown that glycine has the largest positive THz slope followed by serine, whereas for the other amino acids, the slope becomes gradually negative with increasing hydrophobicity. In another study , the same group, by analyzing the corresponding THz spectrum in terms of the correlated dynamics of solute and solvent molecules, demonstrates the line shape of the low-frequency vibrational response of glycine in water. Recently, they  have shown that hydrophilic solvation of the zwitterionic groups in valine and glycine leads to similar THz responses, which are fully decoupled from the side chain. This result concludes that the hydrophilic groups and their solvation shells dominate the THz absorption difference, while on the same intensity scale, the influence of hydrophobic water can be neglected. Shiraga et al.  concludes a correlation between the extent of H-bonding and the hydrophobicity of the solute. Glycine (Gly) and the L- isomers of five different amino acids: serine (Ser), aspartic acid (Asp), lysine (Lys), arginine (Arg) and tryptophan (Trp) of varying hydrophobicity and solvent accessible surface area (SASA) have been used by Samanta et al.  to study the dielectric relaxation up to their maximum water solubility in the frequency window of GHz-THz at neutral pH. The various rotational dynamics of the solutes and water are obtained by fitting the dielectric data in a multiple Debye relaxation model. From GHz study, we observed that the molecular rotation of amino acids correlates their respective molecular volume. Also, it was concluded that the amino acids do not usually aggregate even at high concentrations. From THz study, the authors found that Gly is a water structure breaker while the other amino acids are structure makers. Consequently, Gly accelerates cooperative hydration, while the others retard (see Figure 5(c) and (d)). It has been established that hydration in amino acids depends both on its hydrophobic as well as its hydrophilic nature of side chains. Born et al.  have studied the hydration dynamics of some small model peptides using THz spectroscopy. They observed that the cooperative hydration dynamics changes as the extent of hydration of such molecules change.
3.4 Alcohols and other solvents
Addition of otherwise indifferent organic solvents can influence marked changes in aqueous environments as well as they can induce misfunction in biologically important molecules e.g. protein, DNA, etc. Water - dimethyl sulfoxide (DMSO) binary mixture has been found to be of potential interest as it plays a significant role in the field of chemistry, physics, biology and pharmacology. Das Mahanta et al. have investigated the change in the collective hydration dynamics in presence of DMSO at different concentrations using THz-TDS. We found that α(ν) of the mixed solvents shows a non-linear change with XDMSO . This change has been correlated with the water-DMSO structural heterogeneity in the mixed solvents. Luong et al. have also studied the evolution of water hydrogen bonded collective network dynamics in water −1,4-dioxane (Dx) mixtures as the mole fraction of water (Xw) increases from 0.005 to 0.54 . The inter- and intra-molecular vibrations of water be observed using THz-TDS in the frequency range 0.4–1.4 THz (13–47 cm−1) and Fourier transform infrared (FTIR) spectroscopy in the far-infrared (30–650 cm−1) regions. From the absorption coefficient measurements, they infer that the mixtures are not ideal in nature, which suggests a significant change in the network by the addition of the solute. The authors found an increase in the collective hydrogen bond network as evidenced from dielectric relaxation studies in which τ1 has been found to be small at low Xw (where Xw is the mole fraction of water in the mixture), but at Xw > 0.1, it increases rapidly to reach a value identical to that in bulk water. It has been concluded that hetero-molecular (water - Dx) hydrogen bond dominates in the water diluted region in water-Dx mixtures, and with progressive addition of water, bulk-like intermolecular three-dimensional hydrogen bonded water network dynamics evolves beyond Xw = 0.1. In a separate study , a combined experimental (mid- and far-infrared FTIR spectroscopy, THz-TDS (0.3–1.6 THz)) and molecular dynamics (MD) simulation technique has been carried out to understand the evolution of the structure and dynamics of water in its binary mixture with 1,2-dimethoxy ethane (DME) over the entire concentration range. Debye relaxation data reveals a non-monotonous behavior in which the collective dynamics is much faster in the low Xw region, whereas in the Xw ∼ 0.8 region, the dynamics gets slower than that of pure water. The concentration dependence of the reorientation times of water, estimated from the MD simulations, also captures this non-monotonous character. Bohm et al.  have demonstrated that the THz response of alcohols can be decomposed into the spectrum of bulk water, tetrahedral hydration water, and more disordered (or interstitial) hydration water. They also concluded that it is not the tetrahedrally ordered component, rather it is the interstitial hydration water which is responsible for the temperature-dependent change in ΔCp and ΔG in such mixtures.
The surface of proteins is extremely heterogeneous owing to the presence of amino acids of varying types of charges. The pioneering studies by the group of Havenith et al. have shown that protein molecules are hydrated and the cooperative dynamics of water changes accordingly. An earlier study using a small protein ubiquitin  shows how the measurement of α(νTHz) reveals a change in the protein hydration dynamics as the authors termed it as the “THz dance”. In a subsequent ever important simulation paper, the same group has established different THz absorption of the hydrophobic and hydrophilic residues of a protein as they interact differently with water . In the later years, this group has put forward substantial contribution on the hydration dynamics of proteins using THz measurements [117, 118] as well as simulation studies [42, 119]. Their studies have unambiguously suggested that fast protein motion and solvent dynamics are correlated with enzymatic reactions . In a recent study this group has established the pivotal role of collective motion of water during an electron transfer reaction between flavoenzyme ferredoxin - NADP+ − reductase and ferredoxin-1 . In a seminal paper, Marklez et al.  have used THz-TDS measurements to show that the protein (hen egg white lysozyme) dynamical transition (the rapid increase in protein dynamics occurring at ∼200 K) needs neither tertiary nor secondary structure. Their results revealed that the temperature dependency essentially arises from the protein side-chain interaction with the solvent. He et al.  have investigated the presence of structural collective motions on a picosecond timescale for the heme protein, cytochrome C, as a function of oxidation and hydration, using THz-TDS and molecular dynamics simulations. Marklez group has developed a novel measurement technique (anisotropy THz microscopy)  wherein they were able to detect the long range protein vibration modes in chicken egg white lysozyme single crystals. They found the underdamped modes to exist for frequencies >10 cm−1. Such underdamped vibrational modes have also been identified using optical Kerr effect measurements in the THz frequency window . In a recent study, Niessen et al.  have used anisotropy THz microscopy, which is found to be sensitive to inhibitor binding and unique vibrational spectra for several proteins and an RNA G-quadruplex. There have been other reports from several experimental groups: Sun et al.  have reported the application of a new machine learning methods for quantitative characterization of bovine serum albumin (BSA) deposited thin-films detected by THz-TDS. The group of Emma Pickwell-MacPherson  have used THz spectroscopy to study the hydration shell formation around H9 subtype influenza A virus’s HA protein (H9 HA). They have also detected antigen binding of H9 HA with the broadly neutralizing monoclonal antibody. They observed a remarkable concentration dependent nonlinear response of the H9 HA, which reveals the formation process of the hydration shell around H9 HA molecules. The same group has also reported the dielectric properties of two different antibodies in water-glycerol mixtures using THz-TDS measurements . Sun et al.  used THZ-TDS to investigate the molecular processes involved above and below the transition temperature (TD) for GP2 peptide.
While much research has been focused on proteins, relatively less attention has been paid on the other biologically important molecule, DNA. There had been a few preliminary studies to understand the vibronic bands in the THz frequency region using single and double stranded DNAs and RNAs [131, 132, 133]. Arora et al.  have presented a label free quantitative detection method for DNA samples amplified by polymerase chain reaction (PCR) in aqueous medium using THz-TDS in the frequency range from 0.3 to 1.2 THz. Tang et al.  have recently investigated the feasibility of THz spectroscopy combined with microstructures for marker-free detection of DNA and oligonucleotides. Polley et al. have investigated the collective dynamics of two DNA molecules extracted from salmon sperm and calf thymus and observed that the dynamics did not differ much at the concentration range of the experiments .
3.7 Lipid membrane
There have been only limited studies on the THz studies on lipid membranes and/or vesicles. One of the preliminary results was due to Tielrooij et al.  who had studied the dielectric relaxation in mixed system of hydrated DOPC lipid bilayers in the THz frequency domain. They could identify three distinct water types: fast, bulk and irrotational. The relative content of those change with the extent of hydration. Later, Yamamoto et al.  have studied the temperature and hydration dependent low frequency spectra of lipid bilayers of 1,2-dimyristoyl-sn-glycero-3-phosphoryl-3′- rac-glycerol (DMPG) and 1,2-dimyristoyl-sn-glycero-3-phosphocholine (DMPC) using THz-TDS. They found that the THz absorption patterns reflect the lipid packing pattern in the bilayers. They subsequently extended their investigations towards purple membrane (PM, a complex of lipids and a membrane protein, bacteriorhodopsin)  and lipid bilayer of DMPC . Pal et al. have recently studied the microstructure and collective dynamics of the membrane interfacial hydration shell in zwitterionic and negatively charged phospholipid membrane bilayers using THz-TDS . They observed a dependence of the critical lipid concentration corresponding to the inflection point on the charge of the lipid head-group, thereby implicating membrane electrostatics as a major factor in the microstructure and dynamics of water at the membrane interface.
4. Influence of temperature, pH, ionic strength and additives on hydration layer dynamics
Hydration dynamics of biomolecules is significantly influenced upon changing its physical conditions as well as in the presence of additional chemical agents, like alcohols, glycols, etc. Protein molecules undergo various physical and chemical changes, which could induce conformational modifications in their secondary as well as tertiary structures. It can be noted here that as protein structures get disrupted (during unfolding or denaturation) the hydrophobic moieties (amino acid residues), which are otherwise buried inside in the native structure, get exposed, and this produces a definite alteration in THz absorption coefficient, α(νTHz). It therefore suggests that estimation of α(νTHz) provides a direct evidence of the structural perturbation in proteins.
Several experimental techniques are available to determine the structural evolution of proteins during unfolding, while THz provides with the estimation of the associated hydration changes. In a pioneering experimental work, in which a stopped-flow techniques was synchronized with THz-TDS, which the authors termed as Kinetic terahertz absorption (KITA) spectroscopy, Kim et al.  have shown that the pH induced unfolding-refolding kinetics (in real time) of Ub* could easily be traced by the associated α(νTHz) measurements. The THz results very well reconcile with the results obtained from circular dichroism (CD) and fluorescence measurements.
Another important physical environment that induces protein unfolding is temperature. In a study using HSA as a model protein, Mitra and Havenith  have shown that water dynamics associated with the protein during its reversible unfolding pathway up to 55°C as well as its irreversible denaturation pathway up to 70°C traces the protein’s structural rupture pathway. The THz measurements do support the conventional CD and fluorescence measurements. Sudden increase in the environment (like temperature or pressure) for a very short period of time (often termed as T-jump experiments) leads protein molecules to be structurally ruptured but upon removal of the intense pulse, the protein refolds. T-jump experiments have previously been characterized using conventional CD and fluorescence measurements. However, THz measurements was demanded to obtain explicit information of water dynamics. The first report of such experiment was from the group of Havenith , wherein the authors put forward a coupled KITA setup with a T-jump attachment. The authors monitored changes in the THz absorption λ∗6 − 85protein with a time resolution of >50 μs. They reported that the spectral changes are correlated with the hydrophobic collapse of the protein. In a subsequent study from the same group, Wirtz et al.  used an even better time resolution of about 500 ns to reveal the coupled ubiquitin−solvent dynamics in the initial phase of hydrophobic collapse (temperature induced unfolding). They propose that, in the case of ubiquitin, a rapid (∼500 ns) initial phase of the hydrophobic collapse from the elongated protein to a molten globule structure precedes secondary structure formation. Recently there have been a few reports of using THz spectroscopy technique to underline thermal denaturation of BSA , temperature- and pH-dependent protein conformational changes in pepsin A . In a very recent study Cao et al.  have successfully employed THz-TDS to track the hydrolysis of BSA protein by pepsin. The results indicate that protein hydrolysis can be easily monitored over time by focusing on the variation of the absorption coefficient from a macroscopic perspective. The authors explored the use of the Debye model to analyze the dielectric properties of the solution during protein hydrolysis. The results of the Debye analysis prove that it is possible to investigate in detail the microscopic dynamics of bio-macromolecule solutions at the molecular level by THz-TDS.
Samanta et al. have been involved in determining the changes in protein hydration in various distressed environments using the THz-TDS measurements. They have investigated the hydration dynamics around HSA in presence of short chain polyethylene glycols of different chain lengths (PEG 200, PEG 400, and PEG 10000) at different concentrations . FIR-FTIR studies conclude that the protein hydration is affected in a distinct way below and above the critical PEG concentration of 30% (v/v). THz-TDS study unambiguously confirmed a retardation of the solvation dynamics by PEGs. This study clearly shows an independent behavior of protein hydration at low PEG concentrations and a noticeable interaction between protein and PEG hydration beyond a critical PEG concentration. In another study, Das et al. have made an attempt to understand whether the DMSO induced conformation changes in lysozyme conformation perturbs its hydration dynamics . CD study establishes a marked change in the protein tertiary structure in presence of DMSO. The relative change in the THz absorption coefficient (Δα/α0) shows a negative minimum at XDMSO = 0.05 and a positive value at XDMSO = 0.15. The observed minimum is found to be due to the increased size of the protein while the positive value is attributed to the increased SASA and consequent increased hydration of the protein surface. In a recent report, Das et al. put forward an experimental observation of nonmonotonic changes in the collective hydration of BSA in the presence of alcohols of varying carbon-chain lengths, that is, ethanol, 2-propanol, and tert-butyl alcohol (TBA), by using THz – TDS . They observe an anomalous hydration behavior of the protein hydration with the alcohol concentration, which correlates the alcohol-induced α-helix to random coil transition of the protein secondary structure, as revealed by CD spectroscopy measurements. Recently, Das Mahanta have investigated the effect of alkyl-ammonium chloride salts on BSA and found a systematic trend towards disrupting the protein secondary structure . The associated changes in the protein hydration in the presence of these salts have also been investigated using THz-TDS. The change in protein hydration is also found to trace its secondary structure rupture, and the exposure of hydrophobic moieties accordingly change the protein hydration. The THz-TDS measurements strongly conclude that it is the hydrophobic effect, at least in the case of this type of salts, that plays the decisive role in determining their interaction with biomolecules.
5. Role of hydration water in protein aggregation and fibrillation
Aggregated protein is toxic to functioning of living systems and many of human diseases are associated with misfolded protein disorders [152, 153, 154, 155, 156]. This is why understanding of mechanisms of interactions between protein molecules in solutions have been the subject of extensive investigations during the last two decades [157, 158, 159, 160, 161]. Additionally, understanding the protein aggregation propensity may offer novel design principles for producing aggregation-resistant proteins for biotherapeutics.
Globular proteins, e.g. HSA and BSA, in their native states are present in living cells at concentrations as high as about 200 mg mL−1 and bimolecular interactions are significant. We explained earlier that stability of the three - dimensional structure of a monomeric protein molecule in physiological environments is the result of an intricate interplay between electrostatic, hydrophobic, hydrogen bonding and other interactions and any variation of temperature, pH of the medium or any other physiochemical conditions including concentration of protein in the cell may result in imbalance of the stabilization forces leading to misfolding [162, 163], thus triggering aggregation [164, 165, 166, 167, 168].
A native and structurally stable protein molecule is strongly hydrated with a well-defined hydration layer of thickness of about a few tens of Å (20–40 Å) around it [50, 59, 169]. The role of water molecules in the hydration shell could be crucial for the intermolecular interactions and the overall protein hydrophobicity, which may be defined by its hydration free energy, which may play an important role in protein aggregation in aqueous solution [162, 163, 166]. However, the role of hydration water in protein aggregation has largely been unexplored owing to the perception that protein – protein interaction is the major factor and the surrounding water is just a spectator playing no role in aggregation of protein, ignoring water as an active constituent of biological systems. In fact, whether a protein remains soluble or forms aggregation should intrinsically rely on its state of hydration in the monomeric state.
We have described in the earlier sections of this article that how the properties of water molecules in the hydration layer on an average are different from those in the bulk, which determines the stability of a monomer protein in aqueous environment. Protein aggregation proceeds through a multistep process initiated by conformational transitions, called protein misfolding, of monomer species towards aggregation-prone structures. Chong and Ham applied the fluctuating thermodynamic analysis method to understand the variations of the thermodynamic functions, which occur during the course of misfolding and dimerization of the Amylase-β protein. They suggest that the time variation of the solvent-averaged effective energy, F = Eu + Gsolv, describes the protein dynamics on the free energy landscape . Here, the protein potential energy (Eu), comprises both intra and intermonomer contributions and the solvation free energy (Gsolv), represents the interaction of the protein with surrounding water, which plays a critical role in protein aggregation. The free energy, F, decreases as the dimerization proceeds, but the decrease in F has different origins in the approach and structural adjustment regimes. The thermodynamic force driving the approach of two monomers is the decrease in Gsolv and hence, the misfolded monomers acquire a large hydrophobicity, which leads to conformational changes in monomers. This drives two monomers to approach each other to a contact distance. In absence of this thermodynamic driving forces, two negatively charged protein monomers (the total charge of Amylase-β and BSA monomer proteins at neutral pH are −3, and − 16, respectively [170, 171]) would never approach each other by overcoming the electrostatic repulsion. On the other hand, decrease of the protein potential energy Eu, due to direct protein–protein interactions, such as intermonomer van der Waals contacts and hydrogen bonds, drives the structural rearrangement required for formation of compact dimer structure leading to energetic stabilization. Interestingly, structural rearrangements are also associated with an increase in the solvation free energy, which originates from the dehydration of the protein surface and of the interfacial region. On contrast to other spectroscopic techniques, THz spectroscopy probes directly the collective intermolecular vibrations of the hydrogen bond network, and is thus able to detect sensitively solute induced changes in the solvation dynamics. Extensive works on THz absorption of protein solutions have demonstrated that the absorption coefficient of the protein solutions (αsol) are dependent on the concentration of protein [50, 59, 169, 170, 171]. For example, at the low concentration regime (e.g. in the case of HSA, <0.5 x 10−3 mol dm−3) αsol value increases linearly and this has been explained by increasing concentration of hydrated monomer protein molecules since water in the hydration shell has larger αsol value as compared to that of bulk water. However, on further increase of the protein concentration, αsol value starts decreasing.
To delineate this issue in more detail, Manna et al. made a detailed investigation on the concentration dependence of THz absorption of the aqueous buffered solutions of HSA protein (up to 2.6 mM of protein concentration) at three THz frequencies, namely, 0.1, 1.5 and 2 THz . Similar results were obtained from these three measurements. αsol value was expected to change linearly to follow Eqs. (23) and (24), which could be derived assuming that the protein solution is a two-component system.
Here, αpr is the absorption coefficient of the protein, αbw is the absorption coefficient of bulk water, is the volume fraction occupied by protein molecules and is the volume fraction occupied by bulk water. Eq. (24) was derived using several considerations, such as the value of the radius of gyration of HSA is about 3.3 nm , the structure of the monomer HSA protein molecule consists of hydrophobic and hydrophilic pores (2: 1 (v/v) ratio) with porosity factor of about 33% of the volume of the protein molecule .
Therefore, the plots of against HSA concentration, [HSA], as shown in Figure 6 (red lines), represent the two-component system. However, a large deviation of the experimental data points from the line representing two component system suggests inadequacy of the two-component model. This deviation could be explained by considering that the THz absorption coefficients of the water molecules in the hydration layer, (αhl), were different from that of bulk water (αbw) and hence constitute the third component in the aqueous solution of the protein. Hence, Eq. (25), in which Vhl was the total volume fraction of water associated with the protein molecules in a solution with protein concentration of 1 × 10−3 mol dm−3, was derived .
To understand the reasons for significant decrease of THz absorption coefficient of the protein solutions with increasing concentration beyond 6 x 10−4 mol dm−3, the possibility of aggregation of proteins at higher concentration regime was explored. To delineate this aspect, dynamic light scattering (DLS) measurements (Figure 6) were carried out using concentrations of proteins covering the entire range of THz absorption measurements. The DLS data recorded for the solution containing protein concentration of 0.4 x 10−3 mol dm−3 revealed the existence of only monomeric protein molecules with the most probable diameter of about 6–8 nm in the solution. This is quite in good agreement with the diameter of the hydrated monomer protein molecules. However, at higher concentrations (say, >1 x10−3 mol dm−3 of protein), the DLS data revealed two important features. Firstly, the size distribution of the monomer band became wider indicating the presence of particles of diameter in the range 12–15 nm, possibly suggesting formation of dimers or trimers of HSA, in addition to the monomeric species. Secondly, at higher concentrations of the protein, DLS data also revealed the presence of large size aggregates of the most probable diameter of about 700 nm. However, a quantitative estimation of the relative percentages of monomer, dimer and aggregates was not possible from the DLS data because the intensity distribution of the scattered radiation was not directly proportional to the number of the particles. However, this experiment confirmed the presence of protein aggregates in solutions with higher concentrations of protein and the formation of aggregates may possibly be held responsible for nonlinear dependence of THz.
On the other hand, CD measurements confirmed that the tertiary structure of HSA remained unchanged through the entire range of HSA concentrations used for THz measurements. This suggested that the native structure of HSA molecules remained unaltered through the entire range of concentrations of HSA. Therefore, hydration states of proteins, even in the aggregated state, remain unchanged and hence possibly justifies the assumption made in the earlier works regarding overlapping of hydration shells at higher concentrations of proteins.
Patro and Przybycien have simulated the structures of reversible protein aggregates as a function of protein surface characteristics, protein–protein interaction energies and assessed the aggregate properties . Results of their simulation reveals that aggregate particles have the kind of organization of the hydrophobic and hydrophilic domains as they are present in HSA protein monomer molecules and aggregation of protein molecules causes the loss in the total solvent accessible surface area (SAS) is about 67% and the mean solvent content for these aggregates vary in the range of 0.37–0.55 volume fraction depending on the conformation of the monomer protein . Therefore, at higher concentrations of protein, volume fraction of hydration water decreases due to formation of aggregates. In addition, proteins are THz transparent and as we increase the protein concentration, protein aggregates replace the water molecules and leads to lowering of total THz absorbance of the solution.
A method of analysis was adopted to analytically fit the data in the regime of higher concentrations of the protein to predict the relative concentrations of the monomer and aggregated particles. In this analysis, the value of αhl, which was estimated from the linear regime of the plot of vs. [HSA] was used. Assuming that the THz absorption coefficient of the water molecules residing inside the aggregate as well as that constituting the hydration layer around the surface of the aggregate are similar to that constituting the hydration layer around the monomer protein molecule, Eq. (9) was revised to write Eq. (11), which provided a quantitative estimate of the relative numbers of the HSA molecules in the monomeric and aggregated forms in solution.
Here, x is the concentration of monomer in the unit of 1 × 10−3 mol dm−3 and is the volume fraction of hydration water associated with the aggregates, both in the interstitial places as well as outside the aggregate constituting the hydration layer, vhl, is the volume of hydration layer associated with one protein molecule.
Manna et al. estimated the percentage of the number of HSA molecules, which exist as monomer (or dimer or trimer) in solution without being associated with aggregate formation, for each of the HSA concentrations used for THz absorption measurements . We find that for the protein solution with its concentration of 8 x 10−4 mol dm−3, about 40% of the protein molecules exists as monomer (or dimer or trimer) in solution and 60% protein molecules are part of aggregates. The monomer (or dimer or trimer) concentration becomes <1% at the protein concentration of 2.6 x 10−3 mol dm−3, i.e. nearly all the protein molecules are associated with aggregate formation. These calculations suggest that formation of aggregates at higher concentrations of protein may be the responsible factor for the turnover of the THZ absorbance of the protein solution at ∼6 x 10−4 mol dm−3 concentration.
The absorption coefficient of a complex system is the weighted sum of the absorption of its components. However, determination of the complex dielectric function of proteins in solution using THZ-TDS spectroscopy allowed detailed analysis beyond what was possible from simple absorption measurements. Following this approach, Novelli et al. determined both the phase and amplitude of the induced dipole in the volume containing the protein and hydration water . This result revealed that not only the amplitude of the induced dipole varied with the concentration of protein, but also its phase changed above a concentration threshold (Figure 7). They proposed a phenomenological model, which explained that the phase of the induced dipole in the protein-solvent interaction region began to vary when there was significant overlap between hydration layers of the neighboring protein. This result suggested that indirect electromagnetic protein–protein interactions could take place if mediated by the extended hydration layers surrounding each protein.
6. Conclusions and challenges ahead
THz spectroscopy, from its very inception, has mostly been used by scientists studying cosmology, condensed matter physics and materials. The huge absorption of water in this frequency range used to be treated as a drawback of this technique; however, for chemists and biologists this point serves as an advantage since all the biological function is somehow or other related to water dynamics. Due to its inherent sensitivity to water hydrogen bonding dynamics, THz spectroscopy has become an indispensable tool for direct observation of fast and coupled biomolecule-water network. The initial studies by the groups of E. W. Heilweil, P.U. Jepsen, A. Marklez, C. Schmuttenmaer and M. Havenith in the late 1990s and early 2000s have established THz spectroscopy at a concrete platform to be recognized as a potential tool to label free detection of water dynamics in the vicinity of biomolecules, the effect being extended to several layers and remains practically undetected using conventional spectroscopic methods. The last decade has witnessed a huge leap towards exploiting this frequency window in biophysical studies, a few of such results have been depicted in this article. Now that the phenomenon has been established beyond any doubt, new sort of experimental studies, where the role of hydration in ultrafast processes (like electron transfer or proton transfer) could explicitly be determined, is the new challenge to the researchers.