Open access peer-reviewed chapter

QSPR Prediction of Chromatographic Retention Times of Tea Compounds by Bioplastic Evolution

By Francisco Torrens and Gloria Castellano

Submitted: July 27th 2018. Reviewed: September 28th 2018. Published: November 5th 2018.

DOI: 10.5772/intechopen.81735



Abstract

Quantitative structure-property relationships model the ultrahigh-performance liquid chromatographic retention times of tea compounds. Bioplastic evolution is a viewpoint in evolutionary science that combines the effect of acquired characters with the relationships arising between three principles: evolutionary indeterminacy, morphological determination, and natural selection. It is used here to propose the co-ordination index, which describes the retention of tea constituents. For molecules, three properties allow the co-ordination descriptor to be computed: the molar formation enthalpy, molecular weight, and surface area. The effect of different kinds of properties (thermodynamic, steric, geometric, lipophilic, etc.) is examined. The descriptors tested, in linear and quadratic correlations, include the molar formation enthalpy, molecular weight, hydrophobic solvent-accessible surface area, and decimal logarithm of the 1-octanol/water partition coefficient. These descriptors differentiate the molecular structures of tea components. Weak quadratic relationships appear between the partition coefficient, the hydrophobic surface area, and retention. The morphological and co-ordination descriptors complete the correlations.


Keywords

  • biological plastic evolution
  • morphological index
  • co-ordination index
  • formation enthalpy
  • lipophilicity
  • solvent-accessible surface
  • solvation parameter model
  • metabolomics
  • metabolic profiling
  • catechin derivative
  • polyphenol
  • green tea
  • black tea

1. Introduction

Fast separation of complex samples via high-resolution (HR) chromatography and mass spectrometry (MS) must meet the simultaneous need for high sample throughput and high-quality (HQ) data in metabolomics. Hyphenation of ultrahigh-performance liquid chromatography (UHPLC) with maXis ultra-HR time-of-flight (UHR-TOF) MS delivers speed without compromising performance factors, e.g., sensitivity, mass accuracy, and resolution. Black tea (BT) and green tea (GT), Camellia sinensis L. (Theaceae), account for 95% of world tea consumption [1]. Health benefits have been hypothesized for both BT and GT, so understanding their potential health-promoting effects and improving their quality and taste are of interest. In BT production, GT leaf catechins (GTCs), which are (glycosylated) flavan-3-ol flavonoids, are enzymatically oxidized (fermented) to yield a complex mixture of products, e.g., theaflavins (TFs) and thearubigins (TRs). Despite the importance of tea beverages, most of their chemical constituents have not been confirmed because of the complexity of the mixture. The antioxidant activity (AOA) of standard (gallated) GTCs decreases in the order (−)-epigallocatechin (EGC) 3-O-gallate (EGCg) > (+)-gallocatechin (GC) 3-O-gallate (GCg) > (−)-epicatechin (EC) 3-O-gallate (ECg) > EGC > GC > EC > (+)-catechin (C) [2]. The contents of cis-GTCs are the key factors affecting GT AOA. GT, oolong (blue) tea (OT), and BT are unoxidized, semi-oxidized, and oxidized, respectively, during production. Darjeeling tea is sold as BT, but it belongs to the OTs. The degree of oxidation of tea leaves rises in the order GT < OT < BT. GTCs are excellent electron donors (EDs) and effective in vitro traps (scavengers) of physiologically relevant reactive oxygen species (ROSs).

Data generated from BT, GT, and Darjeeling tea extracts were analyzed via UHR-TOF-MS with electrospray ionization (ESI) in negative ion (NI) mode [3]. Mass data and isotopic pattern information in MS and MS-MS spectra enable sum formula generation. Combining the formulae with database (DB) queries facilitates the identification of unknown compounds. Some tea polyphenolic compounds and metabolites penetrate the blood-brain barrier (BBB) into brain regions that mediate cognition. In rats, the trihydroxybenzoic acid glycoside theogallin, or its metabolite quinic acid (a cyclitol, i.e., a cyclic polyol and cyclohexanecarboxylic acid), crossed the BBB and showed cognition-enhancing activities [4]. The effects of flavonoids on the central nervous system (CNS) were reviewed [5]. The flavan-3-ol EC, a flavan derivative, is able to cross the BBB more efficiently than the stilbenoid resveratrol, which is more hydrophilic. Polyphenols entering the brain were reviewed [6]. The potential role of GTCs in the prevention of the metabolic syndrome was re-examined [7]. The clinical evidence of GT effects was discussed [8]. The GTCs and caffeine (Caff), and their synergism in body weight regulation, were reviewed [9]. The antiobesity effects of GTCs were reviewed [10]. The chemistry of low-molecular-weight BT polyphenols [11], and of secondary ones produced during tea processing [12], was re-examined. The content of Caff decreased during GT oxidation [13, 14, 15]. The changes of GT secondary metabolites [14] and the phenolics and quality potential of crush, tear, and curl BT [15] were reported during oxidation. EGCg attenuated lipopolysaccharide (LPS)-induced nitric oxide (nitrogen monoxide, NO) production in cells [16]. The antiviral role of GTCs was reviewed [17]. EGCg was identified as an inhibitor of phosphoglycerate mutase 1 (PGAM1) [18]. Quantitative analysis of GTCs from GT extract in human plasma was performed via UHPLC-MS [19].

The model used here is an extension of the solvent-dependent conformational analysis program (SCAP) from 1-octanol/water to other organic solvents [20]. In earlier publications, SCAP was used to compute the partition coefficients of porphyrins, phthalocyanines, benzobisthiazoles, fullerenes, acetanilides, local anesthetics (procaine analogues) [21], the enzyme lysozyme [22], barbiturates, hydrocarbons (HCs) [23], polystyrene (PS) [24], Fe/S proteins [25], C-nanotubes (CNTs) [26], D-glucopyranoses, polyiodides, polyiodines, and crown ethers [27]. Bioplastic evolution (BPE) and quantitative structure-property relationships (QSPRs) were used for phenylalcohols, 4-alkylanilines [28], aromatics [29], phenylureas [30], pesticides [31], flavonoids [32], isoflavonoids [33], natural sesquiterpene lactones (STLs) [34], coffee chlorogenic acids (CGAs) [35], purine derivative alkaloid methylxanthines (Caff and its metabolites), the alkaloid and predominant nicotine metabolite cotinine [36, 37], and tea leaf infusions [38]. The mucoadhesive polymer hyaluronan (HA) favors the transdermal penetration and absorption of the model drug Caff [39, 40]. The present report describes the QSPR analysis and prediction of the retention times of tea compounds. The aim of this work is to find properties that differentiate tea components according to their retention. This study uses molecular descriptors (MDs) for tea components. The goal is to validate the MDs by their ability to distinguish tea phytochemicals and by their usefulness as predictive MDs for retention, compared with formation enthalpy, molecular weight, hydrophobic accessible surface (HBAS) area, and partition coefficient. Section 2 describes the method. Sections 3 and 4 present and discuss the results. Finally, the last section summarizes our conclusions.


2. Computational method

Biology presents one of the two most important ideas elucidated in 400 years of experimental science: biological evolution (the other is the existence and organization of the periodic table of the elements). In allometry (biological scaling), biological plastic (bioplastic) evolution presents a viewpoint in evolutionary science. It combines the effect of (1) acquired characters and (2) the relationships arising between three principles: evolutionary indeterminacy, morphological determination, and natural selection. The relationship between morphology and function in living forms is such that the former is the material basis of the latter, which is the dynamic result of the former in the context of the interaction between the physical environment and living matter. Morphology, function, energy cost, and vital viability are mutually related: when a morphology is functional, it performs its work with the least energy cost, and the vital viability of the organ or organism is maximal. Quantifying these ideas involves defining the functional co-ordination index Ic as the ratio between the work T performed by a morphology and the corresponding morphological index Im:

$$I_c = \frac{T}{I_m} \tag{1}$$

The greater the work T attained by a specific morphology Im, the greater the Ic. For an organism, Ruiz-Bustos suggested Im as the ratio between morphological surface area S and body weight W [41]:

$$I_m = \frac{S}{W} \tag{2}$$

Substituting Eq. (2) into Eq. (1) gives

$$I_c = \frac{T\,W}{S} \tag{3}$$

Expressing T by its equivalent in classical mechanics provides


Replacing Eq. (4) in Eq. (3) gives


The Ic rises as follows. (1) The greater the body weight for the same distance travelled in the same time, the greater the Ic. (2) The Ic is proportional to the distance travelled in the shortest achievable time. (3) The smaller the body surface, the greater the Ic, and the function-morphology co-ordination requires a lower energy cost.

The SCAP code is based on an algorithm by Hopfinger, parametrized for 1-octanol and water solvents. A solvation sphere is centered on every group of the molecule [42, 43]. The intersecting volume Vo between the solvation sphere and the van der Waals (VDW) spheres of the other atoms is computed. SCAP handles four parameters for a solvent: (1) n: maximum number of solvent molecules filling the solvation sphere; (2) Δg°: change of the Gibbs free energy connected with the removal of one solvent molecule out of the solvation sphere [44, 45]; (3) Rv: radius of the solvation sphere; (4) Vf: free volume available for a solvent molecule in the solvation sphere. Part of the volume of the solvation sphere excludes the solvent molecules. This excluded volume contains the VDW volume of the group at which the sphere is centered and a volume representing the groups bonded to the central one; the latter is modeled by a set of cylinders. The difference between the total volume of the solvation sphere and that excluded to the solvent molecules is the volume V′, which is accessible to the n solvent molecules. The Vf is computed as $V_f = V'/n - V_s$, where Vs is the volume of a solvent molecule. The variation of free energy connected with the removal of all solvent molecules out of the solvation sphere of a group R is $\Delta G_R^\circ = n\,\Delta g^\circ\,(1 - V_o/V')$, and the solvation free energy of a molecule is $\Delta G_{\mathrm{solv}}^\circ = -\sum_{R=1}^{N} \Delta G_R^\circ$. The partition coefficient P between 1-octanol and water results in

$$RT \ln P = \Delta G_{\mathrm{solv}}^\circ(\text{water}) - \Delta G_{\mathrm{solv}}^\circ(\text{1-octanol}) \tag{6}$$

at a given temperature T, taken as 298 K, where R is the gas constant and ΔGsolv°(1-octanol) and ΔGsolv°(water) are the standard-state Gibbs free energies of solvation in kJ·mol−1. To extend SCAP to other solvents, the parameters were adapted by considering the effect of relative permittivity and molecular volume on the 1-octanol parameters. For a general solvent, the maximum number of solvent molecules that can pack the solvation sphere is related to the molecular volume of the solvent as follows:


where Vo, Vw, and Vs are the molecular volumes of 1-octanol, water, and general solvent, respectively, and no, nw, and ns are the maximum numbers of molecules of 1-octanol, water, and general solvent, respectively, that can pack the solvation sphere. The change in the standard Gibbs free energy connected with the removal of one solvent molecule out of the solvation sphere, Δgs°, is computed via the generalized Born equation


where Δgo° denotes Δg° for 1-octanol, and εo and εs are the relative permittivities of 1-octanol and general solvent. The radius of the solvation sphere is related to the molecular volume of the solvent molecule as follows:


where Rv,o denotes Rv for 1-octanol. The free volume accessible for a solvent molecule in the solvation sphere is as follows:


where Vf,o denotes Vf for 1-octanol.
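The free-energy bookkeeping described in this section can be illustrated with a minimal Python sketch. It assumes the group term ΔGR° = nΔg°(1 − Vo/V′) and the sum ΔGsolv° = −ΣΔGR° as stated in the text, together with the standard thermodynamic relation RT ln P = ΔGsolv°(water) − ΔGsolv°(1-octanol) for the partition coefficient. The function names and every numerical input are hypothetical placeholders, not SCAP parameters.

```python
import math

R_KJ = 8.314462618e-3  # gas constant in kJ·mol^-1·K^-1


def group_free_energy(n, dg, v_overlap, v_accessible):
    """Free energy for removing all solvent from the sphere of one group.

    n            -- maximum number of solvent molecules in the solvation sphere
    dg           -- free-energy change per removed solvent molecule (kJ/mol)
    v_overlap    -- volume Vo intersected by VDW spheres of the other atoms
    v_accessible -- volume V' accessible to the solvent molecules
    """
    return n * dg * (1.0 - v_overlap / v_accessible)


def solvation_free_energy(group_terms):
    """Solvation free energy of the molecule: minus the sum over groups."""
    return -sum(group_terms)


def log10_partition(dg_solv_octanol, dg_solv_water, temperature=298.0):
    """Decimal logarithm of P from solvation free energies in kJ/mol."""
    return (dg_solv_water - dg_solv_octanol) / (math.log(10) * R_KJ * temperature)


# Hypothetical two-group molecule with made-up parameters for each solvent.
octanol_groups = [group_free_energy(4, 3.0, 10.0, 40.0),
                  group_free_energy(4, 3.0, 20.0, 40.0)]
water_groups = [group_free_energy(8, 1.0, 10.0, 40.0),
                group_free_energy(8, 1.0, 20.0, 40.0)]

dg_oct = solvation_free_energy(octanol_groups)
dg_wat = solvation_free_energy(water_groups)
print(f"log P = {log10_partition(dg_oct, dg_wat):.2f}")
```

A more negative solvation free energy in 1-octanol than in water gives log P > 0, i.e., a lipophilic solute, which is the sign convention the sketch relies on.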

3. Calculation results

For the 12 tea components {polyol acids [quinic (cf. Figure 1a) and coumaroylquinic acids (Figure 1h)], non-flavonoid polyphenols [gallic acid (Figure 1b) and corilagin], glycosides [theogallin (Figure 1c), digalloyl glucose, and trigalloyl glucose] and GTCs [GC (Figure 1d), EGC (Figure 1e), EGCg (Figure 1f), EC (Figure 1g), and ECg]}, UHPLC retention times, Rt, were obtained by Barsch et al. Epi-diastereoisomers show the gallate, etc. residues in cis-position. The chromatographic analysis is in accord with the technical literature [46].

Figure 1.

(a) Quinic acid, (b) gallic acid, (c) theogallin, (d) GC, (e) EGC, (f) EGCg, (g) EC, and (h) coumaroylquinic acid.

Quinic acid was taken as the reference molecule for the retention time Rt°, owing to its shortest Rt (cf. Table 1). Relative changes (Rt − Rt°)/Rt° were computed for all the components. The molar formation enthalpy was calculated with the MOPAC-AM1 code [47]. The diastereoisomers GC and EGC show similar formation enthalpy and HBAS. Decaffeination does not alter the metabolite composition extensively. Caffeine does not differentiate the samples since the data were acquired in ESI NI mode, in which Caff does not ionize.

| Molecule | Rt (min) | Rt − Rt° (min) | (Rt − Rt°)/Rt° | ΔHf° (kJ·mol−1) a | HBAS (Å2) b |
|---|---|---|---|---|---|
| Quinic acid | 0.8 | 0.0 | 0.000 | −1239.5 | 89.68 |
| Gallic acid | 2.4 | 1.6 | 2.000 | −836.0 | 85.88 |
| Gallocatechin (GC) | | | | −1078.1 | 173.11 |
| Epigallocatechin (EGC) | | | | −1063.5 | 174.31 |
| Digalloyl glucose | 5.3 | 4.5 | 5.625 | −2362.3 | 223.46 |
| Epigallocatechin gallate (EGCg) | | | | −1590.7 | 209.50 |
| Epicatechin (EC) | | | | −880.4 | 202.04 |
| Coumaroylquinic acid | 6.7 | 5.9 | 7.375 | −1372.0 | 265.79 |
| Trigalloyl glucose | 6.9 | 6.1 | 7.625 | −2908.9 | 291.39 |
| Epicatechin gallate (ECg) | | | | −1434.9 | 260.13 |

Table 1.

Retention, formation enthalpy, and hydrophobic-accessible surface area for tea components.

a Molar formation enthalpy calculated with MOPAC-AM1.

b HBAS: hydrophobic solvent-accessible surface area (Å2).
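The relative retention changes in Table 1 follow directly from the retention times. The short Python sketch below recomputes them for the five compounds whose Rt is listed, taking the compound with the shortest Rt (quinic acid) as reference; the variable and function names are illustrative only.

```python
# Retention times (min) copied from Table 1; only compounds with a listed Rt.
retention_min = {
    "Quinic acid": 0.8,
    "Gallic acid": 2.4,
    "Digalloyl glucose": 5.3,
    "Coumaroylquinic acid": 6.7,
    "Trigalloyl glucose": 6.9,
}


def relative_retention(rt, rt_ref):
    """Relative change (Rt - Rt°)/Rt° with respect to the reference Rt°."""
    return (rt - rt_ref) / rt_ref


# The reference is the compound with the shortest retention time.
rt_ref = min(retention_min.values())
relative = {name: relative_retention(rt, rt_ref)
            for name, rt in retention_min.items()}

for name, value in relative.items():
    print(f"{name:22s} {value:6.3f}")
```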

For molecular structures, the use of co-ordination MDs requires adapting the variables T, S, and W (Eq. (3)): T is redefined as minus the standard formation enthalpy (kJ·mol−1); S, as the molecular surface area (Å2); and W, as the molecular weight (g·mol−1). The MDs of the tea components (cf. Table 2) illustrate that Im is approximately constant, while Ic rises with W. The molecular surface and HBAS areas were computed with our code TOPO [48]. The diastereoisomers GC and EGC show similar physicochemical features and BPE MDs.

| Molecule | W (g·mol−1) a | T (kJ·mol−1) b | S (Å2) c | Im (mol·Å2·g−1) d | Ic (kJ·g·mol−2·Å−2) e |
|---|---|---|---|---|---|
| Quinic acid | 192 | 1239.5 | 196.08 | 1.021 | 1213.7 |
| Gallic acid | 170 | 836.0 | 168.23 | 0.990 | 844.8 |
| Gallocatechin (GC) | 306 | 1078.1 | 285.27 | 0.932 | 1156.4 |
| Epigallocatechin (EGC) | 306 | 1063.5 | 286.51 | 0.936 | 1135.8 |
| Digalloyl glucose | 484 | 2362.3 | 421.88 | 0.872 | 2710.1 |
| Epigallocatechin gallate (EGCg) | 458 | 1590.7 | 410.89 | 0.897 | 1773.1 |
| Epicatechin (EC) | 290 | 880.4 | 274.36 | 0.946 | 930.6 |
| Coumaroylquinic acid | 338 | 1372.0 | 332.68 | 0.984 | 1393.9 |
| Trigalloyl glucose | 636 | 2908.9 | 550.91 | 0.866 | 3358.2 |
| Epicatechin gallate (ECg) | 442 | 1434.9 | 401.04 | 0.907 | 1581.5 |

Table 2.

BPE indices for the compounds of tea extracts.

a W: molecular weight (g·mol−1).

b T: minus standard formation enthalpy (kJ·mol−1).

c S: molecular surface area (Å2).

d Im: morphological index (mol·Å2·g−1).

e Ic: co-ordination index (kJ·g·mol−2·Å−2).
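The BPE indices in Table 2 can be recomputed from W, T, and S. The Python sketch below takes Im = S/W and Ic = T/Im = T·W/S, the ratios described in Section 2, which reproduce the tabulated values to the printed precision. The function name and the choice of three example rows are illustrative.

```python
def bpe_indices(weight, work, surface):
    """Return (Im, Ic) for one molecule.

    weight  -- molecular weight W (g/mol)
    work    -- T, minus the standard formation enthalpy (kJ/mol)
    surface -- molecular surface area S (Å^2)
    """
    morphological = surface / weight      # Im = S / W
    coordination = work / morphological   # Ic = T / Im = T * W / S
    return morphological, coordination


# (W, T, S) triples copied from Table 2.
molecules = {
    "Quinic acid": (192, 1239.5, 196.08),
    "Gallic acid": (170, 836.0, 168.23),
    "Trigalloyl glucose": (636, 2908.9, 550.91),
}

for name, (w, t, s) in molecules.items():
    im, ic = bpe_indices(w, t, s)
    print(f"{name:20s} Im = {im:.3f}  Ic = {ic:.1f}")
```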

In the plot of MDs vs. molecular weight W (cf. Figure 2), some points overlap, especially those of the diastereoisomers GC and EGC, which have similar BPE MDs. The only index that remains nearly constant is Im. The sensitivity of the MDs to W decreases in the order Ic > T > S > Im.

Figure 2.

Variation of chemical indices for tea compounds vs. molecular weight. Linear fits, listed in the order Ic, T, S, and Im: y = −300 + 5.42x; y = −10.3 + 4.21x; y = 51.2 + 0.773x; y = 1.06 − 0.000352x.
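A straight-line fit of a descriptor against molecular weight, as plotted in Figure 2, can be sketched with ordinary least squares in plain Python. The example below fits the molecular surface area S against W using only the ten compounds listed in Table 2; the helper name is hypothetical, and the choice of ordinary least squares is an assumption, since the chapter does not state the fitting procedure.

```python
# (W in g/mol, S in Å^2) pairs copied from Table 2.
POINTS = [
    (192, 196.08), (170, 168.23), (306, 285.27), (306, 286.51),
    (484, 421.88), (458, 410.89), (290, 274.36), (338, 332.68),
    (636, 550.91), (442, 401.04),
]


def least_squares_line(points):
    """Return (intercept, slope) of the ordinary least-squares line y = a + b*x."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


intercept, slope = least_squares_line(POINTS)
print(f"S = {intercept:.1f} + {slope:.3f} W")
```

Because the fit uses only the ten tabulated compounds rather than the 12 components mentioned in the text, its coefficients need not coincide with those quoted in the figure caption.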

Changes in (Rt − Rt°)/Rt° vs. molar formation enthalpy ΔHf° and molecular weight Mw show a correlation. The model is


where r is the correlation coefficient; s, the standard deviation; and F, the Fisher ratio. The mean absolute percentage error (MAPE) is 21.66% and the approximation error variance (AEV) is 0.3064. Adding the co-ordination MD Ic improves the fit


and AEV decreases by 17%.

Adding the quadratic hydrophobic solvent-accessible surface area improves the fit


and AEV decreases by 70%. Including the molar formation enthalpy improves the fit, as shown by a smaller standard deviation, greater Fisher statistic, and smaller AEV:


and AEV decreases by 70.2%. The formation enthalpy and hydrophobic-accessible surface improve the fit


and AEV decreases by 72%. The quadratic logarithm of the 1-octanol/water partition coefficient improves the fit


and AEV decreases by 74%. However, this improvement should be taken with care: although the correlation coefficient, MAPE, and AEV improve (greater r, and smaller MAPE and AEV), the standard deviation and Fisher statistic deteriorate (greater s and smaller F) because the model has one less degree of freedom; notice three vs. two variables in Eqs. (16) and (15), respectively. Linear Eqs. (11), (12), and (15) are more suitable for extrapolation than quadratic Eqs. (13), (14), and (16), which are better suited to interpolation. Additional fitting parameters were tested: molecular dipole moment, organic solvent/water partition coefficients, free energies of solvation and transfer from water to organic solvents, molecular volume, surface area, globularity, rugosity, hydrophilic (HLAS) and total solvent-accessible surface (AS) areas, molecular fractal dimension, and fractal dimension averaged for external atoms. Nevertheless, the results do not improve on Eqs. (11)–(16).
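The statistics quoted for each model (r, s, and F) can be illustrated with a small two-descriptor linear regression in plain Python. This is only a sketch under stated assumptions: it uses the five compounds that have retention times in Table 1, with ΔHf° and molecular weight as descriptors, so its coefficients are not expected to reproduce the chapter's equations, which were fitted to the full data set. The function names are hypothetical, and MAPE and AEV are left out because their exact definitions are not given in the text.

```python
import math

# (formation enthalpy kJ/mol, molecular weight g/mol, relative retention)
# for the five compounds with a retention time in Table 1.
DATA = [
    (-1239.5, 192.0, 0.000),   # quinic acid
    (-836.0, 170.0, 2.000),    # gallic acid
    (-2362.3, 484.0, 5.625),   # digalloyl glucose
    (-1372.0, 338.0, 7.375),   # coumaroylquinic acid
    (-2908.9, 636.0, 7.625),   # trigalloyl glucose
]


def mean(values):
    return sum(values) / len(values)


def fit_two_descriptors(x1, x2, y):
    """Least-squares fit y = b0 + b1*x1 + b2*x2 via centered normal equations."""
    m1, m2, my = mean(x1), mean(x2), mean(y)
    d1 = [a - m1 for a in x1]
    d2 = [a - m2 for a in x2]
    dy = [a - my for a in y]
    s11 = sum(a * a for a in d1)
    s22 = sum(a * a for a in d2)
    s12 = sum(a * b for a, b in zip(d1, d2))
    s1y = sum(a * b for a, b in zip(d1, dy))
    s2y = sum(a * b for a, b in zip(d2, dy))
    det = s11 * s22 - s12 * s12          # 2x2 system solved by Cramer's rule
    b1 = (s1y * s22 - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    b0 = my - b1 * m1 - b2 * m2
    return b0, b1, b2


def fit_statistics(y, y_hat, n_descriptors):
    """Return (r, s, F) for a linear model with an intercept."""
    my = mean(y)
    sse = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    sst = sum((a - my) ** 2 for a in y)
    dof = len(y) - n_descriptors - 1     # residual degrees of freedom
    r = math.sqrt(max(0.0, 1.0 - sse / sst))
    s = math.sqrt(sse / dof)
    f = ((sst - sse) / n_descriptors) / (sse / dof)
    return r, s, f


enthalpy = [row[0] for row in DATA]
weight = [row[1] for row in DATA]
observed = [row[2] for row in DATA]

b0, b1, b2 = fit_two_descriptors(enthalpy, weight, observed)
predicted = [b0 + b1 * h + b2 * w for h, w in zip(enthalpy, weight)]
r, s, f = fit_statistics(observed, predicted, n_descriptors=2)
print(f"r = {r:.3f}, s = {s:.3f}, F = {f:.1f}")
```

Adding a third descriptor lowers the residual degrees of freedom by one, which is why s and F can worsen even when r improves, as noted for Eqs. (15) and (16).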

4. Discussion

Molecular studies allowed predicting parameters related to phytochemicals, drugs, and metabolite bioactivities. Direct correlation of MDs with activity was obtained. The chromatographic behavior of drugs in phases of different polarity contains information about their pharmacological performance, e.g., barbiturates and neuroleptics. Chromatographic parameters in a polar stationary phase system correlate better with some MDs, whereas Kováts parameters, obtained from the apolar phase interaction, correlate the best with some others. The MDs predict chromatographic parameters, e.g., retention times in gas chromatography (GC)/LC and retention factor Rf in thin-layer chromatography (TLC). Topological MDs (TDs) were used in chromatographic chiral separations. The chromatographic properties of natural phenol/sugar derivatives were predicted by molecular topology (MT). The properties of chiral quinic acid, theogallin, (+)-GC, (−)-EGC, digalloyl glucose, (−)-EGCg, (−)-EC, trigalloyl glucose, and (−)-ECg were forecasted by MT.

This study related LC-MS retention times of tea compounds to MDs. Molecular functions were obtained through multivariate linear (MVLR) and quadratic (MVQR) regressions, which were selected based on their statistical parameters. Regression analysis of the molecular functions predicted the experimental elution sequence of the tea components. To predict the sequence in tea substances, two- or three-variable models were used, in which the appearance of the co-ordination index, molar formation enthalpy, molecular weight, HBAS, or 1-octanol/water partition coefficient reveals the importance of thermodynamic, steric, geometric, and lipophilic factors in retention, allowing the use of such equations to predict its value. Molecular structures may be differentiated even in other derivatives of tea components not included in the series. Weak MVQR relationships appeared between physicochemical properties (log P and HBAS) and retention.

The reason why plants accumulate polyphenols is related to their defense system, and the functions of polyphenols depend on their chemical reactivity and physicochemical properties. The structural diversity of plant polyphenols in nature indicates that they have different and wide-ranging functions. Some polyphenols, e.g., GTCs and proanthocyanidins, are susceptible to enzymatic and nonenzymatic oxidation depending on the plant. Polyphenol oxidation in plant tissues, e.g., in BT production, proceeds with a reduction of oxygen molecules or polyphenol quinones, whose reactivity with proteins and other co-existing compounds plays a role after harvesting. The secondary polyphenols produced in plants after physical tissue damage relate to the plant defense system, though many products have not been characterized chemically. Artificial processing, e.g., drying, oxidation, and roasting, differs from the natural reactions occurring in living plants, e.g., insolubilization and polymerization, and produces different compounds. Scientific studies indicated that polyphenols in foods present health benefits. Identifying the mechanisms of their production and their chemical structures is important. GT presents the greatest variability in physicochemical properties. Many beneficial effects of GT are related to its GTC content, particularly ECg and, especially, EGCg. Their BBB permeability, easy access via the diet, and low toxicity make them promising molecules for the prevention and treatment of chronic neurodegenerative diseases.

5. Conclusion

From the discussion of the present results, the conclusions follow.

  1. The aim of this work was to develop structure-property relationships for the qualitative and quantitative prediction of the ultrahigh-performance liquid chromatographic retention times of tea components. The results add to the scientific knowledge on predicting relationships among the components of different tea samples.

  2. Structure-property relationships perform as expected in predicting retention times, which supports the elucidation of unknown components in metabolomics studies. The SCAP code allows computing hydration and solvation free energies and partition coefficients, which show that, for a given atom, energies and partition coefficients are sensitive to the presence of other atoms and functional groups in the molecule.

  3. The parameters needed to compute the co-ordination descriptor are the molar formation enthalpy, molecular weight, and surface area. Linear and quadratic correlation models were obtained for the chromatographic retention time.

  4. A benefit of our structure-activity relationships is that they reveal weak quadratic relationships between the partition coefficient, hydrophobic solvent-accessible surface area, and retention. The trend between the co-ordination index and the molecular weight indicates not only a homogeneous molecular structure of tea components but also the ability to predict and tune their properties, which is nontrivial in metabolomics studies.

  5. The effect of different kinds of properties was examined: thermodynamic, steric, geometric, lipophilic, etc. The molar formation enthalpy, molecular weight, hydrophobic solvent-accessible surface area, partition coefficient, etc. differentiated tea components in linear and quadratic equation models.

  6. The morphological and co-ordination descriptors completed multivariable regression expressions for the chromatographic retention.



The authors thank Generalitat Valenciana (Project No. PROMETEO/2016/094) and Universidad Católica de Valencia San Vicente Mártir (Project No. UCV.PRO.17-18.AIV.03) for their support.

Conflict of interest

The authors declare no conflict of interest.

© 2018 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of the Creative Commons Attribution 3.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite and reference


Francisco Torrens and Gloria Castellano (November 5th 2018). QSPR Prediction of Chromatographic Retention Times of Tea Compounds by Bioplastic Evolution, Tea - Chemistry and Pharmacology, Gonçalo Justino, IntechOpen, DOI: 10.5772/intechopen.81735. Available from:
