Strategies and optimization parameters and type of fermentation in production of various proteins and small molecules.
E. coli is considered one of the best model organism for biopharmaceutical production by fermentation. Its utility in process development is employed to develop various vaccines, metabolites, biofuels, antibiotics and synthetic molecules in large amounts based on the amount of yield in shake flasks, bioreactors utilised by batch, fed-batch and continuous mode. Production of the desired molecule is facilitated in the bioreactor by employing strategies to increase biomass and optimised yield. The fermentation is a controlled process utilising media buffers, micronutrients and macronutrients, which is not available in a shake flask. To maximise the production temperature, dissolved oxygen (aerobic), dissolved nitrogen (anaerobic), inducer concentration, feed or supplementation of nutrients is the key to achieving exponential growth rate and biomass. Design of experiments (DOE) is critical for attaining maximum gain, in cost-effective manner. DOE comprises of several strategies likewise Plakett-Burman., Box–Behnken, Artificial Neural Network, combination of these strategies leads to reduction of cost of production by 2–8 times depending on molecules to be produced. Further minimising downstream process for quickly isolation, purification and enrichment of the final product.
- E. coli
- Design of experiments (DOE)
- Response surface methodology
- Bacterial Fermentation
- Process optimisation
- disulfide bridges
- codon optimisation
- post-translational modifications
- inclusion bodies
|Name of Product/Molecule||Type of Fermentation||Design optimization||Media||Carbon Source||Reference|
|1,3-propanediol||Batch||Glycerol and glucose Co-fermentation||Reinforced Clostridium Medium (RCM)||Glucose|||
|D-amino acid Oxidase||Batch||Combination fermentation||Complex medium||Glycerin|||
|O-acetylhomoserine||Batch||Glycerol-Oxidative pathway||MS medium||Glycerol|||
|Heparosan||High cell density||High cell density fermentation||Chemically defined fermentation media||Glucose|||
|O-succinyl-l-homoserine||Batch||Multilevel fermentation optimization||Defined media||Glucose|||
|(2S)-Naringenin||Batch||Fermentation and metabolic pathway optimization||Optimised MOPS minimal medium||Glucose|||
|L-tryptophan||Batch||Batch Fermentation||Complex medium with additives||Glucose|||
|Beta-farnesene||Batch||Batch Fermentation||2 × YT medium||Glycerol|||
|L-tryptophan||Batch||Enzymatic hydrolyzate||Complex medium with yeast extract||Glucose|||
|VP2 VLP vaccine||High cell density||Fed batch||Luria Bertani Media||Glycerol|||
|Vitamin B12||Batch Fermentation||Metabolic engineering||LB media||Glucose|||
|L-Valine||Batch fermentation||Two stage fermentation||Complex Medium||Glucose|||
|PhoA||Batch Fermentation||Design of experiments for ||Complex medium Phosphate optimization||Glycerol|||
|L-methionine||Batch fermentation||Design of experiments with Genetic engineered Escherichia coli||Complex medium with yeast extract||Glucose|||
|Reteplase||Batch fermentation||Optimization using response surface methodology||LB medium (Response surface methodology for pH, Temp, RPM)||—|||
|Tauroursodeoxycholic acid||Batch fermentation||Optimization of engineered Escherichia coli cell factory||Complex medium||Glucose and Glycerol|||
|Tyrosine phenol lyase||Batch fermentation||Statistical medium optimization||Complex medium||Sucrose|||
|Phytochromes Production||High-cell density ||High cell density fermentation||Minimal Medium||Glucose|||
|Keratinolytic protease||Batch fermentation||Protein engineering to enhance keratinolytic protease activity||Complex Medium||Glucose|||
|L-Threonine||Batch fermentation||Biofilm-Based fermentation strategies||Complex Medium||Glucose|||
|Therapeutic DNA vaccine||Batch fermentation||Response surface methodology||2 x YT medium||Mannitol|||
|Pyruvate Oxidase||High cell density||Temperature gradient based methodology||Yeast, Tryptone complex medium||Glycerol|||
|rPDT fusion protein||Piolet scale batch fermentation||Statistical experimental design||LB medium||—|||
|Platelet aggregation inhibitor||High cell density fermentation||Optimization of production||Yeast, Typtone||Glycerol|||
|Vitreoscilla haemoglobin||Batch fermentation||Immobilised ||Cheese Whey, Yeast extract||—|||
|D-Lactose||Fed-batch fermentation||Optimization of D-lactose production||Modified mineral salt medium||Glucose|||
|Curcumin||Batch Fermentation||Optimization of fermentation||TB medium (Tryptone, Yeast extract)||Glycerol|||
|Alkaline beta-mannanase||Fed-batch fermentation||Induction conditions optimization||TB medium (Tryptone, Yeast extract) with additives||Glycerol|||
|HIV-1 P17||Fed batch fermentation||Multiparameter optimization for secreted protein||TB medium (Tryptone, Yeast extract)||Glucose|||
|Heptatitis Delta Virus (HDV) antigen||Fed batch fermentation||Optimization of Fed batch||LB medium||Glycerol|||
|Chitosanase||Fed batch fermentation||Optimization of secretion efficiency PelB-CSN-N||Terrific Broth with trace elements||Glycerol|||
|Nitrilase||High cell density||Optimization of temperature, substrate and IPTG||Super Optimal broth (SOB) medium||Glycerol|||
|Human Interferon Beta||Batch fermentation||Reduction of acetate by Response surface methodology||Terrific Broth (TB)||Glucose|||
|Beta-chitin||Batch fermentation||Selection of strain and IPTG optimization||Luria Bertani (LB)||—|||
|Beta-glucosidase||Batch fermentation||Ag43-mediated thermostable for high temperature fermentation||Mineral Media||Glucose|||
|46 KDa Antibody Fab’ fragment||Batch fermentation||ultra-scale-down (USD) approach based on focused acoustics||Modified defined medium||Glycerol|||
|Therapeutic protein production (10 KDa)||Fed-Batch fermentation||Optimization of inclusion bodies by autoinduction without adding IPTG due to galactose in the complex media||Yeast and Soy Peptone||Glycerol, Glucose|||
|Catechol-O-methyltransferase||Batch fermentation||Optimization of production by artificial neural network||Tryptone, Yeast extract (SOB Medium)||Glycerol|||
|Platelet aggregation inhibitor, salmosin||Fed-batch fermentation||Optimization of carbon and nitrogen sources||Yeast and Tryptone||Glycerol|||
|VB4–845, an immunotoxin||High cell density fermentation||Optimization of VB4–845 titre in the supernatant||Glycerol Minimal medium||Glycerol, Arabinose|||
|GDP-mannose||Coupling fermentation||Optimization by Coupling fermentation||Defined media||Fructose, Mannose|||
|Heparosan||Fed-Batch fermentation||Defined medium using exponential fed-batch glucose addition with oxygen enrichment||Defined media||Glucose|||
|Pig liver esterase||Batch fermentation||Optimization of co-expression in combination with chaperones GroEL/ES allowed the production of soluble and active enzyme||Complex medium||Glucose|||
|Penta-N-acetyl-chitopentaose||Two step fermentation||Optimization of oligosaccharides production||Minimal Medium||Glycerol|||
|Resilin||Batch fermentation||Primary induction step before lactose induction||Lactose, Casein, Yeast||Glycerol|||
|Malaria antigen PvRII||High cell density||High cell density for production of recombinant proteins||Defined media||Glucose|||
|Bacterial phytase||Fed-batch fermentation||Optimization of two different feeding strategies||Terrific Broth||Glucose|||
|Human interferon-gamma (hIFN-gamma)||High cell density fermentation||Exponential growth by using glucose as carbon source||M9 Modified medium||Glucose|||
|B subunit of Escherichia coli heat-labile enterotoxin (LTB)||High cell density||Step wise addition of IPTG||Complex media||Glucose|||
|Plasmid pAQN carrying the aqualysin I (AQI) gene||Batch fermentation||Optimization of yeast extract supplementation||LB and M9 minimal media||Glucose|||
|Prochymosin||Batch fermentation||Use of lactose as energy source and inducer||Complex medium||Glycerol, Glucose|||
|HIV-1 protease||Batch fermentation||Separation of growth and production phases in a two-step process||LB and M9CA medium||Glucose|||
|Porcine somatotropin protein||High cell density||Oxygen enrichment, yeast extract (YE) effect, optimal specific growth in switching on gene expression, and feeding strategies.||Defined media with additives||Glucose|||
|Human interferon-alpha 1||High cell density||Optimization of nutrient source||Defined media||Glucose|||
|Human interleukin-1 beta||High cell density||Maintenance of low glucose and acetate for production||Defined media||Glucose|||
|EcoRI||Batch fermentation||Control of pH and Oxygen||LB media||—|||
Production of Biopharmaceuticals and Biomolecules during fermentation require media formulated using cheap ingredients for supplementation of carbon and nitrogen. The carbon sources are typically glucose, glycerol, molasses (Table 1). Nitrogen sources are typically yeast extract and digested proteins by enzymes. The utilisation of carbon by
DoE is a multifactorial methodology utilising statistical approaches to design and analyse an experimental process by which better outcome and results are obtained. Bacterial growth is the system response impact of “DoE planning” by varying critical parameters and simultaneous factors, such as phosphate, sulphate, vitamin and amino acid concentrations. The full factorial design, DoE is attributed by important principals to allow for robust, valid experimentation. These are known as treatments, replicated, randomised to protect against hidden factors . All treatments must be performed in blocks to control sources of variation. In an experimental process design, all four principles, the statistical approach, conform to regulations and complete exploration of an extensive experimental design process, helps in reducing the number of experiments to get a good outcome . For cultivation using
|Component||Batch||Basal and Initial medium||Components added in feed solutions||Cost Per 10 Litre of Fermentation media (USD)|
|Batch||Fed batch (Basal)||Fed Batch (Feed Media)|
The DoE with a correct statistical model to understand the effect is needed to be adopted by industrial producers to achieve a robust and high yielding process. The DOE is statistically planned trials for optimising factors associated with increased product output, yield and cost reduction. Typically, DOE experiments are done to screen up to 2–15 elements. Traditionally, using OFAT, the number of experiments needed to determine the effect on yield, the output is numerous. Using the DOE approach, several components, additives and sources of nutrients are compared simultaneously, and their impact is evaluated. Based on the response critical factors are ranked. Soon after the response effect by variables are generated, the analysis is successively recorded. Various DoE experiments are studied to understand influencing factors determined by a statistically relevant experimental plan. The difference in factors is measured by Analysis of variance (ANOVA). The results obtained after analysing responses are plotted using contour plots and response surface methodology (RSM). There are several software packages for optimisation of critical factors and parameters Design Expert (Stat-Ease Inc.), GT-SUITE (DOE), DOE ++ (RecCom), MODDE (Umetrics), DoE Fusion PRO (S-Matrix Corp.), STAVEX (Aicos), Minitab (Minitab Inc.), and JMP (SAS) .
2. Critical fermentation ingredients
Critical fermentation ingredients are media components which cannot be replaced. These can only be standardised for maximised yield. Typically, buffers and nitrogen sources such as yeast extract, tryptone are not changed. Since
2.1 Batch fermentation
In microbial batch, fermentation cultivation is done in a fixed volume of medium in a fermenter. The standard inoculum in the fermenter is 50–200 mL of shake flask volume in 2–5 L of fermentation media. The batch fermentation typically OD600 is 20–40 in 8–12 h time. The microbial growth depletes the nutrients resulting in the accumulation of by-products; there is a continuous change in the culture environment. After completion of the batch, media and cells are harvested. The advantages are batch fermentation, ease of operation, low risk of contamination, high yield of protein to biomass in less time of fermentation, and majorly for soluble or excreted proteins. Typical disadvantages are relatively long downtime between batches due to vessel setup and sterilisation, low cell/biomass densities, due to cleaning. DOE is needed to optimise the required nutrients and minimise product accumulation during fermentation. Typical batch fermentation media constitutes Yeast extract, Bactotyptone (or Soybean meal) 10 g −l to 24 g −l, respectively. Buffers of Sodium and potassium phosphate in combination to reach pH 6.8. to 7.0, 100 X amino acid solution, Trace elements (400x) is CoCl2. 6H2O (1 mg/ml−1), MnCl2.4H2O (6 mg ml−1), CuCl2.2H2O (0.6 mg ml−1), H3BO3 (1.2 mg ml−1), Na2MoO4.2H2O(1 mg ml−1), Zn(II)acetate.2H2O (5.2 mg ml−1) and Fe(III)citrate (40 mg ml−1) . The typical medium components are listed in the (Table 2) for batch fermentation as a base design to start optimisation.
2.2 Fed-batch fermentation
Fed-batch fermentation is a standard mode of fermentation in the bioprocess industry. Typically, fed-batch fermentation starts at the end of batch fermentation.
The first equation, ms (i), is the value of feed rate at the initiation of the fed-batch phase at time ti. μset is the specific growth rate, m is specific maintenance coefficient, YX/S is yield coefficient, V is the bioreactor volume, and So is the initial glucose concentration. In the second equation, ms(t) is the rate of addition of substrate (g h−1). After induction of protein expression, the specific growth rate of
Start of feed is determined by measuring the concentration of substrate in the fermentation broth typically after 10–12 hrs of batch. The feeding strategy should be designed so that the growth rate is maintained to limit the production of toxic formate, acetate and other metabolic compounds, enhancing bacterial growth. The growth of bacteria and conversion of feed to biomass is maximum when the exponential growth phase is maintained. The utility of fed-batch and importance is obtaining high cell density and biomass, leading to increased production of the high amount of product yield. The fed-batch is applicable to increase product yield by limiting growth rate and controlled substrate utilisation (Table 2). The media for batch and fed-batch fermentation is listed (Table 2).
3. Culture nutrient optimization strategies
3.1 One factor at a time- classical media optimisation methods
Selection of one-factor-at-a-time (OFAT) is a traditional method for optimisation of media. In this strategy only one factor is varied keeping all other parameters constant. The usual choice is ease and convenience; it makes OFAT the most preferred choice for formulating, designing, optimising, and scaling up the fermentation medium . This method is still popular among many research groups for developing the medium for fermentation. Physical parameters, supplementation, removal, replacement and feedback experiments are primary considerations during OFAT. They comprise of growth temperature, operating pressure, size of nutrients or extracts. The bioprocess in fermentation controls constant supply of nutrients, removing metabolic products and toxic compounds, and constantly disseminating the nutrient solids, buffers and salts in liquid and gaseous phases. There is a constant evolution of design improvement for agitation and aeration; these allow better control over flow dynamics, the minimal effect of viscosity, and even circulation of components and nutrients. The healthy growth of culture in the batch is maintained by supplementing nutrients such as M9 medium with FeSO4∙7H2O; M63 medium with KOH; A medium MgSO4∙7H2O, 20% Glucose or sugar, vitamins, casamino acids or L-amino acids . Removal experiments are required for the identification of critical components needed for the media. Certain media affect the reduction of formate, acetate, and reduction of pH. There are few examples associated with removing glucose from complex media to prevent inhibition of bacterial growth. Replacement experiments identifies correct nutrients complexes for nitrogen source yeast, soy peptone, bactotryptone, meat extract and protein powder. The carbon sources utilised are glucose, glycerol, sucrose, lactose and others. The use of OFAT for designing of media for fermentation limits number of experiments, the approach is suitable for production of metabolite. In one study, precursor carbohydrate phosphotransferase system (PTS) encoding genes a vital DXP pathway were deleted. This resulted in the enrichment of Isoprenoid phosphoenolpyruvate. Growth medium and production of lycopene (a C40 isoprenoid) resulted in maximisation by these culture conditions .
Defined media recipes used in the fermentation of
3.2 Statistical designs for
E. coliMedia optimization
Typically, statistical medium optimisation is beneficial in improving overall product output, reduces time needed for process development and cost. The microbial processes have complex reactions. Evaluation of results statistically increases the reliability of results, further reduction in the number of experiments. In one of the study, the GDP mannose pyrophosphate yield was improved upto 100% after conducting 33 experiments . Improvement of media by DOE is for understanding various test variants, multiple investigations, and uniform pattern. The results obtained after the various experiments are used to predict media improvement using mathematical models. The current advancement in statistical techniques provides rapid analysis of experimental findings. Meticulously planned experiments can enhance the desired outcomes using DOE strategies. For designing a full factorial, possible combinations of relevant factors, e.g. temperature, pH, buffers, carbon, nitrogen sources, strain, are considered. Similarly, partial factorial design is considered if knowledge about few components is not available. These experiments if planned and output is studied properly results in quick and definitive reproducible processes.
3.3 Identification of critical components: Plakett Burman design
Cultivation at a large scale requires a medium that will produce maximum yield of product per gram of substrate, maximum biomass and minimum undesirable by-products. Also, consisten with minimal problems during media preparation and sterilisation. While considering the biomass in isolation, it must be recognised that efficiently grown biomass produced by an optimised high productivity growth rate is not necessarily best suited for its ultimate purpose, such as synthesising the desired product. Different combination, and sequences of process condition need to be investigated to determine phases, specific sets of conditions during optimisation. OFAT for media optimisation using traditional replacement experiments with keeping one factor at a time for nutrient, antifoam, pH, temperature are highly time-consuming and expensive. Minimum number of experiments and development of process in short duration of time is prerequisite to media optimisation. Therefore, other alternative strategies must be considered, which allows more than one variable to be changed at a time, and these methods have been described in earlier studies by Placket and Burman 1946, and Hendrix 1980 (Table 3).
|Factor 1||Factor 2||Factor 3||Factor 4||Factor 5||Factor 6|
|Std||Run||A:Select Soytone||B:Yeast Extract||C:Glucose||D:Buffer||E:Trace Metals||F:MgSO4|
The Plakett Burman algorithm is a rapid statistical approach enables us to obtain the physicochemical parameters and factors influencing the fermentation process with the limited number of planned experiments . For the given number of observation, the linear effect of all aspects are screened with maximum accuracy. The design is practical when investigating a large number of factors to produce an optimal or near-optimal response. Statistically optimised media design along with kinetic models characterises the fermentation behaviour more rapidly to achieve maximum productivity. Also, when complex carbon-nitrogen substrates, such as yeast extract or peptone, are used together with carbohydrate substrates, the Dissolved Oxygen (DO) change is not as significant when the carbon source is depleted, as cells continue to utilise the complex substrates . The use of a good reliable model is essential to develop better strategies for optimising the fermentation process . In one the study, during production of succinic acid  increasing output was achieved by combining Plackett-Burman design (PBD), steepest ascent method (SA), and Box–Behnken design (BBD) for fermentation medium. PBD identified Glucose, yeast extract, and MgCO3 as critical components with optimal concentration was located to be 84.6 g L−1 of glucose, 14.5 g L−1 of yeast extract, and 64.7 g L−1 of MgCO3 . Also, the productivity was enhanced by 67.3% and 111.1%, respectively. Microbial fermentation for L-methionine (L-Met) production was enhanced by Plackett-Burman (PB) design, and Box–Behnken design (BBD) estimated glucose 37.43 g/L, yeast extract 0.95 g/L, KH2PO4 1.82 g/L, and MgSO4.7H2O 4.51 g/L), L-Met titre was increased to 3.04 g/L from less than 2.0 g/L. an increase of 38.53% and 30.0% compared with those of the basal medium, respectively. Furthermore, higher L-Met productivity of 0.261 g/L/h was obtained, representing 2.13-fold higher in comparison to the original medium .
In another study, yield of O-succinyl-l-homoserine (OSH) was improved through multilevel fermentation optimisation; Plackett-Burman design was used to screen out three factors (glucose, yeast and threonine) from the original 11 factors that improved the titre of OSH.
Plackett Burman randomisation is an excellent tool for the determination of the effect of variables for optimisation. Once such approach for preparation of Bacterial Ghosts (BGs) preparation is established using these methods. The twelve experiments containing either the +1 or − 1 value for each variable in each experiment in random arrangement have been conducted simultaneously to get the best results and enable the best possible comparison. The BGQ has been given 100% quality as 10, while ten cells have been evaluated as either bad or good. This will decrease the range of the differences if we use %. Unexpectedly,
|Source of Variation||Degrees of Freedom||Sum of Squares [Partial]||Mean Squares [Partial]|
3.4 Optimisation of fermentation conditions: Box-Behnken response surface methodology
George E. P. Box and Donald Behnken in 1960 developed the Box–Behnken response surface method. This algorithm establishes a comparison between composite central, three-level full factorial and Doehlert designs to optimise the fermentation conditions. In one of the example result of optimisation, the titre of O-succinyl-l-homoserine (OSH) reached 102.5 g l−1, which is 5.6 times higher than before (15.6 g l−1) . Similarly, by Box–Behnken combination and Plackett-Burman design and were optimised further by employing the Response Surface Methodology, O-acetylhomoserine OAH production was up to 9.42 and 7.01 g/L. The effect of glycerol, ammonium chloride and yeast extract were screened for fermentation conditions .
scFv anti-HIV-1 P17 protein was optimised by the sequential simplex method. Plackett-Burman design (PBD) and sequential simplex were combined with the aim of improving feed medium for enhanced cell biomass, relative protein to biomass ratio. The scFv anti-p17 activity was enhanced by 4.43, 1.48, and 6.5 times more than batch cultivation, respectively .
DNA vaccine pcDNA-CCOL2A1 production was increased using the response surface method (RSM) in
3.5 Functional characteristics with minimum experiments—Taguchi design
There are several challenges associated with the PBD and Box–Behnken design. To overcome these challenges of Box–Behnken new array based on “Orthogonal Array” was developed. Using this method, less number of experiments, instead of full factorial, is implemented. The system and technique provide control over three stages, likewise system strategy, tolerance design and parameters designing. The strategy design helps in determining tolerance, affecting factors in product output. Taguchi design is using a number of OAs to initiate the experimental setup, these arrays are utilised to suit the number of experimental iterations. Second step is conducting total tests with orthogonal arrays. These experiments are decided as per number of trial experiments as per Taguchi design, followed by randomisation of experiments for determining the output. This design analyses main effect and two-factor interactions. Noise for uncontrolled experimental variables is considered, focal point for two-point analysis. Taguchi methodology removes effect of noise due to uncontrolled variables; this is better as compared to PBD . The Taguchi method provides help in functional characteristic for capturing acceptable deviations. Human insulin-like growth factor I (hIGF-I) was produced in one study in
3.6 Deciphering outcome with- central composite design
This design is widely used in building a second-order (quadratic) model in response surface methodology (RSM). It consists of factorial Design with two levels +1 and − 1; centre points, factorial Design in experiments with median values; and star points identical runs for centre points except for one factor considering values above median and below the median. The number of star points is double the number of factors used. CCD is defined on the level of factors: as Face centred CCD (CCF), Inscribed CCD (CCI) and Circumcentered CCD (CCC).
3.7 Predicting effects of responses—partial least squares modelling
The effect of media ingredients, interactions with the system (X) and co-relation in response to culture ΔOD600 (Y) is defined by the Partial least squares (PLS) model. PLS covariance of matrix design and outcome are inferred accurately by virtue of small underlying events not measured directly. The ideal or latent variable (LVs) to study outcome needs to be carefully evaluated to avoid overfitting training data. The prediction of the accuracy of models for DoE iterations with multiple values, the Root Mean Predicted Residual Error Sum of Squares, is with the lowest value of LVs, having the slightest error. The significance of LVs is calculated by the Voet T2 test. The score of media design and component determined by OD600, the threshold of 0.8 is accepted. The lower score of model threshold defines to remove from subsequent designs; these threshold values are considered to study the positive and negative effect on the contribution of various factors associated with the increasing growth of the culture. In one study, 2D spectrofluorometry was utilised for fermentation processes to monitor the fermentation process online to produce extracellular 5-aminolevulinic acid (ALA). Various chemometric methods used for analysis of the spectral data are principal component analysis (PCA), partial least square regression (PLS) and principal component regression (PCR). PCA results visualised and considered for online fermentation monitoring. PCR and PLS compared for correlation between the 2D fluorescence spectra, PLS had slightly better calibration and prediction performance than PCR .
3.8 Minimal product trial experiments—definitive screening designs
Traditional definitive screening designs (DSD) require a low number of experiments and trials to determine the positive outcome. Jones and Nachtscheim developed Jones DSD methodology. These designs are popular in biopharma due to the relatively small number of experiments. These designs use three levels for each factor, allows estimation of nonlinear effects. Evaluate the number of runs to determine X is 2 X + 1 or 2X + 3 for even and odd values. Typically, the Design of X = 6 is used if X < 6. There are few dummies runs with additional factorial or centre points are added to precisely determine the experimental error. In DSD, few different factorial trials or centre point trials are added to the initial design to define and evaluate the experimental error. One such example of DSD is for upstream process development for cell growth and increased product output in fed-batch high-cell-density fermentation. The expression of the desired gene cloned in the plasmid was under the control of the phoA promoter . Simultaneous evaluation of phosphate concentration from 2.79 mM to 86.4 mM was designed using DoE. Several parameters, phosphate content, temperature, pH, and DO evaluated using a Definite screening design (DSD), resulting in determining each parameter’s impact on product formation. Similarly, a 24-bioreactor ambr250™ system for fermentation utilised 10-factor DSD to characterise the process of demonstrating 16 batches reproducible workflow for recombinant protein production. This strategy was further evaluated by QbD approaches to assess techniques for late-stage depiction in small experiments and subsequently leading to large scale fermentation parameters improvement .
3.9 Stepwise regression and artificial neural network modelling
Artificial Neural Networks (ANN) are known for parallel, and continuous learning capabilities are known to interpret nonlinear functions. These are utilised to predict steady-state and dynamic processes. One iteration multi-layer perceptron (MLP) is famous for estimating hidden layers between output and input layers. Using this method, simulation of dissolve Oxygen (DO) parameters, Feeding (F), Biomass, Glucose, Acetate, and output production of γ-interferon is modelled. Several DoE iteration modelled using stepwise regression; these models are fitted with linear regression, six terms per model are allowed with Heredity Restriction. The goodness-of-fit of the resulting model is evaluated using Akaike Information Criterion (AICc). An artificial neural network (ANN) is used to create weighted ensemble of regression models. There are three nodes in single hidden layer of ANN. Sigmoid activation functions were used, cross-validation of 19 of the media formulations defined in the second DOE iteration were randomly selected and withheld from ANN training set to do validation studies .
4. DOE optimisation techniques for
4.1 Evaluation of production and process—response surface methodology (RSM)
RSM is simple, robust and efficient, in predicting processes of metabolite or product production. Also, this method helps in the determination of factors for specifications, changes in levels of the elements, response with specified levels, quantitative understanding of system behaviour, predict product properties, factor combinations not run and stability of the designed process.
RSM methodology consists of different phases. Typically, performed in three steps, First is the screening factors by steepest ascent/descent, secondly by quadratic regression model fitting, third optimization using canonical regression analysis.
For a cost-effective and robust process, improvising parameters related to medium, productivity, safety and usefulness are desired outputs. The interdependency of factors associated with productivity is difficult to understand, and this slows downs the enhancement process and yield evaluation.
Response surface methodology (RSM), is based on factorial designs to elevate the process and final product yield. RSM is considered a sturdy, robust, and efficient mathematical approach. It includes experimental statistical methods, multiple regression design, and analysis, resulting in developing the best strategies guided by constrained equations. RSM is typically applied to study the response of different media ingredients . The production of Examples for
Where β0 is defined as the constant, Bi the linear coefficient, Bii the quadratic coefficient and Bij the cross-product coefficient. Xi and Xj are levels of the independent variables, while ε is the residual error. This variable and RSM predicted 7.81 g L−1 glucose, 30°C for fermentation and induced at OD600 1.66, Combined with BBD to get the 95.50% acetate and 97.96% productivity of rhIFN-β .
Plackett Burman design and Response surface methodology are utilised together to increase the production of the desired product multiple-fold. Combining these techniques is usually employed to enhance the product outcome of several microbial processes, batch, fed-batch fermentation. RSM is widely used, with Plackett Burman, CCD, Box Benken. Even after much success, several limitations are associated with RSM, likewise predicting responses based on second-order polynomial equation , results in poor estimation of optimal designs, leading to low levels of yield or outcome. One limitation is developing a model for many variables on physical and chemical inputs due to nonlinear biochemical network interactions, with partial knowledge of these systems . Another limitation is the study of multiple interactions and significant variations, resulting in error, bias and or no reproducibility. These challenges are dealt with better using Artificial neural networks (ANN) .
4.2 Study of interaction of pathways and multiple parameters—artificial neural network
An artificial neural network (ANN) is designed for a computing system to simulate the information and process the data similar to the human brain, guided by artificial intelligence (AI). It solves problems impossible or difficult by human or statistical standards. Handling units consist of inputs and outputs; using these inputs, ANNs produce desired or defined work. ANNs are built as neurons, are interconnected like a web in the human brain. There are hundreds or thousands of artificial neurons or processing units interconnected by nodes .
Similarly, as human functions, ANNs have a set of rules for learning backpropagation, an abbreviation for backward propagation of error, to perfect their output results. Typical processes in ANN are the training phase to recognise patterns in data, whether visually, aurally, or textually; in this supervised learning, the actual output is compared with the desired outcome. The differences are adjusted using backpropagation. The program runs backwards as we advance, and adjustments are made until the actual and expected output difference results in a minimum possible error. Designing of medium or metabolic process ANN is highly suitable, and it generates large amounts of data. The architecture of ANN consists of three layers: a layer of “input” connected to “hidden” units, ultimately connected with “output”. The conditions for ANNs three types are Supervised, Unsupervised and Reinforced learning. The objective of supervised learning for the neural network is to provide input training data and possible experimental output. An unsupervised output unit is trained to respond to clusters or patterns present in the input data. Reinforced learning is an intermediate system; learning systems’ actions are considered good or bad based on environmental responsibility. These parameters are adjusted till the time equilibrium state is attained. These systems are applied for system designing, modelling, optimisation. It leads to control the noisy signals and generalise through system training procedure. The ANNs are employed in various fermentation processes to optimise nutrient and prediction biomass outcome in different culture conditions. ANN has several limitations; likewise, it needs proper training, also based on input data to get the quality output . To overcome some of these challenges, ANN combined with a genetic algorithm (GA) is applied to improve the concentration and shelf life of aspartate-β-semialdehyde dehydrogenase protein .
4.3 Study of biological process of evolution- genetic algorithm (GA)
Genetic algorithm (GA), developed in 1975 by Holland and Long, is based on Charles Darwin’s theory of natural selection. GA is a model for the study of biological evolution by testing crossover and recombination mutation in adaptive and artificial systems. A genetic algorithm works as a problem-solving strategy using essential genetic operators. There are several GA designed to deal with complex problems and parallelism for stationary or non-stationary functions, linear or nonlinear, continuous or discontinuous, random or noise. Improvement in yield of recombinant G-CSF was obtained in auto-induction medium. The backpropagation (BP) algorithm and radial basis function (RBF) algorithm combined with the Genetic Algorithm improve G-CSF yield. The yield of models was 72.24 and 76.09%, respectively, and are higher than those obtained using non-optimised auto-induction mediums.
There are some disadvantages or genetic algorithms as well. The formulation of the fitness function, population size, choice of factors for mutation and cross over, selection of criteria for these factors needed to be carried out carefully. Despite drawbacks, GA is one of the widely used algorithms in modern nonlinear optimisation .
4.4 Geometric function evaluation- Nelder-Mead simplex algorithm
Nelder–Mead published this algorithm in 1965. The objective is to solve the classical unconstrained optimisation problem of minimising a given nonlinear function; without derivative only numerical evaluation of the objective function is needed. This algorithm is based on geometry, and in three-dimensional space, simplex is a tetrahedron determined by four points (vertices) and their interconnecting line segments. For two dimensions, simplex is an equilateral triangle, and three dimensions should be tetrahedron. The objective function is evaluated every point with the highest numerical value of all four points is perpendicularly mirrored against the opposite plain segment, generating reflection . An expansion can accompany the reflection to take more significant steps or a contraction to shrink the simplex where an optimisation valley floor is reached. The optimisation procedure continues until the termination criteria are met. The termination criterion is usually the maximum number of reflections with contractions or tolerance for optimisation variables. The algorithm can be implemented in N dimensions, where simplex is a hypercube with N + 1 vertex points. The NM method provides significant improvements in primary iteration and improves outcome. NM is combined with ANN to optimise the production of metabolites .
5. Problems and bottlenecks in
E. colimedia optimisation
Medium optimisation involves many experiments irrespective of the media chosen, which accounts for labour cost and is an open-ended experiment. Many experiments are carried out at shake flask, even after generation of large amount of data using single experiments. The results obtained at piolet scale batch fermentation are not reproducible. During shake flask experiments, the precise control over pH, oxygen transfer and evaporation is not controlled. The experiments carried out at shake flask may or may replicate during fermentation. Also, soluble proteins expression may lead to inclusion bodies formation. Optimisation of media is time consuming due to the requirement of rigorous experimental planning. Moreover, the media utilised in the production of recombinant products faces challenges due to variability in different batches, media availability, cost of media, bulk storage, transport time. For Biotherapeutics, Enzymes and Probiotics, the cost of media needs to lower in Probiotics compared to Enzymes and Biotherapeutics, respectively. The choice of fermentation depends on the solubility of protein from batch to fed-batch.
Improvement of production of recombinant products is also guided by downstream processing of protein. Therefore, series of experiments designed for correct folding and confirmation are most important. Significant protein amounts can be achieved using pH, time for fermentation, oxygen transfer and temperature for fermentation. Also, inducer and harvest time are critical for increasing output. The critical factors for fermentation in batch and fed-batch are different. Therefore, the choice of media defined, semi-defined or complex media with vitamins, minerals and trace elements needed to be considered for evaluation in DOE experiments. To evaluate the considerable amount of output and variables combination and application of various algorithms is done to achieve desired output. In all the optimisation process and advanced algorithms such as Artificial neural networks and Genetic Algorithm are applied to achieve the desired output efficiently. The need for innovation as per sustainable development goals (SDGs) for United Nations 2030 plan is needed to increase the reach of technologies to low income countries. The application of DOE can improve the yield and cost leading to improved access to Biomolecules, Biopharmaceuticals, enzymes and metabolites.
6. Future strategies
The selection of host cells for industrial application has some technical difficulties despite the availability of many gene manipulations theoretically in various organisms. The availability of a genetic map, gene exchange system, useful vector and transformation procedures, and metabolic pathways leading from raw material to the desired product are essential criteria for selecting a suitable host strain. The most popular organisms used to date for the expression of the recombinant proteins are
For the successful production of the recombinant protein-based vaccine, producing biologically active protein is an essential requirement that can be further scaled up. Production of a biologically active recombinant protein depends on the host cell’s microenvironment for expression and compatibility of codon usage.
The recombinant plasmid carrying cloned gene would behave differently compared to the original vector plasmid. It can be easily understood, as it is preserved under a delicate quasi-equilibrium state in the host cell. There are several reasons for the instability of recombinants. The higher the plasmid gene expression, the more segregants (plasmid free cells) tend to appear. The recombinant plasmid is relatively unstable when the cloned gene products are inhibitory to the host cells. Phenotypic instability of plasmid is due to the disappearance of the entire plasmid or the deletion of a specific region . Both plasmid copy number and plasmid loss rate are features affected by factors such as media composition growth rate and culture strategy  and other factors such as temperature, agitation rate, and pH .
Therefore, future strategies for optimisation of cultivations needed to be shift to conclusions evaluated during experimental phases before actual fermentation to identify role of batch, fed-batch or different media components. The utilisation of carbon, nitrogen and other minimally required nutrients during batch and fed-batch is critical for delivering output and achieve sustainable development goals (SDGs) for technological innovation. The method design and modelling approaches are future strategies for increasing output during a process development. Utilisation of one factor and carrying out experiments by statistical media optimisation can be improved by combining several algorithms such as Plakett Burman, Box–Behnken, Taguchi design, Central composite design, partial least squares modelling in determining optimal factors. Response surface methodology with Artificial neural network (ANNs) can be applied to difficult model kind of fermentation processes. A free artificial neural network is applicable for carrying out nonlinear regression models to optimise metabolic processes. These algorithms are combined and applied to increase productivity and optimise the product output by reducing the cost of fermentation and product development.
Optimisation of critical factors and nutrient sources is an essential step for metabolite, recombinant proteins before pilot fermentation. In this chapter, strategies, conventional, advanced process design are reviewed and detailed. DOE approaches with statistical evaluation are critical for process development are essential for saving experimentation time. The strategies and examples shared in this review have been analysed for ease of implementation, time consumption. The conditions and media designed needs to be further tested under realistic conditions, full scale process with replication to production setup.
Overall, this chapter detailed need of critical factors identification, their significant contribution in enhancing process of metabolite production. Also, recently, co-fermentation of glycerol and glucose in engineered
Puneet Kumar Gupta acknowledges financial support from Grace Lifetech Pvt. Ltd. for the study. Jyotheeswara Reddy Edula acknowledges support from the Tata Institute of Genomics and Society (TIGS).
Conflict of interest
The authors declare no conflict of interest.
Acronyms and Abbreviations
Design of Experiments
Analysis of variance
Response surface methodology
Ethylenediamine tetra acetic acid
Steepest ascent method
Minimum Inhibition Concentration
Minimum Growth Concentration
Bacterial Ghosts Quantification
Face centred CCD
Definitive screening designs
Quality by design
Artificial Neural Network
Sustainable Development Goals
Chinese Hamster Ovary