Application of ANN in bioprocess industries.
Abstract
Low cost cellulase production has become a major challenge in recent years. The major hurdle in the production of biofuel and other products from biomass is the lack of efficient economically feasible cellulase. This can be achieved by proper monitoring and control of bioprocess. In order to implement any control scheme, the accurate representation of the system in the form of a model is necessary. There are many challenges associated with modeling the fermentation process such as inherent nonlinear dynamic behavior, complexity of process due to co-existence of viable and nonviable cells, presence of solid substrates, etc. Toward the achievement of this goal, researchers have been developing new techniques that can be used to monitor the process online and at-line. These newer techniques have paved the way for designing better control strategies that can be integrated with quality by design (QbD) and process analytic technology (PAT).
Keywords
- biomass
- lignocellulosic substrates
- hybrid model
- NARX
- LSTM
1. Introduction
Monitoring and control of biomass plays a vital role during fermentation process [1]. Low cost Production of ethanol/cellulase from lignocellulosic substances has been a topic of research for the past few years as it uses the carbon source from industrial or agricultural waste [2, 3]. These processes mostly involve fungi such as
2. Biomass estimation techniques
The first principles model based on mass or energy balance equations is widely used in industries. Unstructured models such as monod model captures the process dynamics effectively only during log phase and on the other side, structured models have more parameters and are difficult to use [5]. The kinetic parameters in first principles model such as specific growth rate, biomass yield are found using laborious experimentation methods [6].
New process monitoring approaches use several online sensors to determine the concentration of viable biomass that are found useful during control of fermentation process. A simple on-line method for fungal biomass estimation based on agitation rate has been evolved for DO stat cultures. The estimator is developed based on changes in dissolved oxygen concentration in the initial transient time and yield change. The estimation parameters are found using agitation rate at 20% of DO concentration [7].
Another method for estimation of viable biomass is the
In recent years, soft sensors are widely used for the estimation of biomass.
Artificial Neural Networks (ANN) based soft sensor have the capability to learn nonlinearity of the process using experimental plant data and thus can be used to estimate the state of bioprocess such as biomass concentration [12, 13]. The rapid development of algorithms and information technology is the major motivation behind the broad application of ANNs in research and development [14]. Currently, ANNs are employed in the prediction of various outcomes including process control, medicine, forensic science, biotechnology, weather forecasting, finance and investment and food science. However, it is noteworthy to state that the use of ANNs in biofuel production is currently in the early phases of its development. Generally, microbial fermentations exhibit non-linear relationships which could pose several problems during bioprocess modeling and optimization. The application of robust models such as ANN helps to capture this nonlinear behavior, and thus provides a model that links the process inputs to the corresponding output parameters. On comparison with other empirical models, neural networks are relatively less sensitive to noise and hence can be applied to process control systems with higher level of uncertainty [15]. During batch/fed-batch, ANN can be effectively used for estimation of biomass or product, optimization of fed-batch run and online control of bioprocess systems [16, 17, 18]. ANNs are suitable for many applications such as nonlinear filtering, prediction of output using input are widely used in the modeling of dynamic systems [19]. Several ANNs such as feed forward back-propagation neural network (FFBPNN) [20], Hopfield [21], radial basis function (RBF) networks [22], recurrent [23] and hybrid neural network (HNN) [24] found extensive application in bioprocess industries based on their functions. The BPNN with supervised learning has been reported in biofuel process modeling as shown in Table 1.
Input parameters | Output parameters | ANN Type | ANN structure | R2 value | References |
---|---|---|---|---|---|
pH | Biomass concentration | — | 1–2-1 | — | [3] |
Sin (glucose) | Biomass concentration | HNN | 1–3-1 | — | [24] |
Sin (glucose), sodium nitrate concentration, yeast extract concentration | Lipid Productivity, biomass concentration | FFBPNN | 3–10-1 | 0.99 | [25] |
Table 1.
Sin, initial substrate concentration; R2, coefficient of determination.
3. Biomass modeling methods
The Food and Drug Administration (FDA) of United States has initiated the online monitoring and closed loop control of a bioprocess via Process Analytical Technology (PAT) initiative. PAT highlights the concept of process understanding in order to deliver high quality products and this can be achieved by the design of accurate bioprocess models. The mathematical models are in general classified as black box, white box and gray box models.
3.1 Black box models
Black box models are input/output models that do not require
where
The input output experimental data is used to determine the values of variables a, b and the order of the polynomials
3.2 White box models
White box models are mechanistic models that are been used widely used incorporates the available process knowledge in the form of first principle model equations. As an example, the first principles model widely used for fungal fermentation during cellulase production are represented in Eq. (5), Eq. (6), Eq. (7) as follows [26].
where
3.3 Hybrid models
Gray box models also known as hybrid models are considered as an effective tool for model identification. These models combine

Figure 1.
Configurations of a hybrid model. (a) Parallel form (b) Serial form.
The parallel configuration Figure 1(a) uses complete first principles model and the error between outputs of this model and real-time process are modeled. The serial configuration Figure 1(b) is used when there is less number of unknown parameters. This is the most frequently used hybrid model structure. The choice of hybrid configuration strongly depends on the First principles model structure. When the First principles model is not accurate, Parallel configuration is a better choice as parallel hybrid model can compensate for the First principles model mismatch [29]. When the First principles model is accurate, serial configuration seems to be a better choice as it offers better extrapolation [30].
3.3.1 Hybrid fuzzy models
A non-linear process such as bioreactor can be modeled using Fuzzy logic via Rule base analysis. Initially a model structure is chosen wherein the number of inputs and number of fuzzy sets per variable are found, then estimation of fuzzy membership function parameters with regard to its shape, position is carried out and finally the rule base mapping from fuzzy sets to functions are carried out. The fuzzy logic system draws decision with the help of Fuzzy Inference System which uses “If ..Then” rules along with OR, AND connectors. For example, if specific growth rate is high, biomass concentration is high; else if specific growth rate is low, biomass concentration is low. Fuzzy models render transparency and therefore have the advantage of interpretability when compared to other data- driven methods. The popular fuzzy models in use are the Takagi-Sugeno and the Mamdani type. Takagi-Sugeno type fuzzy models are suitable for modeling the non-linearity of the process and can be represented as many linear models in parallel, wherein the sub-models are chosen based on some specified rule. These are generally well suited to perform mathematical analysis and can be applied in Multiple Input Single Output (MISO) systems. Also, in Takagi-Sugeno model, output membership function are either constant or linear. The Mamdani type fuzzy model has a fuzzy logic set as the output of each rule. These are well suited to perform with manual input and can be applied in Multiple Input Single Output (MISO) systems and Multiple Input Multiple Output (MIMO) systems. The output membership function is present and the output is not continuous. Moreover, the Mamdani type fuzzy model is more accurate than Takagi-Sugeno, but it requires estimation of huge number of parameters.
Hybrid Fuzzy models comprises both first principles model equations and Fuzzy models [31]. The parameter estimation of hybrid fuzzy models are commonly done by Kalman filter. Fuzzy models are identified from experimental process data commonly by Fuzzy Clustering Method (FCM). Improvements in accuracy and performance of fuzzy models can be achieved by implementation of FCM based on Artificial Bee Colony (FCMABC), FCM based on Chicken Swarm Oprimization (FCMCSO), etc.
A case study to exhibit the Hybrid fuzzy modeling approach, the low cost cellulase production process is considered where the objective is to find the product concentration (cellulase) based on the interactions between the biomass and substrate.
The following assumptions are made during the batch/fed-batch operation of the bioreactor. The reactor is completely mixed and the feed flow rate (
The first principles model consists of 4 state equations for biomass concentration X (g/l), Substrate concentration S (g/l), Product concentration P (g/l) and volume of the bioreactor V (l) as defined in Eq. (8–11).
where
where
The kinetic parameters
0.32 | |
5 | |
3 | |
7 |
Table 2.
Results of Kalman filter.
From the table, it is observed that the filter is tuned properly. In this work, the fuzzy sub-model identification for specific growth rate and product formation rate are done with fuzzy clustering. The basic idea is to form clusters (similar groups) with the available experimental data. Each cluster exhibit an independent rule in the rule base. The advantage of using fuzzy clustering method is that the experimental data is focused and from that, the fuzzy model with independent rules is developed. The fuzzy models for kinetic parameters are represented in Figure 2 and the hybrid model output in comparison with experimental data is illustrated in Figure 3. The optimization of fuzzy model parameters will improve the performance of hybrid model.

Figure 2.
Fuzzy model for kinetic parameters μ and

Figure 3.
Performance of hybrid model for biomass(a), substrate(b) and product (c) concentrations.
3.3.2 Hybrid ANN models
Hybrid ANN models are combination of first principles and ANN wherein the ANNs are used to estimate the kinetic parameters (black box models) [32]. The hybrid model shown in Figure 4 is a combination of neural network estimator with the Mass Balance equations (Mathematical model). The neural network estimator is capable of estimating the process parameters from the real time measurements and these kinetic parameters (μ and Ys/x) are updated in the mass balance equations to give the value of the state variables in the next time instant.

Figure 4.
Hybrid model structure.
In general, the kinetic parameters are determined offline from experimental data. Due to the ability of neural networks to learn and model non-linear relationships, the parameter values can be estimated after proper training. Neural network with varying number of hidden neurons has been trained and MSE between the actual data and estimated data are calculated. Network with less MSE has been selected to find optimal hidden neurons. In this case study, a neural network structure comprising of two layer feed-forward network with sigmoid hidden neuron and linear output neuron is used. The state variables
where S(t) is the substrate concentration,
Where
The response from the hybrid model obtained for training input is shown in Figure 5(a) and test input is shown in Figure 5(b).

Figure 5.
Validation of hybrid model with training and testing data set.
The process parameters μ and YX/S change with respect to time and the corresponding state variable measurements. The time varying natures of the parameters are shown in Figure 6.

Figure 6.
Variation of kinetic parameters μ and YX/S with respect to time.
3.4 NARX models
Non-linear regressive models with exogenous input (NARX) are found to be effective for non-linear system identification, as they have good predictive capability. To predict the system behavior without a deep mathematical knowledge [33, 34], model identification with input–output measurements is generally used. For closer approximations of actual process, a NARX model is commonly employed [35]. The NARX is a recurrent dynamic network, with feedback connections enclosing several layers of the network. The defining equation for the NARX model is given in Eq. (17).
where
As a case study [37], a NARX model is developed for estimation of biomass concentration using the dataset of pH, agitation speed and substrate concentration values starting from the time of inoculation till the end of fed batch process. The experimental data is divided into training, testing and validation in the ratio 70:15:15. The inputs for the NARX model are present values of pH, substrate concentration (S), agitation speed and previous sampling instant biomass concentration, X(k-1) and the output is the estimated biomass concentration, X(k) as shown in Figure 7.

Figure 7.
NARX model for estimation of biomass concentration.
To obtain best performance from NARX model, two hidden layers are used and the numbers of hidden neurons in each layer are chosen based on Mean Square Error (MSE). ANN parameters used in the NARX model development are listed in Table 3.
Parameters | Value |
---|---|
Architecture | Dynamic neural network (NARX) |
Training Algorithm | Levenberg–Marquardt algorithm |
Number of iterations | 1000 |
Number of hidden layer | 2 |
Number of hidden neurons in first layer | 18 |
Number of hidden neurons in second layer | 10 |
Bias | 1 |
Activation function (hidden layer) | Sigmoid |
Activation function (Output) | Linear |
Performance Evaluation | Mean Square Error |
Table 3.
ANN parameters for NARX model development.
Experimental validation of NARX Model is done with the help of capacitance probe. The annular dielectric probe when inserted into the bioreactor, gives a capacitance value that can be directly related to the concentration of biomass. The probe is useful particularly during fungal biomass cultivation due to the nonexistence of accurate offline measurements [38]. As the probe measures only the viable cells excluding the dead cells and insoluble substrates, it is an optimal choice for process validation. The probe generates a dielectric spectrum at 2 different frequencies given, based on cell size, morphology etc. [39]. In this case study, the capacitance probe is utilized in microbial mode and the biomass is measured at frequencies 0.6 MHz and 15 MHz. The 15 MHz reading is used as a form of auto zero and subtracted from the 0.6 MHz. Therefore, the capacitance of background matter is automatically subtracted from the signal. A resolution of 0.1 pF/cm on the instrument typically represents 106 Cells/ml, or 0.5 grams per liter. The dynamic NARX network are trained with different sets of input–output data. The response of NARX network to test inputs is shown in Figure 8.

Figure 8.
Comparison of NARX model output with experimental biomass concentration data.
It is inferred from Figure 8, that the trained dynamic NARX network can be used in the place of biosensor. The error response for single NARX network is shown in Figure 9.

Figure 9.
Error plot for the NARX model.
The performance of NARX network is analyzed based on the performance criteria, Root Mean Square Error (RMSE) and coefficient of determination (R2). RMSE is calculated by the Eq. (18).
where

Figure 10.
Correlation graph between biomass concentrations monitored with real-time capacitance probe and estimated by the NARX model.
It is observed that the NARX model has a low value of RMSE, a very high value of R2 and good correlation to real time probe data, which confirms that this dynamic neural network soft sensor performs well in the estimation of biomass concentration.
3.5 LSTM models
Recurrent Neural Networks (RNNs) are a type of deep networks that are structured to capture the temporal dependencies of the process effectively [40]. Long Short-Term Memory (LSTM) networks are a type of recurrent neural network capable of learning order dependence in sequence prediction problems. The LSTM network was invented with the goal of addressing the vanishing gradients problem. The key insight in the LSTM design was to incorporate nonlinear, data-dependent controls into the RNN cell, which can be trained to ensure that the gradient of the objective function with respect to the state signal does not vanish [41] and hence LSTMs are well suited for classification and prediction problems. The LSTM model developed for the estimation of maximum specific growth rate is shown in Figure 11. The input gate, forget gate and output gate equations are given by Eq. (19), (20) and (21) respectively.

Figure 11.
LSTM model developed for maximum specific growth rate estimation.
The new memory cell and final memory cell equations are given as Eq. (22) and Eq. (23). The hidden state equation is represented in Eq. (24).
where,
Similar to the development of NARX model, the modeling of LSTM network also includes data collection, parameter determination, training, testing, validation. The modeling can be done either in MATLAB or using python coding. As a case Study, consider a LSTM network in which two hidden layers are chosen and the number of neurons in each hidden layer is varied till the MSE reaches minimum value. The parameters chosen to frame the LSTM network are listed in Table 4. The training to testing ratio is chosen as 67:33. The predicted values of maximum specific growth rate calculated from the LSTM model are presented in Figure 12.
Parameter | Choice | |
---|---|---|
Batch size | 50 | |
No. of hidden layers | 2 | |
No. of hidden neurons | Hidden layer 1 | Hidden layer 2 |
15 | 47 | |
Activation function | Sigmoid | |
Optimizer | Adam | |
Learning rate | 0.01 | |
No. of epochs | 800 | |
Table 4.
Parameters for LSTM model development.

Figure 12.
Comparison of LSTM predicted maximum specific growth rate data with experimental data.
The performance of LSTM model is evaluated by the statistical measures RMSE, R2 and Accuracy factor (Af). The Af averages the distance between every point and the line of equivalence as a measure of finding the closeness between predicted and observed values. The RMSE, R2, Af values of the LSTM model are found to be 0.011, 0.994 and 1.024 respectively. The RMSE and Af values are minimum which suggests that the LSTM predictive model fit well with the experimental data.
4. Conclusions
Several modeling techniques that will aid in the monitoring and estimation of fungal biomass in the presence of lignocellulosic substrates during fed-batch fermentation are discussed in this chapter. Moreover, the bioprocess models are validated with experimental data as discussed in case studies. The use of these soft sensors in industries with accompanying control system will improve the cellulase concentration yield.
Acknowledgments
The author acknowledge Anna University and Department of Science and Technology for their funding and support.
References
- 1.
Neves AA, Pereira DA, Vieira LM, Menezes JC. Real time monitoring biomass concentration in Streptomyces clavuligerus cultivations with industrial media using a capacitance probe. Journal of biotechnology. 2000 Nov 17;84(1):45–52. - 2.
Ellilä S, Fonseca L, Uchima C, Cota J, Goldman GH, Saloheimo M, Sacon V, Siika-Aho M. Development of a low-cost cellulase production process using Trichoderma reesei for Brazilian biorefineries. Biotechnology for biofuels. 2017 Dec 1;10(1):30. - 3.
Sarkar N, Ghosh SK, Bannerjee S, Aikat K. Bioethanol production from agricultural wastes: an overview. Renewable energy. 2012 Jan 1;37(1):19–27. - 4.
Lynd LR, Weimer PJ, Van Zyl WH, Pretorius IS. Microbial cellulose utilization: fundamentals and biotechnology. Microbiology and molecular biology reviews. 2002 Sep 1;66(3):506–77. - 5.
Lin J, Lee SM, Lee HJ, Koo YM. Modeling of typical microbial cell growth in batch culture. Biotechnology and Bioprocess Engineering. 2000 Oct 1;5(5):382–5. - 6.
Zelić B, Vasić-Rački Đ, Wandrey C, Takors R. Modeling of the pyruvate production with Escherichia coli in a fed-batch bioreactor. Bioprocess and Biosystems Engineering. 2004 Jul 1;26(4):249–58. - 7.
Jeong-Geol N, Kim HH, Chang YK. On-line estimation of cell growth from agitation speed in DO-stat culture of a filamentous microorganism, Agaricus blazei. Biotechnology and Bioprocess Engineering. 2005 Dec 1;10(6):571. - 8.
Norsyahida A, Rahmah N, Ahmad RM. Effects of feeding and induction strategy on the production of BmR1 antigen in recombinant E. coli . Letters in applied microbiology. 2009 Nov;49(5):544–50. - 9.
Holland T, Blessing D, Hellwig S, Sack M. The in-line measurement of plant cell biomass using radio frequency impedance spectroscopy as a component of process analytical technology. Biotechnology journal. 2013 Oct;8(10):1231–40. - 10.
Slouka C, Wurm DJ, Brunauer G, Welzl-Wachter A, Spadiut O, Fleig J, Herwig C. A novel application for low frequency electrochemical impedance spectroscopy as an online process monitoring tool for viable cell concentrations. Sensors. 2016 Nov;16(11):1900. - 11.
Bogaerts P, Hanus R. On-line state estimation of bioprocesses with full horizon observers. Mathematics and computers in simulation. 2001 Jun 11;56(4–5):425–41. - 12.
James S, Legge R, Budman H. Comparative study of black-box and hybrid estimation methods in fed-batch fermentation. Journal of process control. 2002 Jan 1;12(1):113–21. - 13.
Won H, Yoon-Keun C. An artificial neural network for biomass estimation from automatic pH control signal. Biotechnology and Bioprocess Engineering. 2006 Aug 1;11(4):351–6. - 14.
Veloso AC, Rocha I, Ferreira EC. Monitoring of fed-batch E. coli fermentations with software sensors. Bioprocess and biosystems engineering. 2009 Apr 1;32(3):381–8. - 15.
Zelić B, Bolf N, Vasić-Rački Đ. Modeling of the pyruvate production with Escherichia coli : comparison of mechanistic and neural networks-based models. Bioprocess and biosystems engineering. 2006 Jun 1;29(1):39–47. - 16.
Garcíaa-Gimeno RM, Hervás-Martíanez C, Barco-Alcalá E, Zurera-Cosano G, Sanz-Tapia E. An artificial neural network approach to Escherichia coli O157: H7 growth estimation. Journal of Food Science. 2003 Mar;68(2):639–45. - 17.
Vlassides S, Ferrier JG, Block DE. Using historical data for bioprocess optimization: modeling wine characteristics using artificial neural networks and archived process information. Biotechnology and Bioengineering. 2001 Apr 5;73(1):55–68. - 18.
Haider MA, Pakshirajan K, Singh A, Chaudhry S. Artificial neural network-genetic algorithm approach to optimize media constituents for enhancing lipase production by a soil microorganism. Applied biochemistry and biotechnology. 2008 Mar 1;144(3):225–35. - 19.
Menezes Jr JM, Barreto GA. A new look at nonlinear time series prediction with NARX recurrent neural network. In2006 Ninth Brazilian Symposium on Neural Networks (SBRN'06) 2006 Oct 23 (pp. 160–165). IEEE. - 20.
Whiteman JK, Kana EG. Comparative assessment of the artificial neural network and response surface modelling efficiencies for biohydrogen production on sugar cane molasses. BioEnergy Research. 2014 Mar 1;7(1):295–305. - 21.
Kmet T, Kmetova M. Adaptive critic design and Hopfield neural network based simulation of time delayed photosynthetic production and prey–predator model. Information Sciences. 2015 Feb 10;294:586–99. - 22.
Dewasme L. Neural network-based software sensors for the estimation of key components in brewery wastewater anaerobic digester: an experimental validation. Water Science and Technology. 2019 Nov 15;80(10):1975–85. - 23.
Huang Y, Kangas LJ, Rasco BA. Applications of artificial neural networks (ANNs) in food science. Critical reviews in food science and nutrition. 2007 Apr 1;47(2):113–26. - 24.
Wu Z, Shi X. Optimization for high-density cultivation of heterotrophic Chlorella based on a hybrid neural network model. Letters in applied microbiology. 2007 Jan;44(1):13–8. - 25.
Mohamed MS, Tan JS, Mohamad R, Mokhtar MN, Ariff AB. Comparative analyses of response surface methodology and artificial neural network on medium optimization for Tetraselmis sp. FTC209 grown under mixotrophic condition. The Scientific World Journal. 2013 Oct;2013. - 26.
Ma L, Li C, Yang Z, Jia W, Zhang D, Chen S. Kinetic studies on batch cultivation of Trichoderma reesei and application to enhance cellulase production by fed-batch fermentation. Journal of biotechnology. 2013 Jul 20;166(4):192–7. - 27.
Oliveira R. Combining first principles modelling and artificial neural networks: a general framework. Computers & Chemical Engineering. 2004 May 15;28(5):755–66. - 28.
Peres J, Oliveira R, De Azevedo SF. Knowledge based modular networks for process modelling and control. Computers & Chemical Engineering. 2001 May 1;25(4–6):783–91. - 29.
Bhutani N, Rangaiah GP, Ray AK. First-principles, data-based, and hybrid modeling and optimization of an industrial hydrocracking unit. Industrial & engineering chemistry research. 2006 Nov 8;45(23):7807–16. - 30.
Corazza FC, Calsavara LP, Moraes FF, Zanin GM, Neitzel I. Determination of inhibition in the enzymatic hydrolysis of cellobiose using hybrid neural modeling. Brazilian Journal of Chemical Engineering. 2005 Mar;22(1):19–29. - 31.
Van Lith PF, Betlem HB, Roffel B. Hybrid fuzzy-first principles modelling using fuzzy clustering, genetic algorithms and neuro-fuzzy methods: a comparison. IFAC Proceedings Volumes. 2000 Jun 1;33(10):455–60. - 32.
Valente E, Rocha I, Rocha M. Modelling fed-batch fermentation processes: an approach based on artificial neural networks. In2nd International Workshop on Practical Applications of Computational Biology and Bioinformatics (IWPACBB 2008) 2009 (pp. 30–39). Springer, Berlin, Heidelberg. - 33.
Linder J, Enqvist M. Identification of systems with unknown inputs using indirect input measurements. International Journal of Control. 2017 Apr 3;90(4):729–45. - 34.
Ang J, Ingalls B, McMillen D. Probing the input–output behavior of biochemical and genetic systems: system identification methods from control theory. InMethods in enzymology 2011 Jan 1 (Vol. 487, pp. 279–317). Academic Press. - 35.
Chetouani Y. Non-linear modeling of a reactor-exchanger by using NARX neural networks. InProceedings of European Congress of Chemical Engineering (ECCE-6) Copenhagen 2007 Sep 16 (Vol. 1620). - 36.
Won H, Yoon-Keun C. An artificial neural network for biomass estimation from automatic pH control signal. Biotechnology and Bioprocess Engineering. 2006 Aug 1;11(4):351–6. - 37.
Murugan C, Natarajan P. Estimation of fungal biomass using multiphase artificial neural network based dynamic soft sensor. Journal of microbiological methods. 2019 Apr 1;159:5–11. - 38.
Carvell J, Lee M, Bhat AR. Recent developments in scaling down and using single use probes for measuring the live cell concentration by dielectric spectroscopy. InBMC proceedings 2015 Dec 1 (Vol. 9, No. S9, p. P46). BioMed Central. - 39.
Kiviharju K, Salonen K, Moilanen U, Eerikäinen T. Biomass measurement online: the performance of in situ measurements and software sensors. Journal of industrial microbiology & biotechnology. 2008 Jul 1;35(7):657–65. - 40.
Wong WC, Chee E, Li J, Wang X. Recurrent neural network-based model predictive control for continuous pharmaceutical manufacturing. Mathematics. 2018 Nov;6(11):242. - 41.
Sherstinsky A. Fundamentals of recurrent neural network (rnn) and long short-term memory (lstm) network. Physica D: Nonlinear Phenomena. 2020 Mar 1;404:132306.