Open access peer-reviewed chapter

Using Cellular Automata and Global Sensitivity Analysis to Study the Regulatory Network of the L-Arabinose Operon

By Advait A. Apte, Stephen S. Fong and Ryan S. Senger

Submitted: March 5th 2012Reviewed: August 3rd 2012Published: May 8th 2013

DOI: 10.5772/52085

Downloaded: 1700

1. Introduction

The field of computational biologyhas grown significantly in recent years, allowing researchers to investigatecomplex biological systems in silico.In this chapter, a new method of combining cellular automata (CA) with global sensitivity analysis (GSA) is introduced. This method has the potential to determine which mechanisms of a regulated biological network contribute to characteristics such as stability and responsiveness. For dynamic models of biochemical reactions and networks, determining the correct values ofthe kineticparameters that govern the system is often problematic[12, 13, 19, 29, 30, 33, 36, 46, 47, 49, 50, 55, 57]. This isoften due to the difficulty ofobtaining accurate experimental measurements ofvital kineticconstants[6, 10, 17, 21, 32, 37]. This is currently the case for biochemical reactions occurring in complex environments in vivothat cannot be approximated through more simple experimentsin vitro.Complex gene regulatory networks are a prime example.Modelling these systems using aCA approach allows researchers to easily change kineticparameter values (individuallyand in combination) to study their effects on system function[4, 7, 8, 25, 38, 51]. Since the overall system function (i.e., the output molecule or action) is something that can be measured easily, CA modelling provides a method for determining “difficult-to-measure” parameters using “easy-to-measure” observations of the system being studied.However, the real challenge after conducting a CA study with combinatorial parameter variation is interpreting the results. It has been found that GSAis an extremely useful tool for identifying the parameters that most significantly affect overall model performance[9, 11, 16, 24, 28, 31, 44, 45, 56]. In this chapter,a new approach that combinesCA and GSA is applied for analysing regulatory mechanisms of the L-arabinose (ara) operon. In particular, the influence of the negative autoregulatory (NAR) action of the transcription factor AraC on system dynamics and stability is calculated. The purpose of this chapter is to provide instruction through a detailed example of how to apply CA and GSA simultaneously to analysea biological network that must be studied in vivo. An in-depth explanation of GSA and the regulatory elements of the araoperon are provided in the Introduction. Detailed descriptions of CA model building and system parameters as well as a comprehensive GSA tutorial are presented in the Materials and Methods section.Computational experiments illuminating the influence of the NAR mechanism on the studied regulatory network are presented and discussed throughout the rest of thechapter.

1.1. Global sensitivity analysis

GSA uses Monte Carlo simulations to calculate the outputs of a model over the entire range of all input parameters [39, 40]. This variance-based method calculates the contribution of each input parameter to the total variability of the model output. In other words, GSA is used to determine which inputs most significantly influence the output. This provides the investigator one or multiple targets that can be manipulated to effectively engineer the system. GSA differs significantly from the traditionally used method of partial gradient-based sensitivity analysis (SA). With traditional SA, the change in a model output is calculated by allowing only one parameter to vary, while keeping others constant. This variance in the model output is likely to change if all other parameters are held constant at different values. The GSA approach takes this into account and enables the consideration of multiple parameters simultaneously over the entire range of each parameter. To consider the model output variance caused by only a single parameter is a “first-order” analysis. Two parameters may be considered simultaneously to develop a measure of their interactions in a “second-order” analysis. Or, a single parameter can be considered with is interactions with all other parameters in the model. This is called the “total effect index” [39, 40]. Thus, another significant advantage of GSA over SA is that GSA accounts for the influence and interactions between input parameters over the entire input space. A simplified tutorial of GSA has been published for a deterministic ammonia emissions model [35]. The GSA methods are also presented in detail in this chapter for the araoperon model system.

1.2. The L-arabinose operon

Transcriptional regulation networks are largely made up of recurring regulatory patterns called network motifs. These network motifs have been shown to carry out many signal transduction functions[2, 3, 27, 48]. One of the most abundantly found network motifs is thenegative autoregulation (NAR) motif. In an NAR motif, a transcription factor (TF) negatively regulates the promoter of its own gene or operon. This has been found to

  1. dramatically increase response acceleration and

  2. increase the stability of the gene product concentration response to noise [26, 41].

The L-arabinose system is an example of an NAR network.L-arabinose,which is a five-carbon sugar foundin plant cell walls, is used as a carbon source by many organisms. The araoperon contains genes encoding enzymes leading to L-arabinose catabolism. The selective usage of the araoperon isone of the best-studied gene regulation systems and is well-characterized[5, 15, 22, 23, 34, 42, 52, 59]. The entire arabinose system consists of the following:

  1. the system specific transcription factor (TF) araC,

  2. the arabinose transporters (araE, araFGH,and araJ), and

  3. the araoperon containing the arabinose catalytic enzymes, araBAD.

Ultimately, the araoperon is responsible for the conversion of arabinose to D-xylulose-5-phosphate, which then enters the pentose phosphate pathway. The AraC TF regulates the araoperon [1, 14, 18, 20, 26, 52, 54, 59].Recent studies have shown that araCand the araoperon share a common regulatory protein,cAMP Receptor Protein (CRP),which is activated by cAMP. AraC both activates and represses the araoperonusing a DNA looping mechanism. As a negative regulator, AraC isde-activated by L-arabinose, allowing transcription of the araoperon. AraC represses its own promoter through a NAR motif[1, 14, 26, 43, 58].

1.3. Goals of the Modelling Effort

In this research, the overall influence of NAR on the dynamics of a regulated biological network was studied by applying a unique combination of CA and GSA. Here, the expression of araBADwas calculated in the

  1. presence and

  2. absence of NAR by AraC.

The CA approach was used to simulate this network given altered kinetic rate constants and initial concentrations. Then, the GSA approach was applied to determine which of these parameters most directly influence araBADexpression. When applied to the

  1. presence and

  2. absence of NAR, the difference in GSA results give clues to the influence of the NAR mechanism in regulating the system dynamics.

In the case studied in this research, NAR was found to equally distribute model sensitivity across all input parameters. This dramatically increases stability and responsivenessof the regulatory network. The approach presented in this chapter of combining CA and GSA can be applied to virtually any biologicalnetwork using the methods presented in this chapter.


2. Methods

2.1. Model Construction

The araoperon model was constructed using NetLogo simulation environment [53]. To perform a CA simulation, individual (agents) (i.e., interacting molecules) were allowed to move among (cells) (i.e., spatial locations) inside the simulation environment and undergobiochemical reactions with other agents in their Von Neumann neighbourhood. In all simulations, a two-dimensional16 x 16 matrix of cells was used as the simulation environment, and 10 time steps were executed. Whether a reaction occurs between interacting agents is governed by probability. The agents of the araoperon model are:

  1. L-arabinose,

  2. cAMP,

  3. AraC (the TF regulator),

  4. CRP, and

  5. AraBAD (representing gene products of the araoperon).

Characteristics and reaction rules for individual agents were predefined at model initialization in NetLogo. Basal expression levels for CRP, AraC, and AraBAD were set at 10, 0, and 0 cells respectively.The number of agents occupying cells represents concentration in agent based modelling. For example, CRP was present in 10 cells of the simulation environment upon initializing the simulation. Specifics of the varied model parameters are discussed in detail in the next section. These included

  1. the concentrations of L-arabinose and cAMP,

  2. the probabilities of biochemical reactions, and

  3. NAR by ArgC.

Monte Carlo methods were used to select 2000 values of each parameter to perform CA simulations. This was followed by 1000 independent iterations of the model to perform GSA. The CA simulation records activation of the araoperon measured as number of AraBAD agents present in cells at the end of the simulation. Simulations are also often run to record the number of model “events” required to reach a specified concentration of an agent of interest (e.g., AraBAD). Independent simulations use different values of the varied parameters, resulting in different values of the targeted agent. Two common approaches use

  1. a set number of model events to derive a target agent or

  2. a different number of model events required to reach a specified concentration of the target agent.

The first approach was used in this study. Two different scenarios for NAR byAraC regulation were simulated in this research:

  1. AraC is not allowed to negatively autoregulate its own promoter and

  2. NAR by AraC is allowed.

These simulations seek to understand the influence of the NAR mechanism on overall rigidity and robustness of the araoperon regulatory network.

2.2. System parameters

A mathematical model was created to simulate the dynamics of araoperon activation in the presence and absence of NAR by AraC. CA was applied by allowing critical parameters to vary over thousands of simulations of the system. GSA was then applied to the results to determine which system parameters most influence araoperon activation.The following parameters were taken into consideration while building and simulating the model. The upper and lower bounds of the parameters and a description of their functions are described in detail below.

Output (Y1)93...296

Table 1.

The M1matrix of GSA.

  1. conc_arabinose: The initial concentration of L-arabinose was allowed to vary between 1 and 250 cells. Upon binding,L-arabinoseactivates the TF and autoregulatorAraC.Unbound AraC inhibits transcription of the araoperon.

  2. conc_cAMP: Initial concentration of the second messenger cAMP was allowed to vary between 1 to 250cells. cAMP binds to CRP causing its activation. The cAMP-CRP positively regulates transcription of the araoperon.

  3. rate_CRP_activate: This parameter controls the probability of CRP activation by cAMP. This parameter was varied between 0 and 1 for the simulations described in this chapter. This probability parameter represents the rate of activation of CRP.

  4. rate_araC_activate: This parameter controls the probability of AraC activation by L-arabinose and was allowed to vary between 0 and 1.This probabilityultimately controlsthe activity of the araCgene and transcription of the araoperon.

  5. rate_araBAD_activate: This parameter controls the probability of araBADactivation by CRP and was allowed tovary between 0 and 1.

  6. rate_araC_autoreg: This parameter controls the NARbyAraC protein and was allowed tovary between 0 and 1.Thisrepresents the rate at which AraC supresses its promoter.

2.3. Global sensitivity analysis

GSA was performed on the araoperon activation model described in this chapter. The following step-by-step tutorial is given to combine CA with GSA.

The procedure starts with the derivation of the M1and M2matrices shown in Tables 1 and 2, respectively. To build each table, 1000 CA simulations were run given random values of the system parameters. For each simulation, the model output (activated araBADor expressed AraBAD) was calculated and recorded. The estimated unconditional means (E^Y) and estimated unconditional variances (V^Y) of the model outputs were calculated for both matrices according to the following, where Nis the number of simulations (i.e., 1000 for this study).


Next, the Pmatrix was created for the calculation of the first-order sensitivity index for each model parameter. To illustrate this example, the model parameter conc_cAMPwas used. The Pmatrix for this case is shown in Table 3.

The Pmatrix consists of the conc_cAMPparameter values from matrix M2, and all other parameters are assigned their values from M1. Then model outputs were calculated for the Pmatrix using these new inputs. Thus, 1000 more simulations are required for each parameter a first-order sensitivity index is desired. The first-order sensitivity index (Sconc_cAMP) was calculated by the following.


Next, the total effect index was calculated by creation of the Rmatrix for each parameter. The first-order index describes the influence of a single parameter on the model output directly. The total effect index takes into account all interactions of a parameter with all other parameters when determining the effect on model output.

Output (Y2)60893...3043

Table 2.

The M2matrix of GSA.

conc_cAMP (M2)190189227
conc_arabinose (M1)159179244
rate_CRP_activate (M1)0.0190.1030.500
rate_araC_activate (M1)0.4180.3910.406
rate_araBAD_activate (M1)0.5770.3260.445
rate_araC_autoreg (M1)000
Output (YP)12196...6

Table 3.

The Pmatrix of GSA.

The Rmatrix is shown in Table 4 for the conc_cAMPparameter example. To build the Rmatrix, the parameter values from M1for conc_cAMPwere used along with parameter values from M2for all other parameters. An additional 1000 CA simulations are required to calculate the model outputs for the Rmatrix. The calculation of the total effect index for rcAMP (ST(conc_cAMP)) was calculated as follows.


3. Results

The CA modeling of the araoperon was performed for two cases

  1. without NAR (i.e., negative autoregulation) by AraC and

  2. with NAR by AraC (as is observed experimentally).

GSA was applied to both cases in order to determine how the NAR mechanism impacts overall system dynamics. To simulate the model without NAR, the parameter rate_araC_autoregwas held constant at 0. The results of the GSA calculations derived from Eqs. 1-6 and the values in Tables 1-4 are given in Table 5. This case was simulated without NAR by AraC.

conc_cAMP (M1)10518832
conc_arabinose (M2)67212208
rate_CRP_activate (M2)0.8870.2810.671
rate_araC_activate (M2)0.6380.4130.693
rate_araBAD_activate (M2)0.5300.6200.891
rate_araC_autoreg (M2)000
Output (YR)288519...324

Table 4.

The Rmatrix of GSA.

The first-order sensitivity indices for each system parameter for the araoperonmodel without NAR by AraCare reported in Fig. 1. These values are reported as a percentage of the summation of all first-order index values. When interactions between single parameters are not taken into consideration, the probability of CRP activation was found to be the single most important parameter significantly influencing the araBADactivation(29.58%).All other parameters show similar influence (~17%). The total effect indices for the araoperon model without NAR by AraC is shown in Fig. 2. When all the interactions between all parameters were considered, probability of CRP activation (rate_CRP_activateparameter) was shown to have most influence (29.47%) over araBADactivation. The more noticeable result is the small contribution from the initial concentration of L-arabinose (conc_arabinoseparameter). This is significant because the araoperon is known to require the presence of L-arabinose to be active in the cell.

E^Y11(estimated unconditional mean of M1)256.78
E^Y12(estimated unconditional mean of M2)228.49
V^Y1(estimated unconditional variance of M1)309411.56
V^Y2(estimated unconditional variance of M2)N/A
Sconc_cAMP(first-order sensitivity index of conc_cAMP)0.18
ST(conc_cAMP)(total effect index of conc_cAMP)0.94

Table 5.

GSA calculations for the rcAMPparameter

Figure 1.

First-order indices calculated by GSA for the case without NAR by AraC.

The first-order indices for each parameter for the araoperonmodel with NAR by AraC are reported in Fig. 3. By activating the NAR role of AraC, the first-order indices show very close index values for all parameters (~16.5%). In other words, the NAR reduced the excessive influence of CRP activation over araBADactivation. The total effect indices are shown in Fig. 4. A pattern similar to that revealed by first-order indices was obtained. All total effect indices were also similar for all parameters (~16.5%).Adding the NAR by AraCto the regulation network dramatically increased the influence of L-arabinose concentration on araBADactivation.

Figure 2.

Total effect indices calculated by GSA for the case without NAR by AraC.

Figure 3.

First-order indices calculated by GSA for the case with NAR by AraC.


4. Discussion

In this study, a unique combination of CA and GSA were used to study the parameters that influence the dynamics of the araoperon regulatory network. The results of the GSA study revealed the degree to which individual parameters affect the output of a biological model.GSA was used to explorethe influence of NAR on the regulatory network by calculating the impact of parameter variance on model output.Comparing first-order and total effect sensitivity indices with and without NAR by AraC elucidates the roles NAR plays in the signaling network. These include

  1. increasing network stability and

  2. increasing the response of the network to L-arabinose concentrations.

Equal distribution of variation among all parameters suggests that the NAR mechanism increases network robustness,providing protection against random perturbations (both biological and environmental) of the system.

Figure 4.

Total effect indices calculated by GSA for the case with NAR by AraC.

GSA has shown that parameter sensitivity indices can provide useful insight in interpreting the results of CA simulations.Thus, the combination of CA and GSA provides a valuable tool for the identification of source of output variability. In the case of the araoperon, all model parameters showed to contribute equally to the variance of araBADactivationlevel. While all the systemparameters are important and can significantly influence araBADactivation, those parameters with higher first-order sensitivities can have profound effects on regulation of the araoperon if NAR function by AraC is lost. This scenario demonstrates the potential of CA and GSA for identifying targets for manipulating highly interconnected gene regulatory networks.

© 2013 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of the Creative Commons Attribution 3.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite and reference

Link to this chapter Copy to clipboard

Cite this chapter Copy to clipboard

Advait A. Apte, Stephen S. Fong and Ryan S. Senger (May 8th 2013). Using Cellular Automata and Global Sensitivity Analysis to Study the Regulatory Network of the L-Arabinose Operon, Emerging Applications of Cellular Automata, Alejandro Salcido, IntechOpen, DOI: 10.5772/52085. Available from:

chapter statistics

1700total chapter downloads

More statistics for editors and authors

Login to your personal dashboard for more detailed statistics on your publications.

Access personal reporting

Related Content

This Book

Next chapter

Cellular Learning Automata and Its Applications

By Amir Hosein Fathy Navid and Amir Bagheri Aghababa

Related Book

First chapter

An Interactive Method to Dynamically Create Transition Rules in a Land-use Cellular Automata Model

By Hasbani, J.-G., N. Wijesekara and D.J. Marceau

We are IntechOpen, the world's leading publisher of Open Access books. Built by scientists, for scientists. Our readership spans scientists, professors, researchers, librarians, and students, as well as business professionals. We share our knowledge and peer-reveiwed research papers with libraries, scientific and engineering societies, and also work with corporate R&D departments and government entities.

More About Us