Theoretical and experimental values of the scaling law for the first order moment for different values of the crosscorrelation.
1. Introduction
Water is essential to all forms of life. The development of humanity is associated to the use of water, and nowadays, the constant availability and satisfaction of water demand is a fundamental requirement in modern societies. Although water seems to be abundant on our planet, fresh water is not an inexhaustible resource and has to be managed in a rational and sustainable way. The demand for water is dynamic and influenced by various factors, from geographic, climatic and socioeconomic conditions, to cultural habits. Even within the same neighbourhood the userspecific water demand is elastic to price, condition of the water distribution system (WDS), air temperature, precipitation, and housing composition (regarding only residential demand in this case). On top of all these factors, demand varies during the day and the week.
Traditionally, for WDS modelling purposes, water demand is considered as being deterministic. This simplification worked relatively well in the past, since the major part of the studies on water demands were conducted only with the objective of quantifying global demands, both on the present and on the longterm. With the development of optimal operating schedules of supply systems, hourly water demand forecasting started to become increasingly more important. Moreover, taking in consideration all the aforementioned factors that influence water use, it is clear that demand is not deterministic, but stochastic. Thus, more recently, in order to guarantee the requested water quantities with adequate pressure and quality, the studies began to focus on instantaneous demands and their stochastic structure.
1.1. Descriptive and Predictive Models for Water Demand
The first stochastic model for (indoor) residential water demands was proposed by Buchberger and Wu [1]. According to the authors, residential water demand can be characterized by three parameters: frequency, duration and intensity, which in turn can be described by a Poisson rectangular pulse process (PRP). The adopted conceptual approach is relatively similar to basic notions of queuing theory: a busy server draws water from the system at a random, but constant, intensity and, during a random period of time. Residential demands were subdivided into deterministic and stochastic servers. Deterministic servers, including washing machines and toilets, produce pulses which are always similar. Stochastic servers, like water taps, instead produce pulses with great variability, and their duration and intensity are independent. The PRP process found to best describe water demand is nonhomogeneous, i.e., when the pulse frequency is not constant in time. Different authors used real demand data to assess the adequacy of the nonhomogeneous PRP model, achieving good results [2]. Moreover, the PRP model was confirmed to allow the characterization of the spatial and temporal instantaneous variability of flows in a network, unlike the traditional models that use spatial and temporal averages and neglect the instantaneous variations of demand. One drawback to the rectangular pulse based models is the fact that the total intensity is not exactly equal to the sum of the individual intensities of overlapping pulses, due to increased head loss caused by the increased flow [3]. This problem can however be solved by introducing a correction factor. The daily variability of demand represents another drawback to the PRP model, since it can invalidate the hypothesis that pulses arrive following a time dependent Poisson process [2]. One possible solution to this question is to treat the time dependent nonhomogeneous process as a piecewise homogeneous process, by dividing the day into homogeneous intervals [4]. Another solution consists in using an alternative demand model: the cluster NeymanScott rectangular pulse model (NSRP), proposed by Alvisi [5]. The model is similar to the PRP model, but the total demand and the frequency of pulses are obtained in different ways. In the PRP model the total water demand follows a Poisson process resulting from the sum of the singleuser Poisson processes, with a single arrival rate. In the NSRP model, a random number of individual demands (or elementary demands) are aggregated in demand blocks. The origin of the demand blocks is given by a Poisson process, with a certain rate between the subsequent arrivals. The temporal distance between the origins of each of the elementary demands to the origin of the demand block, follows an exponential distribution with a different rate. The variation of these parameters during the day reflects the cyclic nature of demands. A good approximation of the statistical moments for different levels of spatial and temporal aggregation was achieved; however, the variance of demand becomes underestimated for higher levels of spatial aggregation.
The aforementioned models are mainly descriptive. More recently, Blokker and Vreeburg [6] developed a predictive enduse model, based on statistical information about users and end uses, which is able to forecast water demand patterns with small temporal and spatial scales. In this model, each enduse is simulated as a rectangular pulse with specific probability distribution functions for the intensity, duration and frequency, and a given probability of use over the day. Enduses are discriminated into different types (bath, bathroom tap, dish washer, kitchen tap, shower, outside tap, washing machine, WC). The statistical distribution for the frequency of each enduse was retrieved from survey information from the Netherlands. The duration and intensity were determined, partly from the survey and partly from technical information on waterusing appliances. From the retrieved information, a diurnal pattern could be built for each user. Users represent a key point in the model and are divided into groups based on household size, age, gender and occupation. Simulation results were found to be in good agreement with measured demand data. The EndUse model has also been combined with a network solver, obtaining good results for the travel times, maximum flows, velocities and pressures [7].
The PRP and the EndUse model were compared against data from Milford, Ohio. The achieved results showed that both models compare well with the measurements. The EndUse model performs better when simulating the demand patterns of a single family residence, while the PRP models describes more accurately the demand pattern of several aggregated residences [8]. The main difference between the models is the number of parameters they use: the PRP model is a relatively simple model that has only a few parameters, while the EndUse model has a large number of parameters. However, the EndUse model is very flexible towards the input parameters, which also have a clearer physical meaning and hence more intuitive to calibrate. The PRP model describes the measured flows very well. From the analytic description provided by the PRP model, a lot of mathematical deductions can be made. Thus, one can classify the PRP model as a descriptive model with a lot of potential to provide insight into some basic elements of water use, such as peak demands [9]and crosscorrelations [10]. The EndUse model is a Monte Carlo type simulation that can be used as a predictive model, since it produces very realistic demand patterns. The EndUse model can be applied in scenario studies to show the result of changes in water using appliances and human behaviour. Possible improvements to the model include the incorporation of leakage, the consideration of demands as a function of the network pressure and the application of the model outside the Netherlands [11]. Li [10]studied the spatial correlation of demand series that follow PRP processes. It was verified that while time averaged demands that follow a homogeneous PRP process are uncorrelated, demands that follow a nonhomogenous PRP process are correlated, and that this correlation increases with spatial and temporal aggregation. A similar conclusion about the correlation was achieved by Moughton [12]from field measurements.
1.2. Uncertainty and reliabilitybased design of water distribution systems
The problem of WDS design consists in the definition of improvement decisions that can optimize the system given certain objectives. As aforementioned, in the earliest works regarding the optimal design of water distribution systems (WDS), input parameters, like water demand, were considered as being deterministic, often leading to underdesigned networks. A robust design, allowing a system to remain feasible under a variety of values that the uncertain input parameters can assume, can only be achieved through a probabilistic approach. In a probabilistic analysis the input parameters are considered to be random variables, i.e., the single values of the parameters are replaced with statistical information that illustrates the degree of uncertainty about the true value of the parameter. The outcomes, like nodal heads, are consequently also random variables, allowing the expression of the networks’ reliability.
Uncertainty in demand and pressure heads was first explicitly considered by Lansey [13]. The authors developed a singleobjective chance constrained minimization problem, which was solved using the generalized reduced gradient method GRG2. The obtained results showed that higher reliability requirements were associated to higher design costs when one of the variables of the problem was uncertain.
Xu and Goulter [14] proposed an alternative method for assessing reliability in WDS. The mean values of pressure heads were obtained from the deterministic solution of the network model. The variance values were obtained using the firstorder second moment method (FOSM). The probability density function (PDF) of nodal heads defined by these mean and variance values was used to estimate the reliability at each node. The approach proved to be suitable for demands with small variability. Kapelan [15] developed two new methods for the robust design of WDS: the integration method and the sampling method. The integration method consists in replacing the stochastic target robustness constraint (minimum pressure head) with a set of deterministic constraints. For that matter it is necessary to know the mean and standard deviation of the pressure heads. However, since pressure heads are dependent of the demands, it is not possible to obtain analytically the values for the standard deviations. Approximations of the values of the standard deviations are obtained by assuming the superposition principle, which makes it possible to estimate the contribution of the uncertainty in demand on the uncertainty of pressure heads. The sampling method is based on a general stochastic optimization framework, this is, a double looped process consisting on a sampling loop within an optimization loop. The optimization loop finds the optimal solution, and the sampling loop propagates the uncertainty in the input variables to the output variables, thus evaluating the potential solutions.
The aforementioned optimization problems are formulated as constrained singleobjective problems, resulting in only one optimal solution (minimum cost), that provides a certain level of reliability. More recently, these optimization problems have been replaced with multiobjective problems. Babayan [16] formulated a multiobjective optimization problem considering two objectives at the same time: the minimization of the design cost and the maximization of the systems’ robustness. Nodal demands and pipe roughness coefficients were assumed to be independent random variables following some PDF.
At this point, all the aforementioned models assume nodal demands as independent random variables. However, in reallife demands are most likely correlated: demands may rise and fall due to the same causes. Kapelan [17] introduced nodal demands as correlated random variables into a multiobjective optimization problem. The authors verified that the optimal design solution is more expensive when demands are correlated than the equivalent solution when demands are uncorrelated. A similar conclusion was achieved by Filion [18]. These results sustain that assuming uncorrelated demands can lead to less reliable network designs. Thus, even if increasing the complexity of optimization problems, demand correlation should always be taken into account in the design of WDS.
The robust design of WDS has gained popularity over the last years. Researchers have been focusing on methods and algorithms to solve the stochastic optimization problems, and great improvements have been made in this aspect. However, the quantification of the uncertainty itself has not been addressed. Values for the variance and correlation of nodal demands are always assumed and no attention is being paid in properly quantifying these parameters. The optimization problems could be significantly improved if more realistic values for the uncertainty would be taken into account.
This work addresses the need to understand in which measure the statistical parameters depend on the number of aggregated users and on the temporal resolution in which they are estimated. It intends to describe these dependencies through scaling laws, in order to derive the statistical properties of the total demand of a group of users from the features (mean, variance and correlation) of the demand process of a singleuser. Being part of the first author’s PhD research, which aims the development of descriptive and predictive models for water demand that provide insight into peak demands, extreme events and correlations at different spatial and temporal scales, these models will, in future stages, be incorporated in decision models for design purpose or scenario evaluation. Through this approach, we hope to develop more realistic and reliable WDS design and management solutions.
2. Statistical characterization of water demand
Recent studies on uncertainty in water distribution systems (WDS) refer that nodal demands are the most significant inputs in hydraulic and water quality models [19]. The variability of water demand affects the overall reliability of the model, the assessment of the spatial and temporal distributions of the pressure heads, and the evaluation of water quality along the different pipes. These uncertainties assume a different importance depending on the spatial and temporal scales that are considered when describing the network. The degree of uncertainty becomes more relevant when finer scales are reached, i.e., when small groups of users and instantaneous demands are considered. Thus, for a correct and realistic design and management, as well as simulation and performance assessment of WDS it is essential to have accurate values of water demand that take into account the variability of consumption at different scales. For that matter, the thorough description of the statistical properties of demand of the different groups of customers in the network, at specific temporal resolutions, is essential.
For a better understanding of this aspect, let us consider the distribution of the customers in a network. Figure 1 shows the network of a small town where the customers can be classified mainly as residential.
The most peripheral pipe serves the inhabitants of one single building. When moving upwards in the network, the number of customers increases reaching a maximum of 1258 customers near the tank. Obviously, as a consequence, the mean flow increases from the peripheral building to the tank. The increase of the variance of the flow is, however, less obvious. For larger networks and more densely populated towns, the difference between the number of customers that are close and far from the tank, and consequently, the variations of the mean and variance of the flow is even more pronounced.
Another important aspect when modelling a network is the choice of the adequate temporal resolution. This choice depends on the characteristics of the available measurement instruments and on the type of analysis to perform. When modelling the peripheral part of a network, characterized by a significant temporal variation of demand, it is important to adopt fine temporal resolutions, i.e., in the order of seconds. For the estimation of peak flows in design problems Tessendorff [20] suggests the use of different temporal resolutions on different sections of the network: the author suggests the use of a 15 second time interval for customer installation lines, two minutes for service lines, 15 minutes for distribution lines, and 30 minutes for mains and secondary feeders. The statistical properties of water demand are affected by the considered temporal resolution. The use of longer sampling intervals causes an inevitable loss of information about the signals, resulting in lower estimates for the variance [21, 22]. This aspect is particularly relevant at the peripheral pipes of the network that, as aforementioned, are characterized by large demand fluctuations. Therefore, understanding the spatial and temporal scaling properties of water demand is essential to build a stochastic model for water consumption.
Water demand can be described by a stochastic process in which
If the consumers are assumed to be of the same type, the properties of demand can be considered to be homogeneous in space, this is, they are independent of the particular consumer that is taken into consideration. Regarding the temporal variability, the stochastic process can only be assumed to be stationary in time intervals during which the mean stays constant. Once the length of this time interval,
For homogeneous and stationary demands, the expected values for the mean and variance,
2.1. Correlation between consumers
The definition of the mean and variance for each type of consumer is not enough for a complete statistical characterization of demand. In order to obtain a realistic representation of the demand loads at the different nodes in a network; essential for the assessment of the network performance under conditions as close as possible to the actual working conditions, the correlation between nodal demands cannot be ignored. This correlation can be expressed through the crosscovariance and crosscorrelation coefficient functions.
The crosscovariance,
As known, the WDS need to guarantee minimum working conditions, this is, the minimum pressure requirements have to be satisfied at each node even under maximum demand loading conditions. If all the consumers in the network are of the same type, it seems reasonable to assume a perfect correlation between demands, and to simplify the analysis of the network by assigning the same demand pattern to all the consumers. The synchronism of demands is the worst scenario that can occur on a network, causing the widest pressure fluctuations at the nodes. The assumption of a perfect correlation for design purposes results in reliable networks, but it also requires the increase of the pipe diameters, which consequently increases the networks cost. In fact, as mentioned earlier, each consumer has his own demand pattern based on specific needs and habits, without knowing what other consumers are doing at the same time. This means that demand signals in real networks are correlated, but are not synchronous. Thus, in order to obtain the optimum design of a network, it is essential to estimate the accurate level of correlation between the consumers. On the other hand, to estimate accurately the spatial correlation between demands, it is necessary to collect and analyse historical series, resulting in additional costs in the design phase. However, these additional costs will most certainly be compensated by the achieved reduction of the following construction costs.
2.2. The scaling laws approach in modelling water demand uncertainty
Water demand uncertainty is made of both aleatory or inherent uncertainty, due to the natural and unpredictable variability of demand in space and time, and epistemic or internal uncertainty, due to a lack of knowledge about it. Hutton [23] distinguishes epistemic uncertainty in two types. The first type concerns the nature of the demand patterns, and the lack of knowledge about this variability when modelling WDS both in time and space. This uncertainty is defined as ‘twodimensional’ uncertainty since it is composed by both aleatory and epistemic uncertainty. It can be reduced with extended and expensive spatial and temporal data collection or through the employment of descriptive and predictive water demand models. The second type of epistemic uncertainty takes the spatial allocation of water demand into account when modelling WDS [24].
Dealing with the ’twodimensional’ uncertainty when modelling WDS, requires not only a complete statistical characterization of demand variability, but also the determination of the correlation among the different users and groups of users. The natural variability of demand can be expressed using probability density functions (PDF). A PDF is characterized by its shape (e.g. normal, exponential, gamma, among others) and by specific parameters like the population mean and variance. Thus, in order to represent uncertain water demand using a PDF, it is necessary to identify and estimate the values of these parameters. The consideration of different spatial and temporal aggregation levels induces changes in the PDF parameters, often leading to a reduction of the uncertainty. The autocorrelation and crosscorrelation that characterize the water demand signals affect the extent to which the PDF parameters vary, and can introduce an additional sensitivity to the specific period of observation in question.
In order to understand the effects of spatial aggregation and sampling intervals on the statistical properties of demand, it is possible to develop analytical expressions for the moments (mean, variance, crosscovariance and crosscorrelation coefficient) of demand time series, at a fixed time sampling frequency
Where
The development of the scaling laws is based on the assumption that the demand can be described by a homogeneous and stationary process, which implies that the
Where
In order to estimate the expected value of the demand variance it is necessary to consider the covariance function
Where
The term
Where
This equation represents the scaling law for the variance, neglecting the bias that can be caused when using small the demand series (short observation periods).
Introducing the Pearson crosscorrelation coefficient given by
If demands are perfectly correlated in space then
If demands are uncorrelated in space then
Since the crosscorrelation coefficient can assume values between
Where
In conclusion, it can be stated that the variance in the consumption signal of a group of users
The variance function,
Where
Similarly, the expected value of the crosscovariance is given by:
Neglecting the term
Where,
As a consequence, the expected value of the Pearson crosscorrelation coefficient between the demands of
This equation shows that this coefficient depends separately on the spatial aggregation levels of each group,
From equation (19) it is possible to observe that the expected value
Since by definition
From equation (21) it is also possible to observe that the Pearson crosscorrelation coefficient between the
As expected, all the curves have a common starting point, since
In the particular case in which both groups of users have the same statistical properties, i.e., they belong to the same process, and assuming
From equation (22) it is clear that the crosscorrelation coefficientincreases with the number of aggregated users, tending to one. This limit value is reached as sooner as the crosscorrelation coefficient,
3. Validation of the Analytical expressions
3.1. Synthetically generated signals: scaling laws for the mean and the variance
In order to confirm the analytical development reported in the previous paragraph, the scaling laws were derived for groups of synthetically and simultaneously generated consumption signals. At this aim the Multivariate Streamflow model [28], with a normal probability distribution, was used. Each group was assumed to contain 300 consumption signals with 3600 realizations each, distinguished by different values of the crosscorrelation coefficient between them. The correctness of the procedure used to generate each demand series was tested by checking that the mean, the variance and the crosscorrelation coefficient of the generated signals were equal to the input parameters of the model. Only little differences were observed (Table 1), which are explained due to the fact that the generated demand series are realizations of a stochastic process and, consequently, their moments necessarily differ from the theoretical ones.
Once the single consumption signals of each group were generated, they were aggregated randomly selecting one at a time, until a maximum of 100 aggregated consumption signals was reached. The first and second order moments, mean and variance, were calculated for each aggregation level. In order to obtain a result as general as possible, the same procedure has been repeated 50 times, aggregating each time different users [25]. The obtained results are summarized in Table 1 and 2, with reference to equation 5.









0  0.70  0.7003  1.00  0.9996 
0.001  0.70  0.7017  1.00  0.9993 
0.010  0.70  0.6971  1.00  1.0001 
0.025  0.70  0.7020  1.00  0.9989 
0.050  0.70  0.7096  1.00  1.0004 
0.10  0.70  0.7063  1.00  1.0009 
0.20  0.70  0.7086  1.00  0.9994 
0.30  0.70  0.7032  1.00  1.0008 
0.40  0.70  0.6923  1.00  1.0003 
0.50  0.70  0.6942  1.00  0.9985 
0.60  0.70  0.6857  1.00  1.0002 
0.70  0.70  0.6874  1.00  1.0011 
0.80  0.70  0.6852  1.00  0.9998 
0.90  0.70  0.6789  1.00  1.0009 
0.99  0.70  0.7050  1.00  0.9997 



0  3.9808  1.0004 
0.001  3.5205  1.0541 
0.010  1.9864  1.3079 
0.025  1.4498  1.4984 
0.050  1.2702  1.6403 
0.10  1.2940  1.7686 
0.20  1.5621  1.8675 
0.30  1.8879  1.9139 
0.40  2.1713  1.9379 
0.50  2.4485  1.9570 
0.60  2.7685  1.9692 
0.70  3.0803  1.9804 
0.80  3.4051  1.9888 
0.90  3.6567  1.9945 
0.99  3.9960  1.9985 
Results confirm the linear scaling for the first order moment and show that the variance increases with the spatial aggregation level according to an exponent that varies between 1 and 2. In theory, for spatially uncorrelated demands the scaling laws is linear and for perfectly correlated demands the scaling law is quadratic.
3.2. Synthetically generated signals: scaling laws for the crosscovariance
In this case pairs of aggregated consumption series, A and B, were obtained by randomly selecting among pairs of the previously generated groups of signals. Different values of the product


0.10  0.10  0.10  0.3754  0.3747  1.00  0.9998 
0.20  0.10  0.10  0.3880  0.3907  1.00  0.9982 
0.40  0.10  0.10  0.3676  0.3624  1.00  1.0020 
0.40  0.20  0.10  0.3463  0.3440  1.00  1.0003 
0.80  0.60  0.10  0.3939  0.3908  1.00  1.0009 
0.50  0.50  0.10  0.3545  0.3541  1.00  0.9993 
0.50  0.50  0.20  0.7449  0.7401  1.00  1.0007 
0.50  0.50  0.30  1.0999  1.0923  1.00  1.0004 
0.50  0.50  0.40  1.4643  1.4605  1.00  0.9999 
0.50  0.50  0.50  1.8408  1.8504  1.00  0.9986 
Results confirm that


0.10  0.40  0.4000  2.00  1.9998 
0.20  0.80  0.7914  2.00  1.9997 
0.30  1.20  1.2160  2.00  2.0008 
0.40  1.60  1.6009  2.00  1.9985 
0.50  2.00  2.0059  2.00  1.9992 
0.60  2.40  2.3955  2.00  2.0003 
0.70  2.80  2.7934  2.00  2.0000 
0.80  3.20  3.2043  2.00  1.9999 
0.90  3.60  3.6057  2.00  1.9999 
0.99  3.96  3.9408  2.00  2.0003 
Equation 17 then becomes
3.3. Real consumption data: scaling laws for the mean and the variance
The parameters of the scaling laws were also derived for a set of real demand data. The indoor water uses demand series of 82 singlefamily homes, with a total of 177 inhabitants, in a building belonging to the IIACP (Italian Association of Council Houses) in the town of Latina were considered [29, 30]. The apartments are inhabited by singleincome families, belonging to the same low socioeconomic class.The daily demand series of four different days (4 consecutive Mondays) of the 82 users were considered [25]. For each user the different days of consumptions can be considered different realizations of the same stochastic process. In this way the number of customers was artificially extended to about 300, preserving at the same time the homogeneity of the sample. The temporal resolution of each time series is 1 second.








67  0.468  1.994  1.2288 
78  1.066  6.678  1.114 
89  0.988  7.401  1.0435 
910  0.891  6.205  1.0756 
1011  0.735  4.336  1.113 
1112  0.791  4.782  1.089 
1213  0.68  4.452  1.092 
1314  0.807  5.322  1.065 
1415  0.827  5.338  1.0688 
1516  0.704  3.857  1.1311 
1617  0.512  2.266  1.1739 
1718  0.634  3.112  1.1666 
1819  0.667  3.594  1.1412 
1920  0.707  5.445  1.0384 
2021  0.68  3.702  1.1253 
2122  0.635  3.412  1.099 
2223  0.397  1.958  1.0771 
mean  0.717  4.344  1.1084 
confidence limits 95%  0.082  0.759  0.024 
The series were divided into time periods of 1 hour to guarantee the stationarity of the process. In Table 5 the estimated values of the expected values of the mean and the variance of the unit user and the exponent α for the scaling law of the variance are reported. The same exponents for the mean were always trivially equal to 1. In these results the first six hours of the day and the last one were excluded because, during the night hours consumptions are very small and therefore their statistics have a poor significance. It was observed that the mean scales linearly with the number of customers. Differently, the variance shows a slight nonlinearity with the number of users. It must be underlined that the average daily value of the exponent α is 1.1, showing that there is a very weak correlation between the considered users.
3.4. Real consumption data: scaling laws for the crosscovariance andcrosscorrelation coefficient
Considering the consumption signals belonging to homogeneous users, equation 23 is valid and a quadratic scaling law for the crosscovariance should be expected. This behaviour was confirmed by the measured data for all the time intervals considered. In Figure 4 the scaling law of the consumption signals between 11:00 and 12:00 am is graphically reported.
The obtained crosscorrelation coefficient between the single user signals was low, being always less than 0.05, but increased noticeably when the number of aggregated users increased, as expected according to equation 22. For groups of 150 aggregated users the crosscorrelation coefficient reached the values shown in Table 6. These results enhance the importance of evaluating the crosscorrelation degree at different levels of spatial aggregation. Even if the crosscorrelation between singleuser demand signals is relatively low and less likely to significantly affect the performance of a network, it can largely increase with the spatial aggregation of users, becoming not negligible at those larger scales.




01  0.56  1314  0.49 
12  0.6  1415  0.5 
34  0.5  1516  0.42 
45  0.58  1617  0.49 
56  0.36  1718  0.5 
67  0.48  1819  0.39 
78  0.71  1920  0.51 
89  0.61  2021  0.52 
910  0.39  2122  0.39 
1011  0.46  2223  0.47 
1112  0.57  2324  0.61 
1213  0.48     
4. Stochastic simulation of a network
To illustrate the effect of the uncertainty of water demands on the performance of a network, particularly, the effect of the level of correlation between consumptions on the outcome pressure heads, a simple network simulation was performed. The water distribution network of Hanoi (Fujiwara and Khang, 1990) was considered for this matter (Figure 5).
The data for the Hanoi network were taken from the literature (Fujiwara and Khang, 1990), and the pipe diameters were assumed to be the ones obtained by Cunha and Sousa (2001). The demand data from the literature was used to estimate the number of users at each node, assuming a singleuser mean demand of 0.002 l/s. All the users in the network were assumed to be residential and having the same characteristics. The standard deviation of demand was assumed to be 0.06 l/s. The Multivariate Streamflow model [28] was used to generate synthetic stochastic demands with different levels of crosscorrelation between the singleusers. The nodal demands were then introduced in the network and the performance of the network was simulated using EPANET [32]. For each considered degree of crosscorrelation between demands, 100 simulations were performed, resulting in series of pressure heads for each node and for each correlation level.
The first aspect that emerges from the simulations, is that the number of nodes that fail, i.e., which do not satisfy the minimum pressure requirements, increase when the crosscorrelation degree increases. Higher correlations imply more synchronous consumptions, leading to pressure failures. Figure 6 illustrates this result.
Observing Figure 6 it is clear that the crosscorrelation between demands significantly affects the outcome pressure heads. The number of nodes that do not satisfy the minimum pressure requirements in the network increase from 194 nodes (total nodes in the network that fail in 100 simulations) when the crosscorrelation between demands is equal to 0.001, to a total of 543 nodes when the crosscorrelation between demands is 0.999. In other words, the probability of failure increases from 6.3% to 17.5% between the minimum and maximum levels of crosscorrelation that were considered.
Another aspect that emerges from the simulations is the increase of standard deviation of the pressure heads at each node of the network, which is illustrated in Figure 7.
The standard deviation of the pressure head verified at each node increases when the crosscorrelation between demands increases. The average standard deviation of the pressure heads along the network when the crosscorrelation between demands is equal to 0.01 is 1.35m, while the average standard deviation of the pressure heads when the crosscorrelation is 0.99, is 5.75m. This means that the crosscorrelation increases from 0.01 to 0.99 the standard deviation of the pressure heads increases more than 4 times.
The obtained results clearly show that the level of crosscorrelation between demands significantly affects the performance of a network and should, therefore, not be ignored when designing and managing WDS.
5. Conclusions
Understanding and modelling the stochastic nature of water demand represents a challenging field for researchers. Stochastic modelling faces difficulties like scarce availability of data for calibration purposes, high computational efforts associated to simulations, and the complexity of the problem itself. Moreover, the statistical properties of water demand change with the spatial and temporal scales that are used, which makes it even more difficult to accurately model the stochastic structure of demand. The proposed scaling laws represent a step forward in understanding the relation between the parameters that describe probabilistic demands and the spatial and temporal scales in which demands are measured and in which they should be modelled for WDS design or management purposes. The use of scaling laws allow a more accurate quantification of the statistical parameters, like variance and correlation, based on the real demand patterns, number of users at each node and the sampling time that is used. The scaling laws also allow to easily change the scale of the problem, since the statistical parameters and levels of uncertainty can be derived for any desired time or spatial scale.
The scaling laws were derived analytically and validated using synthetically generated stochastic demands and real demand data from Latina, Italy. A good agreement was found between the theoretical expressions, the synthetic demand data and the real demand data. Results show that the mean increases linearly with the number of aggregated users. The variance increases with spatial aggregation according to an exponent that varies between 1 and 2. In theory, for spatially uncorrelated demands the scaling laws is linear and for perfectly correlated demands the scaling law is quadratic. This aspect is clearly verified by the synthetic data. The scaling law for the covariance between 2 groups of users increases according to the product between the numbers of users in each group. The crosscorrelation coefficient depends separately on the number of users in each group, and increases towards a limit value. Even for small values of crosscorrelation between singleuser demands, this parameter cannot be ignored since it significantly increases with the aggregation of consumers.
The performed network simulation considering stochastic demands with different predefined levels of correlation show a clear influence of the degree of correlation on the outcome pressure heads: higher levels of correlation lead to larger fluctuations of the pressure heads and to more frequent pressure failures. At this point, the stochastic correlated demands were only used for simulation purposes. However, in future work a similar approach, can be used for design and management purposes. The consideration of correlated stochastic demands will result in more realistic and reliable water distribution networks.
Acknowledgments
The participation of the first author in the study has been supported by Fundação para a Ciência e Tecnologia through Grant SFRH/BD/65842/2009.
References
 1.
Buchberger S. G. Wu L. 1995 Model for instantaneous residential water demands  2.
Buchberger S. G. Wells G. J. 1996 Intensity, Duration, and Frequency of Residential Water Demands  3.
Garcia V. J. GarciaBartual R. Cabrera E. Arregui F. GarciaSerra J. 2004 Stochastic Model to Evaluate Residential Water Demands  4.
Buchberger S. G. Lee Y.. 1999 Evidence supporting the Poisson pulse hypothesis for residential water demands  5.
Alvisi S. Franchini M. Marinelli A. 2003 A stochastic model for representing drinking water demand at resindential level  6.
Blokker E. J. M. Vreeburg J. H. G. 2005 Monte Carlo Simulation of Residential Water Demand: A Stochastic EndUse Model  7.
Blokker E. J. M. Vreeburg J. H. G. Vogelaar A. J. 2006 Combining the probabilistic demand model SIMDEUM with a network model Water Distribution System Analysis  8.
Blokker E. J. M. Buchberger S. G. Vreeburg J. H. G. van Dijk J. C. 2008 Comparison of water demand models: PRP and SIMDEUM applied to Milford, Ohio, data  9.
Buchberger S. G. Blokker E. J. M. Vreeburg J. H. G. 2008 Sizes for SelfCleaning Pipes in Municipal Water Supply Systems  10.
Li Z. Buchberger S. G. Boccelli D. Filion Y. 2007 Spatial correlation analysis of stochastic residential water demands  11.
Blokker E. J. M. Vreeburg J. H. G. van Dijk J. C. 2010 Simulating Residential Water Demand with a Stochastic EndUse Model  12.
Moughton L. J. Buchberger S. G. Boccelli D. L. Filion Y. Karney B. W. 2006  13.
Lansey K. E. Mays L. W. 1989 Optimization model for water distribution system design  14.
Xu C. Goulter I. C. 1998 Probabilistic model for water distribution reliability  15.
Kapelan Z. Babayan A. V. Savic D. Walters G. A. Khu S. T. 2004 Two new approaches for the stochastic least cost design of water distribution systems  16.
Babayan A. Savic D. Walters G. 2005  17.
Kapelan Z. Savic D. Walters G. 2005  18.
Filion Y. Adams B. Karney A. 2007 Cross Correlation of Demands in Water Distribution Network Design  19.
Pasha K. 2005 Analysis of uncertainty on water distribution hydraulics and water quality  20.
Tessendorff H. 1972 Problems of peak demands and remedial measures  21.
RodriguezIturbe I. Gupta V. K. Waymire E. 1984 Scale considerations in the modeling of temporal rainfall  22.
Buchberger S. G. Nadimpalli G. 2004 Leak estimation in water distribution systems by statistical analysis of flow reading  23.
Hutton C. J. VamvakeridouLyrouda L. S. Kapelan Z. Savic D. 2011 Uncertainty Quantification and Reduction in Urban Water Systems (UWS) Modelling: Evaluation report  24.
Giustolisi O. Todini E. 2009 Pipe hydraulic resistance correction in WDN analysis  25.
Magini R. Pallavicini I. Guercio R. 2008 Spatial and temporal scaling properties of water demand  26.
RodriguezIturbe I. Marani M. D’Odorico P. Rinaldo A. 1998 On spacetime scaling of cumulated rainfall fields  27.
Van Marcke E. 1983  28.
Fiering M. B. 1964 Multivariate Technique for Synthetic Hydrology  29.
Guercio R. Magini R. Pallavicini I. 2003 Temporal and spatial aggregation in modeling residential water demand  30.
Pallavicini I. Magini R. 2007 Experimental analysis of residential water demand data: probabilistic estimation of peak coefficients at small time scales  31.
Fujiwara O. Khang D. B. 1990 A two phase decomposition method for optimal design of looped water distribution networks  32.
Rossman L. A. 2000