Statistical descriptive of all years.

## Abstract

In this chapter, we have conducted a statistical study of the annual extreme precipitation (AMP) for 856 grid cells and during the period of 1979–2012 in Algeria. In the first step, we compared graphically the forecasts of the three parameters of the generalized extreme value (GEV) distribution (location, scale and shape) which are estimated by the Spherical model. We used the Cross validation method to compare the two methods kriging and Co-kriging, based on the based on some statistical indicators such as Mean Errors (ME), Root Mean Square Errors (RMSE) and Squared Deviation Ratio (MSDR). The Kriging forecast error map shows low errors expected near the stations, while co-Kriging gives the lowest errors on average at the national level, which means that the method of co-Kriging is the best. From the results of the return periods, we calculate that after 50 years the estimated of the annual extreme precipitation will exceed the maximum AMP is observed in the 33-year.

### Keywords

- extreme precipitations
- kriging and Co-kriging
- cross validation
- return levels

## 1. Introduction

Natural disasters cause loss of human life and damage to infrastructure every year throughout the world. In Algeria, extreme rains are the source of flooding which can cause catastrophic damage both in inhabited areas and in the countryside.

One of the basic problems encountered in meteorology is the need to assess the meteorology risk caused by extreme precipitation in order to avoid human and material losses. Thus, the location and severity of floods can be determined.

In the twenties to the middle of the last century, the theory of extreme values has witnessed a remarkable development [1, 2, 3] most studies focused on the monthly or yearly mean values and we find their application in many fields like; rainfall in Algeria [4, 5], extreme precipitations in Argentina [6] Mapping snow depth return levels [7], precipitation and temperature [8, 9].

On the other hand, a lot of studies say there is a great spatial difference in rainfall [10] for this reason it was used interpolation methods, the kriging method is considered as the most used for spatial interpolation of rainfall [11, 12, 13], the kriging method has a special feature which is complementing the sparsely sampled primary variable, in the case of secondary variable there is another method called cokriging that outperforms the kriging method [14].

According to [15] In order to examine the spatiotemporal variations of meteorological variables there is a statistical method that allows to apply multiple strategies by cluster analysis to pinpoint the similar places, local and universal meteorology techniques which has been raising lately.

Our goal in this chapter is to compare the two methods kriging and co-kriging using the GEV and determine the location and severity of floods in all regions of Algeria.

## 2. Methodology and data

### 2.1 Generalized extreme value distribution

The cumulative distribution function is proposed by [16].

By deriving the Eq. (1) we get the density function

The logarithm of the likelihood function is given by:

For

For

3 and 4 with differentiating the two parameters:

### 2.2 Return period

The return period, also known as a recurrence interval is the estimated average time between events such as earthquakes, floods, landslides, or river floods. From Eq. (1) we can write the return level as following:

### 2.3 Variogram model

Various parameter variogram models have been used in the literature. Here is some of the most popular content.

The Spherical model has linear behavior at small separation distances near the origin, but flattens at large distances, which means that it shows a gradual decrease in spatial dependence until a certain distance beyond which the spatial dependence tends to smooth.

Where c_{0} is the nugget effect. The sill is c_{0} + c_{1}. The range for the spherical model can be computed by setting g(h) = 0.95(c_{0} + c_{1}).

The Gaussian model is used when the data exhibits strong continuity at short lag distances which means the spatial correlation is very high between two neighboring points.

_{0} is the nugget effect. _{0} + _{1} is the sill. The range is 3* Z*(

**) can be predicted without error for any**s

**on the plane.**s

### 2.4 Data description

The precipitation data used in this study are for the National centers for environmental information NOAA of USA, this data used especially in cases where surface data are difficult to obtain or insufficient. Our data represented by the annual daily maximal of rainfall from1979 to 2012 calculated in the 856 Algerian stations (Figure 1).

The preliminary analysis of the annual maximum precipitation data during the analysis period (1979–2012) included descriptive statistical calculations (Table 1 and Figure 2). More precisely, we calculated the minimum (Min), the maximum (Max), the mean (Mean), the standard deviation (std.dev) and the coefficient of variation (coef.var). Table 1 presents the values of the descriptive statistics for the annual time series of maximum precipitation for all stations (from 1979 to 2012). The results show that the maximum values is observed in the years 1982, 1992, 1994, 2001, 2006, and 2007, while the mean and highest values are observed in 1982 (Figure 2). The lowest value of coef.var. is for the year 1996 (68%), and the highest for 1984 (14%). On this basis, the observed data showed that all years had a coef.var. greater than 68%, highlighting the high variability of annual maximum precipitation over Algeria.

0.59 | 88.07 | 13.46 | 11.23 | 0.83 | 2.93 | 69.38 | 13.75 | 9.39 | 0.68 | |||

1.34 | 63.85 | 12.88 | 11.89 | 0.92 | 1.27 | 63.63 | 10.84 | 8.89 | 0.82 | |||

0.84 | 79.32 | 9.94 | 8.88 | 0.89 | 0.31 | 57.40 | 7.76 | 8.49 | 1.09 | |||

2.92 | 123.29 | 21.36 | 18.15 | 0.85 | 0.19 | 63.48 | 14.83 | 11.01 | 0.74 | |||

4.42 | 57.48 | 9.70 | 7.67 | 0.79 | 0.79 | 81.39 | 14.14 | 12.62 | 0.89 | |||

0.21 | 99.47 | 7.93 | 11.09 | 1.40 | 0.09 | 114.35 | 8.28 | 10.07 | 1.22 | |||

0.30 | 83.26 | 10.09 | 9.33 | 0.93 | 0.66 | 96.94 | 12.15 | 11.07 | 0.91 | |||

2.94 | 73.87 | 15.58 | 11.74 | 0.75 | 0.33 | 84.54 | 13.69 | 11.56 | 0.84 | |||

0.82 | 57.32 | 8.91 | 7.43 | 0.83 | 0.06 | 97.24 | 16.44 | 16.40 | 1.00 | |||

1.73 | 52.89 | 12.16 | 8.71 | 0.72 | 0.90 | 74.47 | 18.05 | 16.24 | 0.90 | |||

0.00 | 73.35 | 8.77 | 9.91 | 1.13 | 0.97 | 115.19 | 13.49 | 12.92 | 0.96 | |||

0.95 | 79.25 | 14.23 | 10.69 | 0.75 | 0.07 | 109.16 | 15.58 | 15.61 | 1.00 | |||

1.99 | 57.57 | 12.51 | 11.16 | 0.89 | 1.92 | 70.72 | 15.00 | 12.09 | 0.81 | |||

1.50 | 121.05 | 14.14 | 16.85 | 1.19 | 1.21 | 97.77 | 15.28 | 11.73 | 0.77 | |||

0.39 | 62.82 | 11.46 | 10.77 | 0.94 | 1.50 | 63.11 | 10.69 | 9.16 | 0.86 | |||

1.43 | 123.59 | 15.24 | 14.77 | 0.97 | 2.93 | 67.68 | 14.34 | 12.03 | 0.84 | |||

0.86 | 83.47 | 11.70 | 9.72 | 0.83 | 1.89 | 91.72 | 13.50 | 13.28 | 0.98 |

## 3. Results and discussions

### 3.1 Trend in annual maximum precipitation

In this study, the Mann-Kendall non parametric test is computed to characterize the time course of annual maximum precipitation at the national scale. The trend is considered significant if the value of the probability (* p*-value) is greater than 0.05 (95%). The Figure 3 shows the distribution of the significance trend (red color) and non-significant trend (blue color). From the results, we can observe that 130 (15%) stations have a significant trends and most of them are positive (97% of the total stations) at the national scale.

The maximum value of the annual maximum precipitation (AMPmax) for each station is calculated and presented in Table 2. From the Table 2, we can see that the mean value of AMPmax is 22.46 mm and 47% of the total stations with values greater than the mean of AMPmax. The coefficient of variation is 61%, indicating the significant spatial distribution of AMPmax at the national scale. Therefore, we applied a Kriging and Co-kriging approaches to better understanding the spatial distribution of the annual maximum precipitation in Algeria country.

Years | Min | Max | Mean | std.dev |
---|---|---|---|---|

5.88 | 123.59 | 37.00 | 22.46 | 0.61 |

In this study, we have 865 selected grids and 9496 predicted grids have locations where spaced every 20 m in the East and North grid directions and covered the irregularly shaped of the country (Figure 4). Due to the large numerical range of AMPmax values and to allow easy interpretation of the results, we worked with the logarithmic transformation of the variable. In this application, we chosen a base 10 logarithms (log10) for the data and we randomly selected control and test datasets. In this study, 30% of the total grids were excluded for testing (assessment).

### 3.2 Choosing the variogram model

We start by plotting the experimental variogram before adjusting the latter with the different models. The sum of the square errors (SSErr) and the regression coefficient (R^{2}) provided an accurate measure of the fit of the model to the variogram data, with a lower SSErr and a higher R^{2} indicating better fit of the model.

The values of the parameters of the different fitted models are presented in Table 3.

Model | Range | Nugget (C0) | Sill (C0 + C) | Nugget/Sill ((C0/C0 + C)*100) | SSErr | R^{2} |
---|---|---|---|---|---|---|

Spherical | 1075.984 | 0.02806 | 0.11425 | 24.56 | 2.26E-05 | 0.9890 |

Gaussian | 412.298 | 0.03665 | 0.10436 | 35.12 | 2.28E-05 | 0.9888 |

Theoretical and empirical semi-variogram were prepared for the AMPmax as shown in Figure 5. From the results, we can see that the spherical model has been found to be the most accurate model for annual maximum precipitation.

The spatial dependence is generally accessible in terms of the ratio between the nugget (C0) and the sill (C0 + C) expressed as a percentage. The AMPmax is considered to be a strong spatial dependence when the ratio value is less than 25%, moderate spatial dependence when this value is between 25% and 75%, and low spatial dependence when the value is greater than 75%. From Table 3, we can clearly see that the spatial dependence of AMPmax for the best-fitting semi-variogram model is strong and with a ratio of 24.56%.

### 3.3 AMPmax interpolation

The Spherical model is used to interpolate the AMPmax for both Kriging and co-Kriging methods at the national scale. In the first step, we compared graphically the forecasted and estimated GEV parameters (μ, σ and ξ). From Figure 6, we can see that a very clear spatial pattern for the estimates of the location and scale parameters however an absence of the spatial pattern for the shape parameter. The northern region is very marked compared to the rest of the regions with a significantly higher value of the location and scale parameters. On the other hand, the co-Kriging method clearly provided new regions where the values are high. Generally, the high values could be observed in northern Algeria.

In order to compare the two methods Kriging and Co-Kriging, we used Cross Validation method and some statistical indicators such as Mean Errors (ME), Root Mean Square Errors (RMSE) and Squared Deviation Ratio (MSDR) Table 4.

Μ | Σ | ξ | Μ | σ | ξ | |
---|---|---|---|---|---|---|

Min | −0.11691 | −0.23323 | −2.71689 | −0.41537 | −0.37368 | −2.52461 |

Max | 0.15085 | 0.19637 | 0.97323 | 0.70063 | 0.72695 | 0.98770 |

Mean | 0.00013 | −0.00011 | 0.00098 | 0.00105 | 0.00082 | 0.00224 |

ME | 0.00013 | −0.00011 | 0.00098 | 0.00105 | 0.00082 | 0.00224 |

RMSE | 0.02421 | 0.04819 | 0.37981 | 0.07523 | 0.08761 | 0.39080 |

MSDR | 0.08987 | 0.38275 | 1.91038 | 0.92698 | 1.08895 | 1.70011 |

Kriging | Co-Kriging |

Figure 7 display a bubble plots of the cross-evaluation error of the two methods, where positive values are drawn in green and negative values are drawn in red, and the size of the bubble is proportional to the distance from zero.

From Table 4 and Figure 7, we can clearly notice that the Kriging forecast error map with the three parameters of the GEV distribution shows low errors expected near the stations, while co-Kriging gives the lowest errors on average at the national scale, especially for the shape parameter.

After the validation of the two methods, Co-kriging method used to estimate the return levels (RLs) of the annual maximum of daily precipitations for the different stations using Eq. (6).

The return periods are shown in Table 5 for the 20, 50, and 100-year. The results show that the maximum annual maximum precipitation observed in 1982, 1992, 1994, 2001, 2006, and 2007 exceeds the 20-year regression level. AMP exceeding the maximum AMP during the observation period (123.59) begins to appear in the confidence interval of 50 year.

Min | Max | Mean | std.dev | coef.var | |
---|---|---|---|---|---|

20 years | 5.6697 | 106.8175 | 28.7420 | 17.8361 | 0.6205 |

50 years | 5.7920 | 252.1116 | 40.0470 | 30.8470 | 0.7703 |

100 years | 5.8421 | 493.1154 | 52.2700 | 52.0110 | 0.9950 |

Figure 8 shows the results for the three cases RLs considered, in the first and second cases (20 years and 50 years), we notice roughly the same results, although there are some remarks that need to be made. In 50-year RLs, there is an increase in the eastern part, far south and in the center of Algeria. Otherwise in 100-year RLs, we noticed a great difference, especially in the eastern region, in the far southern the state of Tamanrasset, the western region in the state of Tindouf, and in The Middle of the desert the Adrar region.

## 4. Conclusion

In the current research, we studied the spatial analysis of rainfall data in 856 grid cells during the analysis period (1979–2012). The main conclusions form this study can be summarized as follows:

We marked that the spherical model was found to be the most accurate model for annual maximum precipitation at the national scale.

We can clearly see that the Kriging forecast error map shows low errors expected near the stations, while co-Kriging gives the lowest errors on average at the national level, which means that the method of co-Kriging is the best.

Return levels were estimated for several return time periods. For the return level estimated from the GEV distribution, the point estimate that exceeds the return level of all previous maximum AMP records begins to appear in the 50-year regression period.

## References

- 1.
Fisher RA, Tippet LHC. Limiting forms of the frequency distribution of the largest or smallest member of a sample. Proceedings of the Cambridge Philosophical Society. 1928; 24 :180-190 - 2.
Gnedenko BV. Sur la distribution limite du terme maximum d’une série aléatoire. Annals of Mathematics. 1943; 44 :423-453 - 3.
Gumbel EJ. Statistics of Extremes. New York, NY: Columbia University Press; 1958 - 4.
Naima B, Hassen C, Lotfi H. Modelling maximum daily yearly rainfall in northern Algeria using generalized extreme value distributions from 1936 to 2009. Meteorological Applications. 2017; 24 :114-119 - 5.
Meddi M, Toumi S. Spatial variability and cartography of maximum annual daily rainfall under different return periods in Northern Algeria. Journal of Mountain Science. 2015; 12 (6):1403-1421. DOI: 10.1007/s11629-014-3084-3 - 6.
Ferrelli F, Brendel AS, Aliaga VS, Piccolo MC, Perillo GME. Climate regionalization and trends based on daily temperature and precipitation extremes in the south of the Pampas (Argentina). Geographical Research Letters: Cuadernos de Investigación Geográfica; 2019. DOI: 10.18172/cig.3707 - 7.
Blanchet J, Lehning M. Mapping snow depth return levels: smooth spatial modeling versus station interpolation. Hydrology and Earth System Sciences. 2010; 14 :2527-2544 - 8.
Shrestha AB, Bajracharya SR, Sharma AR, Duo C, Kulkarni A. Observed trends and changes in daily temperature and precipitation extremes over the Koshi river basin 1975-2010. International Journal of Climatology. 2016; 37 (2):1066-1083. DOI: 10.1002/joc.4761 - 9.
Ren Y-Y, Ren G-Y, Sun X-B, Shrestha AB, You Q-L, Zhan Y-J, et al. Observed changes in surface air temperature and precipitation in the Hindu Kush Himalayan region over the last 100-plus years. Advances in Climate Change Research. 2017; 8 :148-156. DOI: 10.1016/j.accre.2017.08.001 - 10.
Lloyd CD. Assessing the effect of integrating elevation data into the estimation of monthly precipitation in Great Britain. Journal of Hydrology. 2005; 308 (1–4):128-150. DOI: 10.1016/j.jhydrol.2004.10.026 - 11.
Adhikary SK, Muttil N, Yilmaz AG. Ordinary kriging and genetic programming for spatial estimation of rainfall in the Middle Yarra River catchment. Australia. Hydrology Research. 2016; 47 (6):1182-1197. DOI: 10.2166/nh.2016.196 - 12.
Moral FJ. Comparison of different geostatistical approaches to map climate variables: Application to precipitation. International Journal of Climatology. 2010; 30 (4):620-631. DOI: 10.1002/joc.1913 - 13.
Yang X, Xie X, Liu DL, Ji F, Wang L. Spatial interpolation of daily rainfall data for local climate impact assessment over greater Sydney region. Advances in Meteorology. 205:12. DOI: 10.1155/2015/563629 - 14.
Goovaerts P. Geostatistical approaches for incorporating elevation into the spatial interpolation of rainfall. Journal of Hydrology. 2000; 228 (1–2):113-129. DOI: 10.1016/S0022-1694(00)00144-X - 15.
Rad AM, Khalili D. Appropriateness of clustered raingauge stations for spatio-temporal meteorological drought applications. Water Resources Management. 2015; 29 :4157-4171 - 16.
Jenkinson AF. The frequency distribution of the annual maximum (or minimum) of meteorological elements. Quarterly Journal of the Royal Meteorological Society. 1955; 81 :158-171