Results of risk communication for precipitation extremes with positive trend at station Qionghai (ID: 59855).

## Abstract

Due to climate change, the common assumption of stationarity in extreme value analysis of climate extremes has been gradually challenged. The familiar concepts such as a return period and a return level do not apply in a changing climate. To quantify and communicate risk of climate extremes in a changing climate, nonstationarity should be considered carefully. In this chapter, both the concepts and interpretations of return period, return level, failure risk, and reliability under nonstationary condition were interpreted. It was concluded that the two interpretations of the return period became divergent under nonstationary condition, while the two interpretations of failure risk were consistent irrespective of stationarity. Moreover, two examples of risk communication based on generalized extreme value (GEV) distribution for nonstationary climate extremes were presented. In the first example, climate change and its impacts on global air temperature extremes were detected. In the second example, extreme value analysis was firstly applied to precipitation extremes at two weather stations in China. Then, the fitted GEV distribution on historical records was also extrapolated for future risk communication. With these examples, the concepts those were related to risk measure and communication in a changing climate could be easily understood and applied in practice.

### Keywords

- extreme value theory
- nonstationarity
- engineering design
- return level
- failure risk
- reliability

## 1. Introduction

Extreme climate events could, directly or indirectly, impact all sectors of the economy leading to severe losses of life and property [1, 2, 3]. Mitigating natural hazards caused by extreme climate events is crucial to the sustainable development of human society and economy [4]. In IPCC’s report, an extreme climate event is generally defined as the occurrence of a value of a weather or climate variable above (or below) a threshold value near the upper (or lower) ends of the range of observed values of the variable [1]. The fundamental probability theory of extreme values has been well developed for a long time and already applied in resolving the practical problems in engineering design and risk management [5, 6, 7]. For the most part, extreme value theory (EVT) assumes that extreme events are stationary, and these extremes could be successfully characterized by the probability distributions such as the generalized extreme value (GEV) and generalized Pareto (GP) distribution [8, 9]. The occurrences of extreme events are also assumed to be independent or weakly dependent, then, the return levels and return periods could be easily determined [7, 10]. Under stationary condition, there is a simple one-to-one relationship between a return period and a return level, and these two terms can be easily understood [10, 11]. Moreover, risk could be simply communicated using the probability distributions of extremes derived from EVT [11]. In this study, the term “risk” merely refers to the probability of an extreme event with substantial consequence but not the expected loss in general risk analysis.

During the past few decades, there were clear and convincing evidences for global warming and climate change [12], which raised fundamental interdisciplinary issues of risk analysis and communication [13, 14]. As climate changes, weather or climate extremes also change [4] and gradually challenge the stationarity assumption in climate and weather extreme value analysis [6, 11, 15, 16, 17]. It has been documented that in some places, climatic and hydrological extremes exhibit some type of nonstationarity in the form of trends, shifts or a combination of them [18, 19, 20]. In a nonstationary world, both the severity and frequency of climate and weather extremes will change [2, 3, 10, 16, 17]. Consequently, extreme value analysis of climate and weather extremes has to consider and account for the nonstationarity [15]. Katz et al. [21] presented a nonstationary GEV distribution by introducing time as a covariate. He further showed that both GEV and GPD distributions could be retained under nonstationarity, and maximum likelihood method was also applicable for parameter estimation [6]. Nonstationary extreme values modeling based on GEV and GPD distributions has been realized in R-package *ismev* [22] and *extRemes* [23]. The other R-package GAMLSS (generalized additive model in location, scale, and shape) also allows nonstationary modeling for block maxima, where the parameters are modeled as linear or smooth functions of covariates [24]. Another available R-package for nonstationary extreme value analysis is *GEVcdn*, in which the parameters in GEV distribution are specified as a function of covariate using a conditional density network [25]. Besides these R-packages, nonstationary extreme value analysis could also been implemented using a MATLAB toolbox *NEVA* [26]. Although the nonstationary models in these packages performed better than the stationary equivalents in fitting nonstationary climate extreme, the return period (or return level) and risk for nonstationary conditions were not explicitly presented.

In nonstationary extreme value analysis, the concepts of the return period (or return level) and risk needed to be carefully reformulated and extended, because these familiar concepts, strictly speaking, no longer apply in a nonstationary climate [27]. In stationary cases, there exist two parallel interpretations for the return period: expected waiting time to an extreme event and expected number of extreme events in a given return period [7, 11]. In addition, the return level is the same in each year under stationary conditions. In Wigley [16], the return period was defined as the expected waiting time, and the influence of nonstationarity on the risk of extremes was presented using some simple probability arguments. The concept of the return period was further extended to nonstationary condition in Olsen et al. [10], where the return period was defined as the expected waiting time until an exceedance as the measure of risk. The alternative interpretation of the return period (expected number of extreme events in a given return period) for nonstationary conditions was clearly explained in Parey et al. [28, 29]. Recently, Cooley [11] reviewed these two definitions of the return period suggested by Olsen et al. [10] and Parey et al. [28, 29], and proposed that the return period could be used to communicate risk in nonstationary climate. From the perspective of engineering design, Salas and Obeysekera [7] illustrated the estimation of the return period and examined the failure risk of hydrological structures in nonstationary climate. Rootzén and Katz [30] also concerned the failure risk in the design period and proposed a risk-based engineering design concept, Design Life Level, which served as the basis of risk communication in a nonstationary climate. In the above literatures, the concepts of the return period or return level have been extended and adapted to nonstationary condition; however, the interrelations between return period and risk communication, especially for engineering design purpose, was still ambiguous. The major reason causing such ambiguity is the diversified explanations of one terminology for different purposes. Therefore, a comprehensive interpretation of the return period (or return level) and failure risk (or reliability) under either stationary or nonstationary conditions simultaneously are needed.

The aim of this chapter is to present the extension process of return period, return level, and failure risk from stationary condition to nonstationary condition in a different way so that the commonness and difference could be clearly identified. Consistent with the way how a return period is defined and derived in some previous literatures, extreme value analysis will apply to the time series of annual maxima in this study. Accordingly, GEV distribution is used to illustrate the computation of the return period (return level) and failure risk (reliability) in nonstationary climate.

## 2. Concepts and interpretations

In some previous literatures, the interpretation of the return period usually began with the simple one-to-one relationship between a return period and a return level under stationary condition [7, 10, 28, 29, 30]. Like Cooley [11], we define random variable

In this study, we try to explain the concepts of the return period and failure risk using the time series of annual maxima and the underlying stochastic process but omitting the assumption of stationarity or nonstationarity. In practice, analyzing the extremes and their probability distributions is usually considered as the basis of frequency analysis, engineering design, and risk assessment.

### 2.1. Waiting time-based concepts

Given a exceedance level

where the second line in Eq. (2) is based on the temporal independence assumption. Then, the expectation of waiting time

The details of the derivations of Eq. (3) were shown in the appendix in [11]. The first definition of the return period is based on the expected waiting time. Specifically, a

Next, we adopt the commonly used definition of failure risk for an engineering structure, which is interpreted as the probability of the failure or the probability of exceedance over its design level in its design life period. We denote the failure risk by

Consequently, the reliability of the focal structure within its design life period is

### 2.2. Expected number-based concepts

We define random variable

Then, we get

The expectation of

Now, we say that the

Similarly, the reliability of a focal structure in its design life period

From Eqs. (4) and (8), we find that the two parallel interpretations of failure risk (or reliability) of a focal engineering structure are equivalent irrespective of

## 3. Risk communication

### 3.1. Risk measure under stationarity

Under a stationary assumption,

The

It has been proved that the two interpretations of return period in the stationary case are both correct with this identical exceedance probability under stationarity assumption [11]. Substituting Eq. (9) into Eq. (3), we get the interpretation of the return period based on waiting time of exceedance:

Similarly, substituting Eq. (9) into Eq. (7), we get the alternative interpretation of return period based on expected number of exceedance events:

The simple one-to-one relationship between a return period and a return level in the stationary case has been commonly utilized in frequency analysis and engineering design practice [8, 21]. For example, the frequency or expected waiting time of extreme events exceeding a given exceedance level can be easily determined using Eq. (9) in frequency analysis of climate extremes. In practice, a very important concept for an engineering structure is the design life period. Reversely, given a design life period or exceedance probability, return levels could also be determined easily. Moreover, the failure risk or reliability of a focal structure in its design life period

where

### 3.2. Risk measure under nonstationarity

Under nonstationary condition,

To determine

The return level in Eqs. (13) and (14) are the two extensions of the return period in the stationary case; however, these two extensions are not applicable in practical engineering design [7, 30]. For engineering design purpose, Rootzén and Katz [30] presented a new concept, Design Life Level, by keeping the failure risk at a low constant level during the design life period. The relationship between Design Life Level and design life period was expressed by the following equation [30]:

where

A variant of Design Life Level is Minimax Design Life Level [30]. The computation of Minimax Design Life Level is even simpler. During the whole design life period

Similarly, the first step to compute the Design Life Level or Minimax Design Life Level is nonstationary modeling of historical climate extremes. Then, the trends of extremes would be extrapolated over the design life period. Moreover, the statistical uncertainty in the return period and Design Life Level can be described by computing the standard errors using the delta method [11, 30].

## 4. Applications

In this section, we present two examples of extreme value analysis and risk communication in a changing climate. The cumulative distribution function of the GEV is expressed as [8]:

where

where

The first example of risk communication was for global annual maximum near surface air temperature (1948–2015). The global gridded data were extracted from the reanalysis products with a spatial resolution of 2.5 * 2.5 provided by Earth System Research Laboratory, NOAA (*extRemes* [23]. The aim of this example was to show the changes in climate extremes caused by climate change and how this change impacted risk communication; therefore, we did not extrapolate the trends of temperature extremes but only computed the 20-year return level (the expected number-based return level during 1996–2010). Solving Eq. (14) relied on numerical optimization techniques, and in this study, the particle swarm optimization method was applied. The result of the global 20-year return level of annual maximum near surface air temperature in 1996–2015 was shown in Figure 1(b).

In the second example, we used two time series of annual maximum precipitation (AMP) to illustrate the risk measure and communication under both stationary and nonstationary conditions. The two AMP time series were extracted from observation dataset of daily precipitation, which was provided by the National Meteorological Information Center (NMIC) of the China Meteorological Administration (CMA). The two AMP time series were selected, because either positive or negative trends were detected. The corresponding weather stations are Qionghai (Station ID: 59855) and Zunhua (Station ID: 54429), which are located at N19

Akaike information criterion (AIC) was also computed for model fitting evaluation [31]. The model that was preferred was having the minimum value of AIC. For AMP time series at station Qionghai, the best fitting model was

The results were shown in Tables 1 and 2, respectively. The first three rows in Tables 1 and 2 mainly illustrated the relationship between the return period and the return level under stationary and nonstationary conditions. The last two rows in Tables 1 and 2 showed the two concepts, Design Life Level and Minimax Design Life Level, for the purpose of engineering design under nonstationary conditions. For AMP time series at Qionghai station, there was a significant positive trend in precipitation extremes. Given a 50-year return period, the associated return level under stationary assumption was much lower than those under nonstationary assumption. From the perspective of engineering design, the return levels shown in the first three rows were unacceptable, because the failure risks in the following 50 years (design life period) were all larger than 0.55. To ensure a low failure risk, a higher design level is needed. For AMP time series at Zunhua station, there was a significant negative trend in precipitation extremes. When nonstationarity was considered, the return level became lower due to the decreasing trend in precipitation extremes. Furthermore, due to the same reason, the Design Life Level and Minimax Design Life Level in 50-year design life period (2014–2063) were lower than the 50-year return level under stationary assumption.

## 5. Discussion and conclusions

Due to the climate change, the stationary assumption that was commonly used in statistical analysis of climate extremes gradually became unacceptable [6, 15, 17]. How to quantify and communicate risk of climate extremes in nonstationary climate is essential for engineering design and risk assessment [7, 11, 30]. There were many attempts to quantify and communicate risk in a changing climate such as extending the concepts of the return period from stationary condition to nonstationary condition [10, 28, 29] or developing a new concept of the return level [30]. In stationary climate, frequency analysis, engineering design, and risk assessment were all based on the stationary extreme value distribution model [8]. It was assumed that the fitted extreme value distribution model on historical records also applied for future observations. Also due to stationarity, the concepts of risk measure for different purposes had not been strictly distinguished. Unlike the simple one-to-one relationship between a return level and a return period under stationary condition, risk measure and communication were more complicated under nonstationary condition, especially due to the time-varying essence of climate extremes. Therefore, a clear interpretation and illustration of the methods for risk measure and communication in a changing climate are of great importance.

In this study, climate extremes were presented in the form of annual maxima of extreme climate events. This chapter began with the two parallel interpretations of the return period, in which, the implicit relationship between a return level and a return period was included, but the stationary or nonstationary assumptions were omitted. This implicit relationship was also considered as the basis for frequency analysis and engineering design. In the stationary case, the two interpretations of the return period were equivalent. Although they were no more equivalent in the nonstationary case, they both provided independent methods for determining the associated return level for a given return period. Risk assessment usually aims to a focal engineering structure with a given design level. We showed that the concept of failure risk (or reliability) also had two parallel interpretations, and these two interpretations were consistent irrespective of stationary or nonstationary assumptions. In order to illustrate how risk was quantified and communicated in a changing climate, two examples of nonstationary climate extremes were used. Totally, we have reviewed two methods for estimating the return level for a given return period under nonstationary condition [10, 28, 29] and two newly refined concepts of the return level in a given design life period for engineering design purpose [30]. In the first example, we detected the trend of annual maximum of global near surface air temperature during 1948–2015. Nonstationary GEV distribution with a time-varying location parameter was used to fit near surface air temperature extremes with significant trends, and the expected number-based return levels in 1996–2015 were computed. In the second example, time series of observed annual precipitation extremes at two weather stations in China with significant trends was analyzed. Both stationary and nonstationary GEV distribution models were used to fit the precipitation extremes. For each station, one best fitting GEV distribution was identified. Then, the linear trends in the parameters of GEV distribution were extrapolated into the following 50 years (also considered as the design life period). Return level, Design Live Level, and Minimax Design Live Level were all computed for the design life period (2014–2063), respectively. It was concluded that the communication of risk in a changing climate was obviously different from that in a stationary climate. For frequency analysis purpose, general return level/return period might be quite capable of communicating risk of climate extremes. While for the engineering design purpose, Design Live Level or Minimax Design Live Level were recommended, because the failure risk of a focal structure would be very low if Design Live Level or Minimax Design Live Level were chosen as the design level.

A reliable statistical modeling on long-term data was the basis of risk communication in a changing climate [26]. In nonstationary extreme value modeling, there are usually many candidate models. The choice of extreme value distribution models might influence the risk measure substantially in nonstationary and changing climate, because the trend captured by the extreme value distribution should be extrapolated into the future design life period. In this chapter, we only chose time as the covariate in the nonstationary extreme value analysis, and the parameters in GEV distribution model were expressed as the linear function of time. Perhaps, there might be more suitable trends such as quadratic or exponential trends leading to more candidate models. Evaluating all these candidate models was not an easy task. In practice, only a few of commonly used models was evaluated and compared. Moreover, the model selection process is not simply by using tools such as AIC or BIC [11]. Sometimes, additional expert knowledge is needed. In nonstationary extreme value modeling, besides time, some other climate indictors representing the variability of the climate system were also chosen as covariates. Although the historical data could be successfully fitted with these additional climate indictors, it was difficult to extrapolate the historical climate variability. That is because the climate variability itself is difficult to predict due to the complicity of the climate system. To reduce the uncertainty in statistical extrapolation, the output of numerical climate models was also used. However, the reasonability of simulation results was constrained by the parameters setting and initial values [30]. Additionally, the standard error of return levels and Design Live Level was all estimated using the delta method. Although standard error is a simple measure to quantify the uncertainty of nonstationary extreme value modeling, the uncertainty cannot be properly reflected using the symmetric confidence interval. Lastly, as pointed in [30], neither Design Life Level nor Minimax Design Life Level could be used as the criteria for realistic engineering design, because more economic and political factors should be considered besides the failure risk and reliability. All the above mentioned things are outside the scope of this chapter; here, our primary objective was to discriminate the concepts and their interpretations of the return period/return level, failure risk/reliability under stationary and nonstationary conditions, and to illustrate the computations using realistic climate extremes. With these examples, we believed that the concepts those were related to risk measure and communication in a changing climate could be easily understood and applied.

## Acknowledgments

This work was partly supported by the Youth Innovation Promotion Association of CAS (2016195), CAS Knowledge Innovation Project (KZCX2-EW-QN209), and National Natural Science Foundation of China (31570423).