Open access peer-reviewed chapter

Maintenance and Renewal Cost Evaluation for Managing Assets of Electric Power Equipment and Operational Data Analysis for Failure Rate Estimation

Written By

Tsuguhiro Takahashi

Submitted: 15 July 2022 Reviewed: 29 July 2022 Published: 07 September 2022

DOI: 10.5772/intechopen.106874

From the Edited Volume

Maintenance Management - Current Challenges, New Developments, and Future Directions

Edited by Germano Lambert-Torres, Erik Leandro Bonaldi and Levy Eli de Lacerda de Oliveira

Chapter metrics overview

104 Chapter Downloads

View Full Metrics

Abstract

In recent years, “asset management” or “managing assets” technique has been expected to rationalize maintenance and operation of electric power equipment, especially for aging equipment. Some concrete support tools have been developed by considering life cycle cost for substation equipment in “Central Research Institute of Electric Power Industry, Japan,” which include failure risk evaluation. Such cost and risk evaluation are essential for comparative evaluation of different types of equipment. Failure probability is one of the most important factors for the evaluation. Because of its high reliability, electric power equipment can be expected to have a very long lifetime, therefore, durability test is not applicable, but rather relies on analysis of actual operational data. Collection, accumulation, and analysis of actual operational data are necessary for accurate evaluation. This chapter describes the evaluation method for the managing assets, and data collection and analysis to improve the accuracy of failure probability estimation.

Keywords

  • asset management
  • power equipment
  • preventive maintenance
  • life cycle cost
  • risk evaluation

1. Introduction

In general, power transmission and distribution equipment that can be expected to operate for more than several decades had a wide age distribution over time. The amount of capital investment is inevitably affected by society and the economy, therefore, the shift to the aging side of the age distribution is progressing as a common phenomenon in many countries in recent years. In US and European countries, the issue of aging has been discussed since the end of the 20th century [1], and the effective introduction of the so-called asset management technique for formulating maintenance and management strategies that appropriately balance risk and cost has been examined.

Advertisement

2. Maintenance and renewal cost evaluation for power transmission and distribution equipment

In the asset management for corporate activities, benefits as positive impacts and risks as negative impacts are generally evaluated and added for each possible activity to be considered as an evaluation index to select the optimal strategy. In the case of electric power transmission and distribution equipment, it is difficult to evaluate the contribution of individual equipment because the entire network system generates benefits, therefore in many cases, the optimal maintenance strategy is selected by minimizing the costs and statistically evaluating risks required to maintain the network size and reliability. As one simple model, CRIEPI has proposed the cumulative cost evaluation method, and some support programs have been developed [2, 3, 4, 5].

2.1 Maintenance and renewal “cost” in operation

Maintenance and renewal costs during normal operation are classified into the following four items based on their expenditure timing and characteristics of change by age.

  1. Average repairing cost.

    In ordinary operations, there are some necessary repairing costs, such as oil leakage repair cost for power transformers. Generally, it can be assumed to increase with age. For example, its characteristic is assumed to be proportion to age.

  2. Inspection cost.

    Generally, regulated inspection cost is needed commonly for each equipment. A periodic and a nonperiodic (which is performed at a certain age) inspection costs are considered.

  3. Overhaul cost.

    Some equipment can be applied so-called “Overhaul” to realize rejuvenation as a maintenance measure. Overhaul costs depending on their effect is considered.

  4. Installation cost of planned renewal.

    The installation cost of equipment can be regarded as installments over several years, by considering depreciation. The property tax should also be considered during these years.

2.2 Statistically expected failure cost as “risk”

The expense required at a failure is installation cost of renewed equipment and some so-called penalty costs. The “penalty” cost should include the lost revenue from selling electricity, the emergent recovery cost, a penalty resulting from service interruption, and so on. Since the occurrence of a failure is statistical, the expense is “statistically expected cost,” which is the product of the “cost of failure” and the “probability of failure.” This cost is not a real cash flow, but it is expressed in monetary values and can be compared and combined with maintenance and renewal costs. When some aged equipment in service is removed as a result of failure, the same number of new equipment should be installed in order to maintain its power network scale. That is, the total number of equipment does not change. From the statistical point of view, such failures are occurred every year, depending on their failure probability. This means that the age distribution changes over time, which should be considered when the cumulative cost evaluation is carried out [4].

2.3 Maintenance scenarios to be compared

In general, when the asset management techniques are utilized for maintenance and renewal planning, it is necessary to consider possible maintenance measures and scenarios, in advance. The cumulative cost evaluation should be carried out for each scenario. Therefore, this scenario setting is important for this method. As one example, time-based renewal scenarios (such as at 40 and 50 years) with and without overhaul (OH) have been considered, as shown in Figure 1. The OH is assumed to rejuvenate equipment at a certain cost. Its effect (rejuvenation years), cost, and timing are specified as parameters.

Figure 1.

Output example of CRIEPI’s support program.

Advertisement

3. Failure data analysis based on operation results

One of the most important items in the managing assets is the risk evaluation, and failure probability estimation for each equipment is crucial. The failure probability distribution of a product is generally obtained from endurance tests on a large number of the same products. However, it is not practical to conduct such endurance tests for electric power transmission and distribution equipment, which can generally be expected to operate for more than several decades. In order to investigate the failure probability characteristics of long-life products, statistical analysis of residual performance tests of removed products from a real field, and operation/failure results in a real field are often employed. This section describes statistical analysis methods for operational data.

3.1 Definition of failure rate

Failures that are generally considered with a failure rate for an industrial product include those that stop the operation of the product once they occur and those that are repaired repeatedly each time they occur. The former determines the service life of the product, and the rate of occurrence is usually evaluated as a function of operating hours. The latter usually focuses on the interval of occurrence of multiple failures, and the change over time of the average time or the occurrence rate in a certain operating time is evaluated. In this section, “failure rate” means the occurrence rate of the former failures at a certain age, and is expressed as a function of age.

If a large number of the same products start operating simultaneously, the percentage of those that continue to operate without failure up to a certain elapsed time t is generally called the reliability, and it is often expressed as R(t). The cumulative failure probability, which means the percentage of failures between the start of operation and t, is expressed as follows:

Ft=1RtE1

F(t) differentiated by t is often denoted as f(t).

ft=ddtFtE2

This is the increment of F(t) at time t, i.e., the percentage of products that fail at time t, for all products. It is sometimes called the “failure probability” because it is the time derivative of the “cumulative failure probability,” but it is also called the probability density of failure because it represents the probability distribution of when the product will fail. When examining the risk of equipment in service, the probability that equipment that has been operating until age t will fail by the following year is often utilized. This is often denoted as λ(t) and is obtained as follows:

λt=ftRt=dFtdt1Rt=dRtdt1RtE3

In this chapter, this is referred to as the “failure rate,” and a method for estimating it from operation results is discussed.

3.2 Characteristics of power transmission and distribution equipment operational data for failure rate estimation

In general, to examine the failure rate characteristics of a product, an endurance test is conducted using several units of the same product. To examine aging characteristics, test samples are usually operated simultaneously and the time required for each sample to reach failure is determined. In conducting endurance tests, it is not always possible to continue the test until all samples fail due to time and cost constraints, but some statistical analysis methods can analyze data obtained by discontinuing the endurance test in the middle of the test. In the case of power transmission and distribution equipment, it is difficult to plan an endurance test in which a sufficient number of test samples are operated simultaneously, but a method to estimate the failure rate by considering the actual results of the long-term operation of a large number of facilities in a real field as a pseudo endurance test is conceivable. In this case, some considerations need to be made for the data used in the analysis.

3.2.1 Data for failure rate estimation

In order to estimate the failure rate, information on equipment that has been in operation without failure is needed in addition to the equipment that has failed. For a group of equipment that is the same type and assumed to exhibit the same failure rate aging characteristics, it is necessary to investigate the age of each failed facility as well as the age distribution of the group in operation.

3.2.2 Observation period

Failure rate estimation based on operational data for a group of equipment with age distribution starts by determining the number of failures/operation equipment at each age in order to obtain an approximation of λ(t). In doing so, it is necessary to consider the period of the survey (observation period). For example, if the past 10 years of failure history is to be investigated, the number of failures for each age can be obtained by adding up the 10 years of information for each individual failure by age of occurrence. Similarly, for the number of operations, the actual number of operations (the age distribution of equipment in operation) for each year of the observation period should be surveyed and added. For example, equipment that has continued to operate without failure at 20 years old has existed every year for the past 10 years, so they can be added up to the amount of equipment in operation at 20 years old in the aggregate. However, for example, equipment that is currently 40 years old should not be combined as “equipment that has continued to operate at 20 years old without failure” because this information is outside the observation period, though it is clear that the equipment continued to operate for 20 years without failure 20 years ago. This is because the information on equipment that has failed or has been removed outside the observation period cannot be used. Only “should have been in operation” information is likely to lead to underestimation of the failure rate. Therefore, the “observation period” is important and should be paid attention in the failure rate estimation.

3.2.3 Influence of low number of failure results

The inherent difficulty in statistical analysis of the failure rate from operational data of power transmission and distribution equipment lies in the fact that such equipment has a low failure rate and is highly reliable, and that preventive maintenance is performed to maintain high supply reliability. These suppress the occurrence of failures during operation, resulting in a decrease in the accuracy of failure rate estimation and underestimation. There is no other way to deal with these problems than to continuously accumulate appropriate data and increase the amount of data.

3.3 Procedure of statistical analysis

This section presents a computer simulation of virtual operational data and uses the results to show a specific procedure for estimating failure rates [6].

3.3.1 Simulated data

The simulating equipment is a group of 12,340 units with the aging distribution shown in Figure 2, all of which are assumed to have the same failure rate characteristics shown in Figure 3. The Weibull distribution is assumed for the failure rate characteristics, and the failure rate λ(t) and probability density of failure f(t) are expressed as follows:

Figure 2.

Assumed age distribution (12,340 units).

Figure 3.

Assumed failure characteristics.

λt=mts·ttsm1E4
ft=mts·ttsm1·expttsmE5

where m is the shape parameter and ts is the scale parameter, and in Figure 2, m and ts (years) are set to 4 and 80, respectively.

It can be simulated that after 1 year of operation of this equipment group, some equipment will fail according to the failure rate determined by each age. For each aged equipment, a random number between 0 and 1 is generated, and when the value is less than the failure rate at its age the unit is regarded to fail. In the following year, the number of failures is subtracted from the amount of equipment in each age, and the age distribution is shifted to the higher side by 1 year, and new equipment equal to the number of failures in the previous year is regarded to be installed, assuming maintenance that keeps the total amount of equipment in the group constant. If this is continued for several years, which corresponds to “observation period,” operational data can be generated as one simulation case, but its result would depend on the random number output, so different results would be obtained each time of the simulation. One simulation result whose observation period is 5 years is shown in Table 1. Table 1 also includes the combined number at each age during the observation period. The statistical analysis in the following section is performed on this combined operational data. Such simulation was performed five times.

age (year)1st year2nd year3rd year4th year5th yeartotal
operatingfailedoperatingfailedoperatingfailedoperatingfailedoperatingfailedoperatingfailed
11002202202302501020
2100100220220230870
3100100100220220740
4200100100100220720
525002001001001003000
6350025002001001006400
72800350025002001009100
8200028003500250020011000
92800200028003500250113601
102700280020002800350013800
112800270028002000280013100
123001280027002800200013301
133400299028002700280014690
145500340029902800270017390
155400550034002990280120091
165800540055003402299023092
175900580054005500338025980
185901590158005400550028502
196001589058905800540028981
206002599158925890580029575
216000598159825871589129725
226000600059705961586029791
23440060006002597595128323
244803440060005981597427158
25480147714400600597225944
263100479047624401600123054
27310131034790474439020126
282900309230704791472118574
292001290130713070478015823
302101199028903062307013113
312801209119902890304012812
322102279020801990289211854
332001208027932080199110945
341802199020812760208110714
35190117811994207027610509
36150018911770195120719183
37125015011880177019418342
38100112501491188117717394
39801991125014811876395
40700792980125214715195
415007007709831234183
424005017027729513326
43300401491681752623
442013003924806712044
451501903003704821492
461201501903013701131
4770123150191290824
48507090150180540
4930507090150390
503030507090270
510030305070180
520000303150111
53000000302050
54000000003030
55000000000000

Table 1.

One example of simulated operational data.

3.3.2 Hazard analysis

Failure characteristics of high voltage equipment are often expressed as a Weibull distribution. In other words, the fitting of an endurance test and operational performance data is performed assuming that the probability density distribution of failure can be expressed as a Weibull distribution function. The Hazard analysis is a statistical analysis method for this purpose. In this simulation, as described in Section 3.3.1, the true distribution is given as Weibull distributions, therefore, if hazard analysis is performed with high accuracy, the distribution is expected to be restored.

When the failure characteristic follows a Weibull distribution, the failure rate λ(t) is expressed by Eq. (4). This is integrated over time as in the following equation and is called the cumulative hazard H(t).

Ht=0tλτ=ttsmE6

Taking the natural logarithm of both sides of Eq. (6), the following equation is obtained.

lnHt=lnttsm=mlntmlntsE7

Plotting as y = ln{H(t)} and x = ln(t) results in a straight line with slope m and y-intercept -mln(ts). Utilizing Eq. (7), the natural logarithm of the approximate value Ĥt of the cumulative hazard H(t) obtained from endurance tests or operational data is plotted against the natural logarithm of age t, to estimate the shape parameter m and scale parameter ts by linear approximation. When the number of operations at age t is N(t) and the number of failures is n(t), the approximate value λ̂t of the failure rate λ(t) is obtained from the following equation.

λ̂t=ntNtE8

Using λ̂t, Ĥt is obtained as follows:

Ĥt=τ=1tλ̂τ=Ĥt1+λ̂tE9

Table 2 summarizes the results of these calculations using the combined data in Table 1. Table 2 also includes the natural logarithm of t and Ĥt for graph plotting. In the hazard plot that shows the relationship between ln(t) and lnĤt, plotting is carried out only when Ĥt is changed, that is, when λ̂t0. The hazard plots created from ln(t) and Ĥt in Table 2 are shown in Figure 4. Figure 4 includes linear approximation, from which the shape parameter m = 3.57 and scale parameter ts = 82.2 years are obtained from the slope and the intercept.

age t (year)totalapprox. Failure rate λ̂(t)approx. Cum. hazard Ĥ(t)ln (t)ln (Ĥ(t))
operatingfailed
110200/102 = 0.00000.0000
28700/87 = 0.00000.0000
37400/74 = 0.00000.0000
47200/72 = 0.00000.0000
530000/300 = 0.00000.0000
664000/640 = 0.00000.0000
791000/910 = 0.00000.0000
8110000/1100 = 0.00000.0000
9136011/1360 = 0.00070.00072.197225−7.21524
10138000/1380 = 0.00000.0007
11131000/1310 = 0.00000.0007
12133011/1330 = 0.00080.00152.484907−6.51088
13146900/1469 = 0.00000.0015
14173900/1739 = 0.00000.0015
15200911/2009 = 0.00050.00202.70805−6.22217
16230922/2309 = 0.00090.00292.772589−5.86005
17259800/2598 = 0.00000.0029
18285022/2850 = 0.00070.00362.890372−5.64
19289811/2898 = 0.00030.00392.944439−5.54731
20295755/2957 = 0.00170.00562.995732−5.18698
21297255/2972 = 0.00170.00733.044522−4.92383
22297911/2979 = 0.00030.00763.091042−48,787
23283233/2832 = 0.00110.00873.135494−4.74832
24271588/2715 = 0.00290.01163.178054−4.45565
25259444/2594 = 0.00150.01323.218876−4.33097
26230544/2305 = 0.00170.01493.258097−4.20705
27201266/2012 = 0.00300.01793.295837−4.0245
28185744/1857 = 0.00220.02003.332205−3.91071
29158233/1582 = 0.00190.02193.367296−3.82024
30131133/1311 = 0.00230.02423.401197−3.72095
31128122/1281 = 0.00160.02583.433987−3.65846
32118544/1185 = 0.00340.02913.465736−3.53538
33109455/1094 = 0.00460.03373.496508−3.38972
34107144/1071 = 0.00370.03753.526361−3.28467
35105099/1050 = 0.00860.04603.555348−3.07858
3691833/918 = 0.00330.04933.583519−3.00999
3783422/834 = 0.00240.05173.610918−2.96248
3873944/739 = 0.00540.05713.637586−2.8629
3963955/639 = 0.00780.06493.663562−2.73448
4051955/519 = 0.00960.07463.688879−2.59613
4141833/418 = 0.00720.08173.713572−2.50423
4233266/332 = 0.01810.09983.73767−2.30448
4326233/262 = 0.01150.11133.7612−2.19587
4420444/204 = 0.01960.13093.78419−2.03356
4514922/149 = 0.01340.14433.806662−1.93591
4611311/113 = 0.00880.15313.828641−1.87639
478244/82 = 0.04880.20193.850148−1.59987
485400/54 = 0.00000.2019
493900/39 = 0.00000.2019
502700/27 = 0.00000.2019
511800/18 = 0.00000.2019
521111/11 = 0.09090.29283.951244−1.22816
53500/5 = 0.00000.2928
54300/3 = 0.00000.2928
55000/0 = −

Table 2.

One example of processing data for hazard plotting.

Figure 4.

One example of hazard plots.

The Weibull distribution obtained by such procedure is a “sample mean” obtained from the average characteristics of a “sample” of equipment operational data and is expected to be different each time the sample is taken, in this case, each time the operational data is simulated. On the other hand, the true failure rate characteristic, the “population mean,” is determined first in this discussion, therefore they can be compared. The failure rates and probability density distributions of failures obtained from the results of five simulations, including the data in Table 1, are shown in Figure 5. Figure 5 also shows the true failure rate characteristics. Some of the five estimates (sample mean) have failure rates that are close to the true values, while others are higher or lower. Only one of the sample means, which is expected to vary, can be observed in reality, and there is no way to know the deviation from the true value. The only fundamental solution to the low estimation accuracy due to the low failure rate described in Section 3.2.3 is to increase the amount of data by investigating and accumulating the actual operation of equipment over a long period of time.

Figure 5.

Results of hazard analysis for 5 simulated data. (a) failure rate (b) probability density of failure.

3.4 Failure data and preventive renewal data

As mentioned in section 3.2.3, another issue in estimating failure rates from actual operational results is that failures do not occur as actual results because preventive renewals are performed in actual maintenance. This section discusses the addition of renewal data to the failure rate estimation.

Reference [7] introduces the failure rate and renewal rate of transformers and points out that the failure rate does not increase over time but the renewal rate does, and that the failure rate characteristic should be what is shifted renewal rate characteristic to the right (toward the high aging side) if no preventive maintenance is performed since the failure would have occurred years later if no renewal was performed. In risk evaluation for the examination of maintenance and renewal plans, the use of failure rate characteristics based on the operational data without consideration of the preventive renewal effect is clearly an underestimate. Reference [8] analyzed operational performance data for transformers and shunt reactors and estimated failure rates assuming that equipment that was replaced before failure was the one that would fail 5 or 10 years later. In order to conduct such a study, it is necessary to investigate and accumulate the actual field data of not only operations and failures, but renewals with reasons.

An example of a survey of equipment operational data, including renewal data, is the questionnaire survey [9] conducted by “Investigating R&D Committee for Asset Management for Electric Power Equipment Based on Insulation Diagnosis” of IEE Japan. This is a survey of failure and renewal data in 10 years conducted on approximately 200 plant manufacturers and other companies. Reference [6] has tried to utilize the results of this survey to estimate failure rate characteristics by combining failure and renewal data. Although how to combine them should be examined according to the reasons for each renewal and the characteristics of the target equipment, the reasons for renewal were not investigated in the survey, therefore, the analysis was conducted by assuming that the failure should have occurred at +5 years after the year of each renewal [6]. The results for CV cables from 6 kV class to 60 kV class are shown in Figures 6 and 7. For example, the failure rate at 30 years old is about 26 times higher than without taking the renewal results into account. Considering that approximately 90% of the group is 6 kV class cables, and that diagnostic methods such as DC leakage current measurement would have been applied to this class cables, it can be assumed that most of the reasons for renewal are due to some kind of trouble, and it is highly appropriate to add up them when the failure rate is estimated.

Figure 6.

Hazard plots of CV cables.

Figure 7.

Results of failure characteristics estimation of CV cable. (a) failure rate (b) Cumulative failure probability.

Advertisement

4. Conclusion

In recent years, the asset management or managing assets for power transmission and distribution equipment has been actively examined. In order to optimize maintenance strategies, both risk and cost associated with operating the equipment should be considered, and both evaluated and compared in monetary values. CRIEPI is investigating managing assets support tools evaluating the cumulative cost in operation including statistically expected risk.

It is important to obtain failure rate characteristics of equipment for risk assessment. As one method, this chapter has presented a method for statistical analysis of actual equipment operation data in the field, as well as the necessary data and considerations for this method. For power transmission and distribution equipment with high reliability and low failure rates, it is necessary to accumulate actual data over a long period of time in order to accurately estimate failure rates. Among them, the information on equipment renewal, which has not necessarily been sufficiently investigated and analyzed in the past, is particularly important in the situation that the preventive maintenance is generally adopted, and must be investigated and accumulated together with the reasons for renewal.

References

  1. 1. Ageing of the system – impact on planning, CIGRE TB176. 2000
  2. 2. Takahashi T, Okamoto T. Development of asset management support tool for electric power apparatus. Int’l. Sympos. High Voltage Eng. (ISH), Paper No. T6-719. 2007
  3. 3. Takahashi T, Okamoto T. Development of asset management support tools for oil immersed transformer. IEEE Transactions on Dielectrics and Electrical Insulation. 2016;23(3):1643-1648
  4. 4. Takahashi T. Development of support program for managing assets by considering regular maintenance cost and statistically expected failure cost. In: Németh B, editor. Proceedings of the 21st International Symposium on High Voltage Engineering. ISH 2019. Lecture Notes in Electrical Engineering. Vol. 598. Springer, Cham; 2020. DOI: 10.1007/978-3-030-31676-1_22
  5. 5. Takahashi T. Study of hierarchical support technique for managing assets - development of support program for substation equipment. 22nd International Symposium on High Voltage Engineering (ISH 2021). 2021. pp. 2101-2106. DOI: 10.1049/icp.2022.0463
  6. 6. Takahashi T. Study of statistical failure analysis for long life equipment based on operation data. CRIEPI Report No. GD21025. 2022. [in Japanese]
  7. 7. Guidelines for the use of statistics and statistical tools on life data. CIGRE TB706. 2017
  8. 8. Picher P et al. Use of health index and reliability data for transformer condition assessment and fleet ranking. CIGRE session 45, A2-101. 2014
  9. 9. Asset management for electric power equipment based on insulation diagnosis. Technical report of the Inst. of Electrical Engineers Japan, No. 1243. 2012. [in Japanese]

Written By

Tsuguhiro Takahashi

Submitted: 15 July 2022 Reviewed: 29 July 2022 Published: 07 September 2022