Open access

Assessment of Areal Average Air Quality Level over Irregular Areas: a Case Study of PM10 Exposure Estimation in Taipei (Taiwan)

Written By

Hwa-Lung Yu, Shang-Chen Ku, Chiang-Hsing Yang, Tsun-Jen Cheng and Likwang Chen

Submitted: 21 October 2010 Published: 17 August 2011

DOI: 10.5772/17622

From the Edited Volume

Advanced Air Pollution

Edited by Farhad Nejadkoorki

Chapter metrics overview

2,054 Chapter Downloads

View Full Metrics

1. Introduction

Assessment of public health risk from the exposure of ambient pollutants has been heavily based upon the analyses of the associations between the pollutant exposure and its potential consequence, e.g. disease occurrences (Abbey et al., 1995; AckermannLiebrich et al., 1997; Beeson et al., 1998; Jerrett et al., 2005; Pope et al., 2002). In many air pollution epidemiologic investigations, individual health datasets at nationwide or regional scales are available to assess the subtle risks of pollution exposure. Among them, the understanding of the spatial or spatiotemporal distribution of ambient pollutants is essential due to their prevalent heterogeneity across space and/or time. In these cases, governmental agencies has established ambient air-quality monitoring networks, such as the Air Quality System (AQS) operated by the U.S. Environmental Protection Agency (EPA), regularly recording important and useful environmental data sources concerning the acute and chronic effects of ambient pollutants (TWEPA, 2006; USEPA, 1992). Based upon these databases, an ideal exposure assessment is performed by applying techniques of spatial or/and spatiotemporal analysis for the estimation of the pollutant level at the locations of individuals in health dataset. However, due to the raising concerns of privacy and confidentiality of personal information, the sensitive personal information, such as residential addresses, of health dataset is usually not allowed to be accessed. As a result, individual information of health dataset obtained from institutes or governmental agencies is often removed or degraded. Among them, the spatial locations of individuals of health dataset are usually aggregated into a larger spatial unit, e.g. higher administrative level, in which no personal identities can be identified. These aggregations raise challenges for environmental epidemiologists to assess the exposure levels and to investigate the potential health risks.

Spatial interpolation techniques have been increasingly used by environmental epidemiologists to estimate the spatiotemporal distribution of air quality levels. Among them, the deterministic approaches such as nearest-neighbor method (NN) and inverse distance weighted method (IDW) are the most widely used methods (Hendryx et al., 2010; Hoek et al., 2001; Michelozzi et al., 2002; Sohel et al., 2010), in which the estimation results are derived from certain assumed functional relationship of geographical distances between the observation and estimation locations without considering the stochastic associations among the air quality measurements. The stochastic techniques, in particular kriging method, have been applied with increasing frequency in exposure assessment studies to address the spatial heterogeneity among the air quality data as well as the estimation uncertainty of predictions (Brauer et al., 2003; Brimicombe, 2000; Buzzelli and Jerrett, 2004; Hoek et al., 2001). Most spatial interpolators are originally developed for the estimations with geographical support of estimation and observation locations at point scale. As a result, most studies of exposure assessment apply spatial techniques for point estimation at the centroid of the geographical unit of interest to characterize its average air pollution level (Chen and Schwartz, 2008; Chen and Schwartz, 2009; Lertxundi-Manterola and Saez, 2009; Maheswaran and Elliott, 2003; Miller et al., 2007). The inconsistency of geographical support between the exposure estimations and aggregated health dataset can potentially distort the results of their associated epidemiological studies (Young et al., 2009; Young et al., 2008).

This study investigates and compares the estimation results of areal-averaged air quality level by several popular spatial mapping techniques, i.e. NN, IDW, ordinary kriging (OK) and block kriging method (BK). Among them, BK is a kriging-based upscaling method which can perform spatiotemporal estimation with the consideration of the stochastic dependence among the locations with irregular sizes and shapes. This comparison is performed on the spatiotemporal PM10 estimation of the townships over Taipei area (Taiwan) during 2004-2006 on the basis of data during 1997-2007.


2. Materials and methods

2.1. Study area

Taipei is the largest metropolitan area in Taiwan including Taipei city and Taipei county with the vehicle density as high as over 6000 vehicles per km2. Except for traffic emissions, the three incineration plants are the major stationary emission sources in the area (Chang and Lee, 2007). The Taipei area is bounded by mountains, i.e. Yangming mountains to the north, Linkou mesa to the west, and ridge of Snow mountains to the southeast which forms the second largest basin of the island (see Figure 1). Because of the significant variation of the basin topography, the air convection and circulation are generally degraded in this area. In addition, the alluvial plains at basin floor materialized the highly urbanization area of Taipei. As a result, the ambient pollutant concentration across the basin floor is generally higher than that over its surrounding mountain areas. Taiwan Environmental Protection Agency (TWEPA) has established the air quality monitoring network which regularly records the ambient pollutants, including PM10 and other criteria pollutants (TWEPA, 2006), and meteorological covariates throughout the island since September, 1993. Figure 2 shows the eighteen TWEPA monitoring stations in Taipei area. As it can be noticed, the distribution of monitoring locations is spatially imbalanced that most of the stations in Taipei are located at the high urbanized area. In this study, all the PM10 observations from these stations are aggregated into monthly data following the procedure suggested by USEPA (USEPA, 2004). The areal concentration estimations are performed at each of the townships shown in Figure 2 by various spatial interpolation techniques.

Figure 1.

The highways, rivers and topography in Taipei metropolitan area

Figure 2.

Spatial distribution of PM10 monitoring stations operated by TWEPA

2.2. Methods

This study investigates several common used spatial interpolation methods for ambient pollutant exposure estimation. Among them, the nearest neighbor method (NN), also called polygon method and Theissen method, is the simplest method for spatial estimation. It assumes the air quality level at an ungauged location is completely determined by its closest observations so that spatial distribution of pollutants is composed of a set of polygon across space. The inverse distance weighted method (IDW) is a weighted average of neighboring observation values. The weights given to each observation is a function of distances between observation and estimation locations (Waller and Gotway, 2004) as below


where dki is the space-time distance between the estimation location (sk,tk) to the observation locationsz(si,ti). pis the modeler-specified degree. The increase of p can decreases the weights of distant observations. The space-time distance is assumed to be the function of the geographical distance and time interval (Christakos, 2000; Christakos et al., 2000). It can be expressed into the form ofdki=|sksi|+mst|tkti|, where mst is the space-time metric to account for the relationship between spatial and temporal metric (Christakos et al., 2002). Both NN and IDW methods are deterministic methods which provide the estimations without the information of estimation uncertainty.

Stochastic techniques, on the other hand, can account for the uncertainty of the space-time data under the framework of random field theory which forms a multivariate joint distribution among the space-time attribute of interest. Among them, kriging method is the most popular technique in space-time mapping which is so-called best linear unbiased estimator (BLUE) in the sense of providing the minimum estimation uncertainty (Olea, 1999). Various types of kriging method have been developed in terms of different assumptions and analytical goals. Among them, ordinary kriging (OK) is the most widely used one which assumes the unknown mean among the space-time dependent data. The basic equations of ordinary kriging are shown below (Goovaerts, 1997)


where λi,pk are the OK weights for the observation atpi=(si,ti); C(pi,pj)denotes the covariance of the attribute between pi andpj, n(pk)is the number of observations used for the kriging estimation atpk, and μOK,pk is the Lagrange multiplier. The OK estimation at pk can be the linear combination of observations expressed as form ofz^(pk)=i=1n(pk)λi,pkz(pi). Block kriging (BK) inherits the framework of kriging method and considers the geographical support of observation and estimation locations, i.e. their size and shape. The average value of attribute of concern over a block V can be represented asz^V(pk)=V(pk)duz(u)/|V|, where |V| is the areal size of block V and u is the point locations within the blockV. The block kriging system is very similar to Eq. (2) with the covariance C(pk,pj) replaced by the covariance between point observations and estimation block C¯(V(pk),pj)(Goovaerts, 1997), whereC¯(pk,pj)=V(pk)duC(u,pj)/|V|.

This study compares the township-based estimations of PM10 level by the methods above, i.e. NN, IDW, OK and BK. Among them, three different spatial resolutions are used in BK method, i.e. dividing the irregular areas into 5x5, 10x10 and 40x40 grids. The estimations are performed at all townships in Taipei during 2006 with the support of space-time PM10 observations during 1997-2007. Among them, the township-level PM10 estimations by NN, IDW and OK methods follow the conventional approach that uses the estimations at geographical centroid to characterize the areal average level of pollutant concentration (Chen and Schwartz, 2009; Lertxundi-Manterola and Saez, 2009; Maheswaran and Elliott, 2003). In addition, to assure the comparison is performed at the same basis, the nonstationarity of PM10 process in space and time is removed by locally weighted smoothing regression method (LOESS) (Cleveland, 1979; Cleveland and Devlin, 1988) in advance to the applications of the spatial-time interpolation techniques.


3. Results

The observed PM10 levels across the stations in Taipei vary significantly as shown in Figure 3. The average and variance of monthly PM10 observations at the stations located in highly urbanized areas are generally higher than those in city’s surrounding areas. Temporal variation of averaged PM10 across stations is shown in Figure 4 in which the increased PM10 variability happens at the time of higher average PM10 value, i.e. proportional effect. In general, the average of observed PM10 is higher during the seasons of spring and winter. In this study, the spatiotemporal processes of monthly PM10 is decomposed into an nonstationary trend in space and time to account for the general pattern of PM10 observations, e.g. PM10 variation resulting from the changes of seasons and distribution of emissions, and stationary residuals which account for the spatiotemporal dependence among the PM10 transport process. The spatiotemporal trend is estimated by using LOESS method which applies low-order polynomials to obtain the general pattern of PM10 variation at each space-time locality (Cleveland, 1979; Cleveland and Devlin, 1988). The estimation of spatiotemporal trend is performed by R software package. To characterize the spatiotemporal dependence, a space-time seperable function is used as shown below (see Figure 5)


where c0=150 and[ar,at]=[2000m,3month]. Eq. (3) is used in kriging methods, i.e. OK and BK, to account for the space-time covariance at point scale in space and time. In addition, the covariance function identifies the influential ranges of PM10 data in space and time, and therefore characterizes the space-time metric which is used to estimate the space-time distance for IDW and kriging methods as discussed above.

The estimations of the areal PM10 level during the period of 2004-2006 at all townships of irregular sizes and shapes are compared in this study and their differences are shown in Figure 6 in which BK1, BK2, and BK denote the results of block kriging with spatial resolution of 5 by 5, 10 by 10, and 40 by 40 elements of each township. Results show that the spatiotemporal dependence is important to the estimations, i.e. significant differences between the results from deterministic methods, i.e. NN and IDW, and stochastic methods, OK and BK. The spatial distribution of the mean squared difference of estimations between the results from BK and other methods are shown in Figure 7. Figures 7 (a) and (b) shows the comparison between BK and the deterministic methods, i.e., NN and IDW, in which the mean squared differences (MSD) are generally high and increase as the estimation location further apart from the cluster of monitoring stations located at city central. The comparisons among spatial patterns of kriging results in Figures 7 (c) and (d), for OK-BK and BK1-BK

Figure 3.

Spatial distribution of (a) mean and (b) standard deviation of PM10 observations

Figure 4.

Temporal variation of averaged PM10 observations and its associated 95% confidence interval.

Figure 5.

Spatiotemporal covariance of PM10 observations across (upper) space and (bottom) time.

Figure 6.

Boxplot of the estimation differences between the results of block kriging with highest resolution and other methods.

Figure 7.

Spatial distribution of the mean squared differences between the methods of (a) NN-BK, (b) IDW-BK, (c) OK-BK, and (d) BK1-BK

respectively, show the relatively higher MSDs are located at the boundary of study area and townships surrounding by multiple stations with relatively drastic variations of their observations. It should be noted that estimation differences between deterministic methods and BK are generally higher than those in the comparisons between kriging methods. Similar results are shown in the comparisons of over time in Figure 8 in which the seasonality of MSDs is shown in all the comparisons that elevated estimation variability is elevated in spring and winter.

Figure 8.

Temporal distribution of mean squared differences among the methods of NN, IDW, OK, and BK with various spatial resolutions.


4. Discussions

Various methods of different level of complexity have been proposed and used in spatiotemporal estimation of air pollution exposure. Due to the privacy issue of personal data access, the spatiotemporal exposure estimation is often performed on the aggregated dataset with various geographical supports. As a result, certain approximations or assumptions of common techniques of point scale are required to be applied on the estimation of the average level over an area of concern, e.g. administrative division. Among them, the most common approximation is to use the air pollution estimation at the centroid to represent the average pollution level of the entire area of concern (Chen and Schwartz, 2008; Chen and Schwartz, 2009; Lertxundi-Manterola and Saez, 2009; Maheswaran and Elliott, 2003; Miller et al., 2007). The impact of the size and shape of geographical units to the estimation results is seldom considered (Goovaerts, 2008). The assumption of the similar spatial resolution among observation and Estimation areas can be dubious. In addition, the estimation ignores the variability of pollution levels within each of the studied units. Such approximation can be inadequate, especially, in the exposure estimation in metropolitan area where, in addition to its complex topography, various emission sources spreading across space elevates the spatiotemporal variability of air pollution. Though it is theoretical defective, exposure estimation at centroids usually provides a quick and convenient way to assess the air quality level in the corresponding areas of interest. The impact of this assumption of spatial prediction is worthwhile to be examined because the potential biased estimation results of air quality can distort their associated environmental epidemiology studies (Son et al., 2010).

As shown in Figure 6, comparing with the results of BK, the inconsistency is clearly shown between the estimations obtained from the two groups of methods, i.e. deterministic and stochastic, that the variability of the between-group differences are significantly larger than the within group differences. Among them, the results from NN method, i.e. a very common used method for ambient exposure estimation in environmental health studies (Basu et al., 2004; Miller et al., 2007), are significantly variable compared to those from other methods, i.e. its standard deviation is about 20% of common level of PM10. Figure 7 (a) shows that the discrepancy levels of NN results can vary across space and its generally high variability can be reduced at the areas of the higher density of monitoring stations. The periodic feature of the variation in Figure 8 shows that the temporal characteristics of NN results are still distinct from those of the results by other methods. It may resort to the fact that NN method is the only method disregarding the space-time dependence among the PM10 dataset in this study. Despite of its deterministic assumption, IDW accounts for the spatiotemporal dependence by considering the space-time metric among the observation and estimation locations. As a result, IDW reduces the discrepancy in both space and time across Taipei area between its results and those by kriging methods which considering the spatiotemporal dependence by not only space-time metric but the similarity among the observations. The standard deviation of the estimation variability compared to BK is about 10% of common PM10 level in Taipei. In addition, the results of deterministic approach are overestimated, especially at the townships with scarce monitors around, due to the preferential sampling at the highly polluted areas of Taipei central area (see Figure 3), i.e. the observations are mostly sampled at high concentration areas.

Figures 6-8 show the results from kriging methods, i.e. OK and BK with two spatial resolutions, are relatively similar. Among them, the standard deviation of estimation variability is lower than 5% of PM10 level. The distributions of MSDs among kriging methods in both space and time are relatively close to each other shown in Figure 7 (c) and (d). Among them, the areal estimation variability is further reduced by using BK instead of OK that their 95% confidence ranges are 2.8 and 6.8μg/m3, respectively. Because of the short spatial influential range of local PM10 transport, i.e. 2000m (see Eq. (3)), it implies the high spatial variability of PM10 distribution within the townships, i.e. point estimation results can vary significantly from location to location within townships. As a result, the consideration of geographical characteristics can effectively improve the understanding of the areal average level of PM10 concentration. This effect can be especially obvious at the townships containing and surrounding by multiple stations, e.g. San-chung township (the township with two stations), because the nature of kriging methods completely appreciates the observations and therefore result in the higher variability of estimation results among these townships. It should be noted that all the estimations by the present spatial interpolation techniques only depend upon the PM10 observations. The general characteristics of these methods can only provide reasonable estimations within the convex of data locations (Olea, 1999).


5. Conclusions

This study applies several popular spatial techniques to assess the average areal PM10 level of townships in Taipei area. The comparison shows that the importance of the inclusion of space-time metric among the observation and estimation locations as well as the consideration of spatiotemporal dependence among the observations for the estimations. This study shows the consideration of the shape and size of the townships is important to the performance of the estimations of areal average concentration. The MSDs between the estimations by kriging methods with and without considering the geographical characteristics of townships are up to 6% and 2% of the common PM10 level across space and time, respectively. This study provides insights for the impact of geographical support to the exposure estimation in Taipei for environmental epidemiological studies.



This research was supported by the National Science Council of Taiwan (NSC 99-2625-M-002-007-).


  1. 1. AbbeyD. E.LebowitzM. D.MillsP. K.PetersenF. F.BeesonW. L.BurchetteR. J.1995Long-Term Ambient Concentrations of Particulates and Oxidants and Development of Chronic Disease in a Cohort of Nonsmoking California Residents. Inhalation Toxicology 1995; 71934
  2. 2. AckermannLiebrich. al.Lung function and long term exposure to air pollutants in Switzerland. American Journal of Respiratory and Critical Care Medicine 1997155122129
  3. 3. BasuR.WoodruffT. J.JDParkerSaulnier. L.SchoendorfK. C.Comparing exposure metrics in the relationship between PM2.5 and birth weight in California.Journal of Exposure Analysis and Environmental Epidemiology200414391396
  4. 4. BeesonW. L.AbbeyD. E.KnutsenS. F.1998Long-term concentrations of ambient air pollutants and incident lung cancer in California adults: Results from the AHSMOG study. Environmental Health Perspectives 1998; 106813822
  5. 5. BrauerM.HoekG.van al.Estimating long-term average particulate air pollution concentrations: Application of traffic indicators and geographic information systemsEpidemiology200314228239
  6. 6. BrimicombeA.Constructing and evaluating contextual indices using GIS: a case of primary school performance tablesEnvironment and Planning A20003219091933
  7. 7. BuzzelliM.JerrettM.Racial gradients of ambient air pollution exposure in Hamilton, CanadaEnvironment and Planning A20043618551876
  8. 8. ChangS. C.LeeC. T.2003Evaluation of the trend of air quality in Taipei, Taiwan from 1994 to 2003. Environmental Monitoring and Assessment 2007; 1278796
  9. 9. ChenJ. C.SchwartzJ.Metabolic syndrome and inflammatory responses to long-term particulate air pollutantsEnvironmental Health Perspectives2008116612617
  10. 10. ChenJ. C.SchwartzJ.Neurobehavioral effects of ambient air pollution on cognitive performance in US adultsNeurotoxicology200930231239
  11. 11. ChristakosG.2000Modern Spatiotemporal Geostatistics. New York, NY: Oxford Univ. Press, 2000.
  12. 12. ChristakosG.BogaertP.SerreM. L.2002Temporal GIS: Advanced Functions for Field-Based Applications. New York, NY: Springer-Verlag, 2002.
  13. 13. ChristakosG.HristopulosD. T.BogaertP.On the physical geometry concept at the basis of space/time geostatistical hydrologyAdvances in Water Resources200023799810
  14. 14. ClevelandW. S.1979Robust Locally Weighted Regression and Smoothing Scatterplots. Journal of the American Statistical Association 1979; 74829836
  15. 15. ClevelandW. S.DevlinS. J.1988Locally Weighted Regression- an Approach to Regression-Analysis by Local Fitting. Journal of the American Statistical Association 1988; 83596610
  16. 16. GoovaertsP.Geostatistics for natural resources evaluationNew York: Oxford University Press, 1997
  17. 17. GoovaertsP.Kriging and semivariogram deconvolution in the presence of irregular geographical unitsMathematical Geosciences200840101128
  18. 18. HendryxM.FedorkoE.Anesetti-RothermelA.2010A geographical information system-based analysis of cancer mortality and population exposure to coal mining activities in West Virginia, United States of America. Geospatial Health 2010; 4243256
  19. 19. HoekG.FischerP.Van denBrandt. P.GoldbohmS.BrunekreefB.Estimation of long-term average exposure to outdoor air pollution for a cohort study on mortality. Journal of Exposure Analysis and Environmental Epidemiology 200111459469
  20. 20. JerrettM.BurnettR. T.MaPopeR. J.CAKrewskiD.NewboldK. al.Spatial analysis of air pollution and mortality in Los AngelesEpidemiology200516727736
  21. 21. Lertxundi-ManterolaA.SaezM.Modelling of nitrogen dioxide (2 and fine particulate matter (PM10) air pollution in the metropolitan areas of Barcelona and Bilbao, Spain. Environmetrics 2009;20477493
  22. 22. MaheswaranR.ElliottP.Stroke mortality associated with living near main roads in England and Wales- A geographical study. Stroke 20033427762780
  23. 23. al.Adult and childhood leukemia near a high-power radio station in Rome, ItalyAmerican Journal of Epidemiology200215510961103
  24. 24. MillerK. A.DSSiscovickSheppard. L.ShepherdK.SullivanJ. H.AndersonG. al.Long-term exposure to air pollution and incidence of cardiovascular events in womenNew England Journal of Medicine2007356447458
  25. 25. OleaR. A.1999Geostatistics for engineers and earth scientists. Boston: Kluwer Academic Publishers, 1999.
  26. 26. PopeC. A.BurnettR. T.ThunM. J.CalleE. al.2002Lung cancer, cardiopulmonary mortality, and long-term exposure to fine particulate air pollution. Jama-Journal of the American Medical Association 2002; 28711321141
  27. 27. SohelN.KanaroglouP. S.PerssonL. A.HaqM. Z.RahmanM.VahterM.Spatial modelling of individual arsenic exposure via well water: evaluation of arsenic in urine, main water source and influence of neighbourhood water sources in rural BangladeshJournal of Environmental Monitoring20101213411348
  28. 28. SonJ. Y.BellM. L.LeeJ. T.2010Individual exposure to air pollution and lung function in Korea Spatial analysis using multiple exposure approaches. Environmental Research 2010; 110739749
  29. 29. TWEPA.Air quality in Taiwan annual report. Taiwan Environmental Protection Agency, Taipei, 2006
  30. 30. USEPA.Technology Transfer Network:Air Quality System(AQS). U.S. Environmental Protection Agency, 1992
  31. 31. USEPA. AQS Raw Data Summary Formulas Draft.In: Standards OoAQPa, editor. U.S. Environmental Protection Agency, Washington, DC, 2004
  32. 32. WallerL. A.GotwayC. A.2004Applied spatial statistics for public health data. Hoboken, N.J.: John Wiley & Sons, 2004.
  33. 33. YoungL. J.CAGotwayKearney. G.DuclosC.Assessing Uncertainty in Support-Adjusted Spatial Misalignment ProblemsCommunications in Statistics-Theory and Methods 20093832493264
  34. 34. YoungL. J.CAGotwayYang. J.KearneyG.DuClos. C.Assessing the association between environmental impacts and health outcomes: A case study from FloridaStatistics in Medicine20082739984015

Written By

Hwa-Lung Yu, Shang-Chen Ku, Chiang-Hsing Yang, Tsun-Jen Cheng and Likwang Chen

Submitted: 21 October 2010 Published: 17 August 2011