Open access peer-reviewed chapter

Adaptive Filters for Processing Water Level Data

By Natasa Reljin, Dragoljub Pokrajac and Michael Reiter

Submitted: October 7th 2010Reviewed: February 9th 2011Published: July 5th 2011

DOI: 10.5772/16253

Downloaded: 1375

1. Introduction

Salt marshes are composed of various habitats contributing to high levels of habitat diversity and increased productivity (Kennish, 2002; Zharikov et al., 2005), making them among the most productive ecosystems on the Earth. The salt marsh consists of a halophytic vegetation community growing near saline waters (Mitsch & Gosselink, 2000) characterized by grasses, herbs, and low shrubs (Adam, 2002). Salt marshes exist between the upper limit of the high tide and the lower limit of the mean high water tide (Adam, 2002). They represent an important factor in the support of surrounding food chains, and due to the high level of productivity their economic and aesthetic value is increasing (Delaware Department of Natural Resources and Environmental Control, 2002; Zharikov et al. 2005). The survival and reproduction of many species of commercial fish and shellfish is dependent upon salt marshes (Zharikov & Skilleter, 2004). In addition, salt marshes provide critical habitat and food supply to crustaceans (Zharikov et al., 2005) and shorebirds (Potter et al., 1991). They are often considered as a primary indicator of the ecosystem health (Zhang et al., 1997). Because of their ability to transfer and store nutrients, salt marshes are an important factor in the maintenance and improvement of water quality (Delaware Department of Natural Resources and Environmental Control, 2002; Zhang et al., 1997). In addition, they provide significant economic value as a cost-effective means of flood and erosion control (Delaware Department of Natural Resources and Environmental Control, 2002; Morris et al., 2004). This economic value makes coastal systems the site of elevated human activity (Kennish, 2002).

Determining the effects of sea level rise on tidal marsh systems is currently a very popular research area (Temmerman et al., 2004). While average sea level has increased 10-25 cm in the past century (Kennish, 2002), the Atlantic coast has experienced a sea level rise of 30 cm (Hull & Titus, 1986). Local relative sea level has risen an average rate of 0.12 cm yr-1 in the past 2000 years, but at Breakwater Harbor in Lewes, DE sea level is rising at the average rate of 0.33 cm yr-1, nearly three times that rate (Kraft et al., 1992). According to the National Academy of Sciences and the Environmental Protection Agency, sea level rise within the next century could increase 60 cm to 150 cm (Hull & Titus, 1986).

The changes in sea level rise are particularly affecting tidal marshes, since they are located between the sea and the terrestrial edge (Adam, 2002; Temmerman et al., 2004). The prediction is that sea level rise will have the most negative effect on marshes in the areas where the landward migration of the marsh is restricted by dams and levees (Rooth & Stevenson, 2000). If sea level rises the almost certain prediction of 0.5 m by 2100 and marsh migration is prevented, then more than 10,360 km2 of wetlands will be lost (Kraft et al., 1992). If the sea level rises 1 m then 16,682 km2 of coastal marsh will be lost, which is approximately 65% of all extant coastal marshes and swamps in the United States (Kraft et al., 1992).

Due to an imminent potential threat which can jeopardize the Mid-Atlantic salt marshes, it is very important to examine the effect of sea level rise on these marshes. The marshes of the St. Jones River near Dover, DE, can be considered to be typical Mid-Atlantic marshes. These marshes are located in developing watersheds characterized by dams, ponds, agricultural lands, and increasing urbanization, providing an ideal location for studying the impacts of sea level rise on salt marsh extent and location. In order to determine the effect of sea level rise on the salt marshes of the St. Jones River, the change in salt marsh composition was quantified. Unfortunately, as for most marsh locations along the Atlantic seaboard, the data on sea level rise for this area was not available for comparison with marsh condition. However, a wide data set for this area is available through a water quality monitoring program, and if it could be properly processed and analyzed it could result in sea level rise data for the location of the interest.

In this chapter, we describe the application of signal processing on the water level data from the St. Jones River watershed. The emphasis is on adaptive filtering in order to remove the influence of upstream water level on the downstream levels.

2. Data

The St. Jones River, in central Delaware, is 22.3 km long (Pokrajac et al., 2007a). It has an average mean high water depth (MHW) of 4 m in the main stem, and an average width of 15 feet. The site’s watershed area is 19,778 ha, and the tidal reaches are influenced by fresh water runoff from the urbanized area upstream. An aerial photo of the St. Jones River is shown in Fig. 1.

Figure 1.

Aerial photo of St. Jones River.

The data used in this research were obtained from the Delaware National Estuarine Research Reserve (DNERR), which collected the data as part of the System Wide Monitoring Program (SWMP) under an award from the Estuarine Reserves Division, Office of Ocean and Coastal Resource Management, National Ocean Service, and the National Oceanic and Atmospheric Administration (Pokrajac et al. 2007a, 2007b). Through SWMP, researchers collect long term water quality data from coastal locations along Delaware Bay and elsewhere in order to track trends in water quality.

The original dataset contained 57,127 measurements, taken approximately every thirty minutes using YSI 6600 Data Probes (Fig. 2) (Pokrajac et al., 2007a, 2007b). The measurements were taken from January 31, 2002 through October 31, 2005. In order to determine if sea level rise is influencing the St. Jones River, the water level data were collected from two SWMP locations: Division Street and Scotton Landing (Pokrajac et al., 2007b). Probes were left in the field for two weeks at a time, collecting measurements of water level, temperature (Co), specific conductivity (mS cm-1), salinity (ppt), depth (m), turbidity (NTU), pH (pH units), dissolved oxygen percent saturation (%), and dissolved oxygen concentration (mg L-1). We used only the water level (depth) data for this study, which were collected using a non-vented sensor with a range from 0 to 9.1 m, an accuracy of 0.18 m, and a resolution of 0.001 m. Due to the fact that the probes are not vented, changes in atmospheric pressure appear as changes in depth, which results in an error of approximately 1.03 cm for every millibar change in atmospheric pressure (Mensinger, 2005). However, the exceptionally large dataset (57,127 data points) overwhelms this data error.

Figure 2.

YSI 6600 Data Probe.

The downstream location, Scotton Landing, is located at coordinates latitude 39 degrees 05’ 05.9160” N, longitude 75 degrees 27’ 38.1049” W (Fig. 3). It has been monitored by SWMP since July 1995. The average MHW depth is 3.2 m, and the river is 12 m wide (Mensinger, 2005). This location possesses a clayey silt sediment with no bottom vegetation, and has a salinity range from 1 to 30 ppt. The tidal range is from 1.26 m (spring mean) to 1.13 m (neap mean). The data collected at the Scotton Landing site are referred as downstream data (see Fig. 4).

The water level data from the Scotton Landing site alone were not sufficient. In addition to tidal forces, this site is influenced by upstream freshwater runoff, so changes in depth could not be isolated to sea level change. However, the data from a non-tidal upstream sampling site could be used for removing the upstream influence at Scotton Landing. Therefore, the data from an upstream location, Division Street, was included in the analysis. Its coordinates are latitude 39 degrees 09’ 49.4” N, longitude 75 degrees 31’ 8.7” W (see Fig. 3.). The Division Street sampling site is located in the mid portion of the St. Jones River, upstream from the Scotton Landing site. At this location, the river’s average depth is 3 m and width is 9 m. The site possesses a clayey silt sediment with no bottom vegetation, and has a salinity in the range from 0 to 28 ppt. The tidal range at this location varies from 0.855 m (spring mean) to 0.671 m (neap mean). The data were monitored from January 2002 (Mensinger, 2005). The data collected at the Division Street site are referred to as upstream data (see Fig. 4).

Figure 3.

Sampling locations for St. Jones data: Division Street (upstream data); Scotton Landing (downstream data).

Figure 4.

Original dataset (upstream and downstream data).

3. Data pre-processing

The data were sampled every Ts = 30 minutes, and the dataset consisted of “chunks” of continuous measurements. Some of the measurements were missing due to maintenance or malfunction of the equipment, probe replacement, etc. The length of the intervals with missing measurements varied between 1 h (1 missing measurement) and 1517.5 h (3036 missing measurements), but the majority of the intervals were shorter than 10 h.

Figure 5.

Spectrum of collected data before filtering (chunk 99, downstream data).

The discrete Fourier spectra (Proakis & Manolakis, 2006) of all the chunks contained three prominent peaks, which is shown in Fig. 5 using chunk 99 from the downstream data. The first peak corresponds to lunar semi-diurnal tides with a period of approximately 12.4 h, and the diurnal tides with a period of approximately 24.8 h. In addition, there is a peak that corresponds to solar tides, which have a period of approximately 12 h. These periodicities are also shown in Fig. 6.

Figure 6.

The periodicities of the downstream data.

The dataset had several problems that had to be rectified before further processing. One data sample (Sep 28, 2004, 09:00:00) had an incorrect time, which was located sometime between Sep 27, 2004, 23:30:00 and Sep 28, 2004, 00:30:00, and was corrected. Four data samples (Jul 24, 2003, 07:30:00; Jun 10, 2005, 09:00:00; Aug 11, 2005, 15:00:00; Aug 11, 2005, 15:30:00) had missing values. In addition, the number of intervals with no measurements (total of 99 “gaps” in experiment) represented a problem for signal processing (for example, for filtering). Fig. 7 shows the number of chunks as a function of the duration of the missing measurements. Due to the properties of the used data and the shortest period of 12 h, we decided to interpolate intervals shorter than 12 h. Also, we interpolated all the above mentioned samples with missing data values. The treatment of the missing values is shown in Fig. 8.

In order to interpolate data for each interval of missing measurements, first we approximated the existing data within 20 samples from the interval. We used a least squares approximation followed the combination of the 4th order polynomial and trigonometric functions:

x(t)=j=04ajtj+j=13Ajsin(2πtTj+θj)E1

Figure 7.

The number of chunks as function of the duration of missing measurements.

Figure 8.

The treatment of the missing values.

where T1 = 12.4 h, T2 = 24.8 h and T3 = 12 h. Then, we interpolated missing values using the computed approximation functions. The interpolation was performed on 866 samples, which represented less than 2% of the original number of samples. One example of the interpolated intervals is depicted in Fig. 9.

Figure 9.

An example of interpolated intervals.

The interpolation resulted in the merging of the majority of chunks, thus giving us only 11 chunks. The sizes of the new chunks were as follows: 4105, 5422, 4, 4, 7154, 14357, 10750, 5, 4491, 9423, and 2278. Three of those chunks (3, 4 and 8) have very small size, which made them suitable for discarding. Therefore, the interpolation process left us with only 8 chunks.

4. Filtering of the tidal components

We performed discrete filtering of both upstream and downstream data using the Filter Design and Analysis (FDA) Tool in Matlab Signal Processing Toolbox, v.6.2 in order to remove the tidal periodic components from the data. The first idea was to create and use the infinite impulse response (IIR) filter (Proakis & Manolakis, 2006), because it can potentially meet the design specifications with lower order than the corresponding finite impulse response (FIR) filter, which would also result in shorter time to buffer the data. However, several attempts (using the Yule-Walker method, notch or elliptic filters) didn’t achieve the expected results – the order was too high and the attenuation was less than specified (Pokrajac et al., 2007a). Hence, we designed the FIR filter. Since the spectrum of the data had peaks in two bands (see Fig. 5), two stopband filters were designed. Both of them had a passband ripple of 0.05, and the sampling frequency fs = (1/30) min-1 = 0.556 mHz (Pokrajac et al., 2007a). In order to have a stopband attenuation of at least 20 dB in the 11 – 11.4 μHz band, which corresponds to a 24.8 h period, the first created filter was of order 168. The attenuation of 40 dB in the 22.401 – 23.148 μHz band (which corresponds to periods of 12 and 12.4 h) was achieved with the second filter of order Nfilter = 354. Here, more attenuation was needed due to the very high corresponding peak in the spectrum. In Figs. 10 and 11, magnitude responses of the first and the second filters are shown. The result of applying both filters on chunk 99 and downstream data is illustrated in Fig. 12. At the beginning of each chunk, we had to discard Nfilter-1 data samples in order to perform filtering. This led to discarding less than 5% of the data. The standard deviation of the downstream data after the filtering was std(yFIR(t)) = 0.200. Also, we tried the alternative approach by applying a moving average (MA) filter of length Q = 25, which corresponds to a period of 12.4 h. Standard deviation of the downstream data after the MA filter was std(yMA(t)) = 0.223. The result of filtering the downstream data is shown in Fig. 13.

Figure 10.

Magnitude response of the first filter.

Figure 11.

Magnitude response of the second filter.

Figure 12.

Spectrum after filtering (chunk 99 and downstream data).

Figure 13.

Filtered downstream data.

5. Application of the adaptive filters

The downstream data yt can be considered as a non-stationary function of the delayed upstream data xt (see Fig. 14) (Pokrajac et al., 2007a, 2007b). It can be described as the discrete model yt=ft(xt, xt-Ts,…, xt-(L-1)Ts)+rt, where L is the maximal delay of the model and rt is the residual corresponding to the portion of the downstream data which cannot be explained by the upstream data.

Figure 14.

Removal of the upstream data influence.

If a function ft is linear, the adaptive linear model can be represented as follows:

yt=wtTxt+rtE2

where wt = [w0,twL-1,t]T are coefficients and xt = [xtxt-(L-1)Ts]T is the upstream data vector. A linear regression model could be obtained if the coefficients w are held constant (Devore, 2007):

yt=wTxt+rtE3

The coefficient of determination, R2, is usually used to measure the accuracy of the model, (Devore, 2007). It is defined as a function of averaged squared residuals and the standard deviation of the response:

R2=1r^t2¯std(yt)2E4

where the residuals are estimated with:

r^t=ytwtTxtE5

The updating of the coefficients wt in Eq. (2) is performed using the Widrow-Hoff least mean squares (LMS) algorithm (Widrow & Stearns, 1985):

wt+1=wt+2μr^txtE6

where μ represents the adjustable learning rate, and r^tis estimated using Eq. (5). In addition to the Widrow-Hoff LMS algorithm, we applied time notching by adjusting the coefficients only when all the time instants, t,..., t-(L-1)Ts, belonged to the same chunk of interpolated data (Pokrajac et al., 2007a, 2007b), see Fig. 15.

Figure 15.

Time notching in adaptive filtering.

Using the linear regression given with Eq. (3) on the data yMA(t), which is processed by the MA filter, we were able to explain only 6% of the variance, i.e. R2 = 0.06 for L = 55. In Table 1 are shown the results of obtained std(r^(t)), for different values of the learning rate and the filter delay, when the adaptive filter given with Eqs. (2), (5) and (6) is used. Useful models were obtained when std(r^(t))<std(yMA(t)) = 0.200, and are shown in the shaded boxes in Table 1. As can be seen, the best results were obtained for L = 55, μ = 0.015, which yielded to R2 = 0.37. Small values of the learning rate, combined with small filter length, lead to unsatisfactory results. On the other hand, the learning becomes unstable if the filter length and learning rates are getting large.

/L303540455055
0.010.2260.2130.2010.1900.1870.180
0.0150.2040.1900.1740.1640.1600.157
0.020.1830.1700.1591.8464.8e51.2e13

Table 1.

Std of residuals for different values of learning rate μ and the filter length L, for MA filter. Useful models are in shaded boxes.

When the designed FIR filter was used on the same data yMA(t), we received the results shown in Table 2. As can be seen, the results given in Table 1 are better. However, if L = 45 and μ = 0.015 (which yields to R2 = 0.24), the best performance of the designed FIR filter is achieved.

/L20303540455055
0.010.250.230.210.210.190.190.17
0.0150.230.200.200.180.1724.95e4
0.020.210.190.472e65e163e301e48

Table 2.

Std of residuals for different values of learning rate μ and the filter length L, for designed FIR filter. Useful models are in shaded boxes.

Fig. 16 shows the residuals in time of the three useful adaptive filters, applied on data yMA(t) and processed by using the MA filter. The particular combination of the filter parameters was different, but the residuals showed similar behavior. Table 3 provides the time intervals when the relative residuals are larger than four standard deviations for the ada ptive filter with L = 55, μ = 1.5 e-2 (Pokrajac et al., 2007b). At the beginning of the learning process, the filter

Figure 16.

Residuals of the adaptive filters applied on yMA(t) data using three different combinations of the learning rate and the filter length.

coefficients were not adapted fully, which caused the identified peaks. These peaks corresponded to observations from February 2002. In addition, there are two other peaks, corresponding to Sep 4, 2002, and Oct 25, 2004, which can be explained by a transient behavior after the notching interval.

YearBeginEndNoteYearBeginEndNote
DateTimeDateTimeDateTimeDateTime
200202/0823:0002/1000:00Learning200404/0115:3004/0218:30
02/1706:3002/1801:30Learning04/0917:0004/0918:00
07/2000:3007/2009:0005/2912:3005/2919:00
08/0701:0008/0703:3007/1220:3007/1302:00
08/0716:3008/0717:3009/1900:0009/1906:00
08/2114:3008/2121:3009/2920:3009/3006:30
09/0416:0009/0419:30Transient10/0515:3010/0520:30
09/1115:0009/1211:0010/2503:0010/2506:00Transient
12/2109:3012/2110:3011/2609:0011/2615:00
200301/0902:0001/0904:00200509/2720:0009/2804:30
10/2212:0010/2409:00

Table 3.

Identified intervals of large residuals (adaptive filtering on yMA(t) data, L = 55, μ = 1.5 e-2

6. Conclusion

We have described the application of the adaptive filtering for analyzing river hydrographic data. When determining the portion of the downstream data that is not influenced by the upstream data, the numerical results show that adaptive filtering is superior to linear regression.

Acknowledgments

This work was partially supported by the US Department of Commerce (award #NA06OAR4810164), NOAA (#NA06OAR4810164), NIH (#2P20RR016472-04), DoD/DoA (#45395-MA-ISP, #54412-CI-ISP, W81XWH-09-1-0062), and NSF (#0320991, CREST grant #HRD-0630388, #HRD-0310163).

© 2011 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike-3.0 License, which permits use, distribution and reproduction for non-commercial purposes, provided the original is properly cited and derivative works building on this content are distributed under the same license.

How to cite and reference

Link to this chapter Copy to clipboard

Cite this chapter Copy to clipboard

Natasa Reljin, Dragoljub Pokrajac and Michael Reiter (July 5th 2011). Adaptive Filters for Processing Water Level Data, Adaptive Filtering Applications, Lino Garcia, IntechOpen, DOI: 10.5772/16253. Available from:

chapter statistics

1375total chapter downloads

More statistics for editors and authors

Login to your personal dashboard for more detailed statistics on your publications.

Access personal reporting

Related Content

This Book

Next chapter

Nonlinear Adaptive Filtering to ForecastAir Pollution

By Luca Mesin, Fiammetta Orione and Eros Pasero

Related Book

First chapter

Convergence Evaluation of a Random Step-Size NLMS Adaptive Algorithm in System Identification and Channel Equalization

By Shihab Jimaa

We are IntechOpen, the world's leading publisher of Open Access books. Built by scientists, for scientists. Our readership spans scientists, professors, researchers, librarians, and students, as well as business professionals. We share our knowledge and peer-reveiwed research papers with libraries, scientific and engineering societies, and also work with corporate R&D departments and government entities.

More About Us