Noise cancellation techniques have been developed within several areas of signal processing, such as homomorphic signal processing, sensor array signal processing and statistical signal processing. Representative examples are the kepstrum (also known as complex cepstrum) method, beamforming and ANC (adaptive noise cancelling), respectively, as shown in Fig. 1.
Based on the two-microphone approach, the applications can be grouped into three methods, based on identification of the unknown system in the acoustic channels, adaptive speech beamforming and adaptive noise cancellation. These can be described by a generalized block diagram with three sub-blocks, as shown in Fig. 2, which comprises three processing stages (kepstrum (complex cepstrum), beamforming and ANC) and two structures (beamforming and ANC).
Kepstrum - estimation of acoustic path transfer functions
From the output of the sensor array, the acoustic transfer functions of the acoustic channels are estimated as noise statistics during the noise period, and the estimate is then applied during the speech and noise period for noise cancellation. It can be applied as a preprocessor to the second processing stage, beamforming, or directly to the third processing stage, ANC. Applications can be found in (Moir & Barrett, 2003; Jeong & Moir, 2008), where the unknown system has been estimated as the ratio of the two acoustic transfer functions between each microphone and the noise source. The kepstrum filter is used as the estimate of the unknown system and is applied in front of the SS (sum and subtract) functions in the beamforming structure (Jeong & Moir, 2008).
Beamforming - adaptive filter 1, delay filter 1 and SS functions
The beamforming structure contains the SS functions, which act as a signal separator and enhancer by summing and subtracting the signals of the two microphone inputs (Griffiths & Jim, 1982). Adaptive filter 1 is placed in front of the SS functions and used as a speech beamforming filter (Compernolle, 1990). It serves as a beam steering input, and hence forms a DS (delay and sum) beamformer in the primary input during the speech period detected by a VAD (voice activity detector); its output is then applied to the third stage, ANC, as an enhanced primary input. Both output signals from the SS functions are divided by the number of microphones used (for two microphones, each output is scaled by 0.5). Alternatively, adaptive filter 1 can be used as a first adaptive noise canceller. In this application, its output is the noise reference input to the next cascaded adaptive filter 2 during the noise period detected by the VAD (Wallace & Goubran, 1992). Based on the same structure, a two-stage adaptive filtering scheme has been introduced (Berghe and Wouters, 1998). As a speech directivity function, a GCC (generalized cross-correlation) based TDOA (time difference of arrival) function may alternatively be used instead of adaptive filter 1 in the beamforming structure (Knapp & Carter, 1976).
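The sum and subtract operation with the 0.5 scaling for two microphones can be illustrated with a minimal sketch. The function name, the test tone and the noise levels are assumptions made for illustration only; the sketch also assumes an ideal broadside source that reaches both microphones identically, which is not guaranteed in a real room.

```python
import numpy as np

def sum_and_subtract(x1, x2):
    """Return the sum and difference channels, each scaled by 1/M (M = 2 microphones)."""
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    primary = 0.5 * (x1 + x2)    # sum channel: common (speech) component is kept
    reference = 0.5 * (x1 - x2)  # difference channel: common component is removed
    return primary, reference

# Hypothetical example: identical speech at both microphones, independent noise at each.
rng = np.random.default_rng(0)
speech = np.sin(2 * np.pi * 500 * np.arange(1024) / 22050)
n1 = 0.1 * rng.standard_normal(1024)
n2 = 0.1 * rng.standard_normal(1024)
primary, reference = sum_and_subtract(speech + n1, speech + n2)
```

Under these idealized assumptions the difference channel contains no speech at all, which is why it can serve as a noise reference for a following adaptive stage.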
ANC - adaptive filter 2 and delay filter 2
The last part of the block diagram shows the ANC method (Widrow et al., 1975), which consists of adaptive filter 2 and delay filter 2. The adaptive filter generally uses the FIR (finite impulse response) LMS (least mean square) algorithm in signal processing or the IIR (infinite impulse response) RLS (recursive least squares) algorithm in adaptive control for noise cancellation in the MMSE (minimum mean square error) sense. Depending on the application, the two algorithms trade off performance against computational complexity. It has been reported that RLS gives, on average, two-tenths of a decibel SNR (signal to noise ratio) improvement over the LMS algorithm (Harrison et al., 1986), but at a much higher computational cost. Delay filter 2 is used to compensate for noncausality, so that causality is maintained.
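As a rough illustration of the ANC stage, the following sketch implements an FIR LMS noise canceller with a delay in the primary path. It is a minimal sketch under stated assumptions, not the configuration used in the cited work: the function name, the step size, the filter length, the delay and the two-tap noise path are all hypothetical, and the primary input contains noise only so that convergence is easy to inspect.

```python
import numpy as np

def lms_anc(primary, reference, n_taps=8, mu=0.01, delay=0):
    """FIR LMS noise canceller. Returns the error (cleaned) signal and final weights."""
    primary = np.asarray(primary, dtype=float)
    reference = np.asarray(reference, dtype=float)
    # Delay the primary input so that the adaptive filter can stay causal.
    d = np.concatenate((np.zeros(delay), primary))[:len(primary)]
    w = np.zeros(n_taps)
    u = np.zeros(n_taps)           # tap-delay line of the reference input
    e = np.zeros(len(d))
    for n in range(len(d)):
        u = np.roll(u, 1)
        u[0] = reference[n]
        e[n] = d[n] - w @ u        # canceller output = primary minus noise estimate
        w += 2.0 * mu * e[n] * u   # LMS weight update
    return e, w

# Hypothetical noise path between the reference and primary microphones.
rng = np.random.default_rng(0)
noise = rng.standard_normal(5000)
path = np.array([0.6, -0.3])
primary = np.convolve(noise, path)[:len(noise)]   # noise only, no speech
error, weights = lms_anc(primary, noise, n_taps=6, mu=0.01, delay=2)
```

With a two-sample delay in the primary path, the converged weights are expected to reproduce the assumed noise path shifted by two taps.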
As described above, the techniques have been developed on the basis of these methods and structures. Building on this analysis, a kepstrum noise cancelling technique has been studied, in which the kepstrum is used to identify the acoustic transfer functions between two microphones, and the kepstrum coefficients from the ratio of the two acoustic transfer functions are applied in front of an adaptive beamforming structure for noise cancellation and speech enhancement (Jeong & Moir, 2008). Furthermore, using the fact that a random signal plus noise may be represented as the output of a normalized minimum phase spectral factor driven by an innovations white-noise input (Kalman & Bucy, 1961), the application of an innovations-based whitened form (here called the inverse kepstrum) has been investigated in simulation tests, where the front-end inverse kepstrum has been analyzed with a cascaded FIR LMS algorithm (Jeong, 2009) and with an FIR RLS algorithm (Jeong, 2010a; 2010b), both in the ANC structure for noise cancellation.
In this paper, for practical real-time processing using the RLS algorithm, the analysis of the innovations-based whitening filter (inverse kepstrum) is extended to the beamforming structure and tested in a realistic environment. The simulation test will show that the overall estimate from front-end inverse kepstrum processing with a cascaded FIR RLS approximates the estimate of the IIR RLS algorithm in the ANC structure. This provides an alternative to the computational complexity of an ANC application using a pole-zero IIR RLS filter, which is mostly unacceptable in practical applications. For application in a realistic environment, the method has been applied to the beamforming structure for effective noise cancelling, and it will be shown that the front-end kepstrum application with a zero-model FIR RLS provides even better performance than the pole-zero model IIR RLS algorithm in the ANC structure.
2. Analysis of optimum IIR Wiener filtering and the application to two-microphone noise cancelling approach
For the IIR Wiener filtering approach, the z-transform of the optimum LS (least squares) filter is constrained to be causal but is potentially of infinite duration; it has been defined by (Kailath, 1968) as
From equation (1), it may be regarded as a cascaded form of transfer functions
From the optimum Wiener filtering structure, the innovations process can be obtained by the inverse of the spectral factor
It can be applied to the two-microphone noise cancelling structure as the optimum IIR Wiener filtering approach, as shown in Fig. 4.
3. Front-end whitening filter and cascaded adaptive FIR RLS filter
To obtain the innovations-based whitened sequence, the inverse kepstrum filter is used as a whitening filter. This section describes the whitening procedure by kepstrum processing as the front-end application and gives an overview of the FIR RLS filter as the rear-end application to the beamforming structure (Jeong, 2010b).
3.1. The innovations-based whitening filter
Fig. 5 shows that the generating input model may be whitened to an innovations white-noise sequence by applying the inverse of the minimum phase spectral factor to the input signal plus noise.
To obtain the innovations white noise, the processing procedure is as follows:
Step 1: Take the periodogram (P) from the FFT (fast Fourier transform) of the input signal.
Step 2: Obtain the kepstrum coefficients from the inverse FFT (IFFT) of the logarithm of the periodogram, where Euler's constant $\gamma$ = 0.577215 is added to the logarithm of the periodogram to make the estimate unbiased.
Step 3: Negate the obtained kepstrum coefficients, because the logarithm of the inverse minimum phase transfer function is the negative of the logarithm of the transfer function itself.
Step 4: Normalize the negated kepstrum coefficients.
Step 5: Truncate the sequence to less than half the frame size, and then halve the zeroth coefficient.
Step 6: Convert the result to an impulse response by the recursive formula (Silvia & Robinson, 1978) as:
Step 7: Finally, convolve the impulse response (5) with the input signal to obtain the innovations whitened sequence.
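The whitening procedure can be sketched in a few lines of NumPy. This is a minimal sketch under stated assumptions rather than the implementation used in the tests. The function name is hypothetical. The recursion is the standard cepstrum-to-impulse-response recursion, $n\,h(n) = \sum_{m=1}^{n} m\,k(m)\,h(n-m)$ with $h(0) = \exp(k(0))$, which is assumed here to correspond to the cited formula. The normalization step is not specified in detail above and is therefore omitted, so the gain term $\exp(k(0))$ is retained. The AR(1) test signal is also an assumption, chosen so that the ideal whitening filter is known to be $1 - 0.5z^{-1}$.

```python
import numpy as np

EULER_GAMMA = 0.577215  # value quoted in the procedure

def inverse_kepstrum_filter(x, n_coeffs):
    """Estimate a whitening FIR impulse response from one frame x (sketch)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    if not 0 < n_coeffs < n // 2:
        raise ValueError("n_coeffs must be positive and less than half the frame size")
    # Periodogram from the FFT of the frame (guarded against log(0)).
    periodogram = np.abs(np.fft.fft(x)) ** 2 / n
    periodogram = np.maximum(periodogram, np.finfo(float).tiny)
    # Kepstrum: IFFT of the log periodogram, with Euler's constant added for bias.
    kepstrum = np.real(np.fft.ifft(np.log(periodogram) + EULER_GAMMA))
    # Negate to obtain the inverse (whitening) filter, truncate, halve the zeroth term.
    k = -kepstrum[:n_coeffs]
    k[0] *= 0.5
    # Convert the coefficients to an impulse response by recursion (assumed form).
    h = np.zeros(n_coeffs)
    h[0] = np.exp(k[0])
    for i in range(1, n_coeffs):
        m = np.arange(1, i + 1)
        h[i] = np.sum(m * k[m] * h[i - m]) / i
    return h

# Hypothetical test input: white noise coloured by the AR(1) filter 1 / (1 - 0.5 z^-1).
rng = np.random.default_rng(1)
e = rng.standard_normal(16384)
x = np.empty_like(e)
x[0] = e[0]
for n in range(1, len(e)):
    x[n] = e[n] + 0.5 * x[n - 1]

h = inverse_kepstrum_filter(x, n_coeffs=8)
whitened = np.convolve(x, h)[:len(x)]   # innovations (whitened) sequence
```

A single long frame is used here only to keep the estimate variance low in the example; the frame sizes reported for the tests are shorter.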
3.2. The FIR RLS algorithm
The RLS algorithm estimates the inverse of the autocorrelation matrix of the input vector, and it requires information from all previous input data (Haykin, 1996).
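The recursive least squares method minimizes the exponentially weighted sum of squares of the error signal $e(i)$ and searches directly for the minimum of the cost function:
$$J(n) = \sum_{i=1}^{n} \lambda^{\,n-i}\, e^{2}(i) \tag{6}$$
where $\lambda$ is the exponentially weighted forgetting factor, $0 < \lambda \le 1$.
The resulting equation for the optimum filter weights at time $n$ is the normal equation:
$$\mathbf{R}(n)\,\mathbf{w}(n) = \mathbf{p}(n) \tag{7}$$
where the autocorrelation matrix is $\mathbf{R}(n) = \sum_{i=1}^{n} \lambda^{\,n-i}\,\mathbf{u}(i)\mathbf{u}^{T}(i)$ and the cross-correlation vector is $\mathbf{p}(n) = \sum_{i=1}^{n} \lambda^{\,n-i}\,\mathbf{u}(i)\,d(i)$, with input vector $\mathbf{u}(i) = [u(i), u(i-1), \ldots, u(i-M+1)]^{T}$ and desired response $d(i)$.
Both $\mathbf{R}(n)$ and $\mathbf{p}(n)$ can be computed recursively:
$$\mathbf{R}(n) = \lambda\,\mathbf{R}(n-1) + \mathbf{u}(n)\mathbf{u}^{T}(n), \qquad \mathbf{p}(n) = \lambda\,\mathbf{p}(n-1) + \mathbf{u}(n)\,d(n) \tag{8}$$
Applying the matrix inversion lemma to the recursion for $\mathbf{R}(n)$ gives
$$\mathbf{R}^{-1}(n) = \lambda^{-1}\left[\mathbf{R}^{-1}(n-1) - \mathbf{k}(n)\,\mathbf{u}^{T}(n)\,\mathbf{R}^{-1}(n-1)\right] \tag{9}$$
where the gain vector is $\mathbf{k}(n) = \dfrac{\lambda^{-1}\mathbf{R}^{-1}(n-1)\,\mathbf{u}(n)}{1 + \lambda^{-1}\mathbf{u}^{T}(n)\,\mathbf{R}^{-1}(n-1)\,\mathbf{u}(n)}$.
Equation (9) is known as the ordinary RLS algorithm, and it is valid for FIR filters because no assumption is made about the input data. The weight update equation then follows as:
$$\mathbf{w}(n) = \mathbf{w}(n-1) + \mathbf{k}(n)\left[d(n) - \mathbf{w}^{T}(n-1)\,\mathbf{u}(n)\right] \tag{10}$$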
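A minimal sketch of the exponentially weighted FIR RLS recursion may help to make the update order concrete. It follows the standard textbook form of the algorithm; the function name, the initialization constant `delta`, the forgetting factor value and the first-order test system are assumptions introduced for illustration.

```python
import numpy as np

def fir_rls(x, d, n_taps, lam=0.99, delta=100.0):
    """Exponentially weighted FIR RLS. Returns final weights and a priori errors."""
    x = np.asarray(x, dtype=float)
    d = np.asarray(d, dtype=float)
    w = np.zeros(n_taps)
    P = delta * np.eye(n_taps)   # estimate of the inverse autocorrelation matrix
    u = np.zeros(n_taps)         # tap-delay line holding the latest input samples
    e = np.zeros(len(x))
    for n in range(len(x)):
        u = np.roll(u, 1)
        u[0] = x[n]
        Pu = P @ u
        k = Pu / (lam + u @ Pu)              # gain vector
        e[n] = d[n] - w @ u                  # a priori estimation error
        w = w + k * e[n]                     # weight update
        P = (P - np.outer(k, Pu)) / lam      # inverse autocorrelation update (P symmetric)
    return w, e

# Hypothetical first-order zero model to be identified.
rng = np.random.default_rng(0)
x = rng.standard_normal(2000)
d = np.convolve(x, [1.0, 1.5])[:len(x)]
w, e = fir_rls(x, d, n_taps=2)
```

Each iteration costs on the order of the square of the number of taps, which is the computational burden referred to when RLS is compared with LMS.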
4. Application to noise cancelling
Adaptive filters, such as the FIR LMS filter (Widrow & Hoff, 1960) or the IIR RLS filter (Ljung & Söderström, 1987), are used to estimate the two acoustic path transfer functions between each microphone input and the noise source. The unknown system is represented as the ratio of these two transfer functions in the two-microphone ANC approach, as shown in Fig. 6 (A). The front-end whitening application is used to estimate the inverse of the acoustic path transfer function in the reference input, as shown in Fig. 6 (B), where the cascaded adaptive filter is used to estimate the acoustic path transfer function in the primary microphone input.
In this paper, the inverse kepstrum filter is used as the whitening filter in front of the SS functions, and the FIR RLS algorithm is used as the rear-end spectral shaping adaptive filter in the two-microphone beamforming structure, as shown in Fig. 7. As an alternative approach, the system identification based kepstrum method has been studied in the beamforming structure (Jeong & Moir, 2008).
The objective is to analyze the operation of the front-end innovations-based whitening method and the rear-end FIR RLS filter in both the ANC and beamforming structures. For the simulation test, 2 kepstrum coefficients and a first-order zero-model RLS have been used, which are compared with a pole-zero model IIR RLS with a first-order numerator polynomial and a first-order denominator polynomial in the ANC structure. On this basis, the method is tested in the beamforming structure for real-time processing in a realistic room environment, where its noise cancelling performance is compared with the typical IIR RLS method in the ANC structure. For the signal plus noise application, a simple sinusoidal waveform (consisting of 500 Hz, 550 Hz and 700 Hz components) has been selected as the desired signal, standing in for a speech signal, and real recorded data has been used as the noise signal. For the processing, two FFT frame sizes have been used (2048 points in the simulation test and 4096 points in the real test), with a sampling frequency of 22050 Hz, giving a Nyquist frequency of 11025 Hz. For a precise test, the program stops the estimation and freezes both the kepstrum coefficients and the adaptive (FIR and IIR RLS) filter weights when the desired speech signal is applied (Jeong, 2010a; 2010b). The frozen coefficients and weights are then applied during the desired signal and noise periods. For the test in a real environment, two unidirectional microphones (5 cm apart) in a broadside configuration have been set up and tested in a corner of a room (3.8 m (d) x 3 m (w) x 2.8 m (h)) with moderate reverberation.
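The test-signal parameters quoted above can be checked with a short sketch. The equal amplitudes and zero phases of the three tones are assumptions, since only the frequencies are stated, and the variable and function names are hypothetical.

```python
import numpy as np

FS_HZ = 22050                       # sampling frequency quoted in the text
TONES_HZ = (500.0, 550.0, 700.0)    # components of the desired signal

def three_tone_signal(n_samples, fs_hz=FS_HZ, tones_hz=TONES_HZ):
    """Sum of unit-amplitude sinusoids (amplitudes and phases are assumed)."""
    t = np.arange(n_samples) / fs_hz
    return sum(np.sin(2 * np.pi * f * t) for f in tones_hz)

nyquist_hz = FS_HZ / 2              # half the sampling frequency
bin_hz_sim = FS_HZ / 2048           # FFT bin spacing for the simulation frame size
bin_hz_real = FS_HZ / 4096          # FFT bin spacing for the real-test frame size

frame = three_tone_signal(2048)
spectrum = np.abs(np.fft.rfft(frame))
peak_hz = np.argmax(spectrum) * bin_hz_sim   # frequency of the strongest bin
```

At the simulation frame size the bin spacing is a little under 11 Hz, so the 500 Hz and 550 Hz components are separated by only a few bins; the larger frame used in the real test halves the spacing.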
5.1. Simulation test in ANC structure
The noise characteristic between the two microphones is estimated as the ratio of the two acoustic path transfer functions, where the front-end innovations kepstrum estimates the minimum phase term of the denominator polynomial and the zero-model FIR RLS algorithm of the cascaded adaptive filter estimates the remaining numerator polynomial, as shown in Fig. 8. Both the coefficients and the weights are continuously updated during the noise periods only and frozen during the signal plus noise periods.
5.2. Operation of innovations-based whitening filter and cascaded zero-model FIR RLS filter in ANC structure
To verify the operation of the inverse kepstrum whitening filter with a nonminimum phase term in the numerator polynomial and a minimum phase term in the denominator polynomial, a simple example of the unknown system has been used, where the individual acoustic transfer functions are
Hence, which is illustrated as zero and pole in Fig. 9 (A).
Therefore, it can be described as a polynomial of:
As shown in Fig. 9 (B), the front-end inverse kepstrum estimates minimum phase term (13) in denominator polynomial and cascaded zero-model RLS estimates remaining nonminimum phase term (14) in numerator polynomial,
The methods are also compared in terms of the overall estimate, where the overall estimate (III) in (C) is obtained from the convolution of estimate (I) and estimate (II). In Table 1, (A) is the ordinary IIR RLS with a one-pole, one-zero model, (B) is its estimates, and (C) is the estimates of the front-end inverse kepstrum and cascaded FIR RLS. From this observation, the innovations-based inverse kepstrum gives an approximation to the ordinary IIR RLS, which is also verified in Fig. 9.
5.3. Simulation test in beamforming structure
SS functions in beamforming structure and
adaptive filter in ANC structure as shown in Fig. 10.
Without application of whitening filter, acoustic path transfer function is estimated by adaptive filter
5.4. Operation of front-end innovations-based whitening filter and rear-end zero-model FIR RLS filter in beamforming structure
With the same unknown system (11) as in the ANC structure, the operation of the inverse kepstrum whitening filter in front of the SS functions in the beamforming structure is the same as that (13) in the ANC structure.
The FIR RLS filter is then estimated, which gives a weight value of 0.75. This weight is half the original weight value of 1.5. Fig. 12 shows the pole-zero locations for different weight values, where the values are (A) 0.2, (B) 1, (C) 1.5 and (D) 2. With the use of three inverse kepstrum coefficients, as shown in Fig. 12, the adaptive FIR RLS weight approximates half of each value, namely (A) 0.1, (B) 0.5, (C) 0.75 and (D) 1, respectively.
5.5. Test of noise cancellation on signal plus noise for real-time processing in a realistic environment
For real-time processing in a realistic room environment, tests have been carried out to compare
(i) the noise cancelling performance at each step in the beamforming structure,
(ii) the performance of the front-end whitening application between the ANC structure and the beamforming structure, and finally
(iii) the noise cancelling performance in noise and signal plus noise periods between the ordinary ANC approach using IIR RLS in the ANC structure and the front-end whitening approach with FIR RLS in the beamforming structure.
Firstly, as shown in Fig. 13, the noise cancelling performance has been measured at each processing stage, namely
(i) the inverse kepstrum filter output,
(ii) the overall output with application of the inverse kepstrum filter only, and
(iii) the overall output with application of the inverse kepstrum filter and the FIR RLS filter,
taken from the corresponding points in Fig. 10. For this test, 32 inverse kepstrum coefficients have been processed with an FFT frame size of 4096.
On this basis, it is found that the inverse kepstrum filter works well in the beamforming structure. Secondly, with the inverse kepstrum filter applied alone, its noise cancelling performance has been tested in (A) the ANC structure and compared with that in (B) the beamforming structure, as shown in Fig. 14. The test shows that the inverse kepstrum is more effective in the beamforming structure than in the ANC structure.
Thirdly, the average power spectrum has been compared between IIR RLS in the ANC structure and the front-end inverse kepstrum filter with rear-end FIR RLS in the beamforming structure. The test result shows that the inverse kepstrum approach provides better noise cancelling performance in the frequency range above 1000 Hz, both for the noise-alone period and for the signal plus noise period, as shown in Fig. 15.
The simulation test has shown that applying the front-end innovations-based whitening (inverse kepstrum method) ahead of a cascaded zero-model FIR RLS algorithm in the ANC structure achieves almost the same convergence performance as the pole-zero model IIR RLS in the ANC structure. For more effective performance in a realistic environment, the front-end whitening application with rear-end FIR RLS in the beamforming structure has shown better noise cancelling performance than the ordinary approach using the pole-zero model IIR RLS in the ANC structure. Therefore, for real-time processing, the front-end whitening application could provide an effective solution owing to the reduced computational complexity of inverse kepstrum processing using the FFT/IFFT, which could be a benefit over the sole application of the IIR RLS algorithm.
This work was supported in part by the UTM Institutional Grant vote number 77523.