1. Introduction
This chapter introduces a new phase-input joint transform correlator (PiJTC) to accommodate the additive noise in face recognition. It begins with the
The authors emphasize the presentation of this model because it represents the basis for the phase-input model. A better understanding of the (FA-AJTC) facilitates the use of the proposed sine modulated fringe-adjusted phase-input joint transform correlator (FA-sinePiJTC). All the steps that help the amplitude (FA-AJTC) model performances improve, are similar in the phase model with the benefits migration, too. A short step by step example of correlation process explains and reveals the pattern recognition performances improvements of the proposed phase-input JTC model.
It is pointed out that the sine modulation function comprises a degree of freedom through the limits of the modulation domain,
Next stage of this chapter analysis the face database recognition performances of the proposed (FA-sinePiJTC) model in additive noise conditions from 10% to 50% random noise. There are presented some computer simulation results of the correlation process over face images selected from a small test database with 21 individual classes x 3 face images/class. Due to the variations of the faces in the original images there were chosen different sine modulation domains to achieve better face recognition performances in additive noisy conditions for a larger database. Thus, there were selected 102 individual classes with 5 face images/class multiplied by 5 additive noise conditions. The goal was to build up 1,560,600 cross-correlations between all original face images and all the noisy ones. Where was necessary, the face recognition performances were analysed with the error rates: equal error rate (EER), genuine acceptance rate (GAR), false acceptance rate (FAR) and false rejection rate (FRR). The receiver operating characteristic (ROC) curve was also built up.
2. Amplitude joint transform correlators (AJTC)
Vander Lugt correlator (VLC) is based on
The mathematical model of (VLC) starts considering an input image,
The L1 lens generates
where
This filter is applied to
and generates the filtered Fourier transform in the (VLC) Fourier plane (Vlad et al., 1976)
The L2 lens generates the Fourier transform (second in number) of the
There are two output cross-correlation terms. The
Pattern recognition (i.e. discrimination or detection) process fails, if the cross-correlation
Optical axis alignment, as being the major disadvantage of (VLC), generates the occurrence of the amplitude joint transform correlator (AJTC). This correlator is based on the
Here
The next equations, after the L1 lens optical Fourier transform, are
Here,
The L2 lens Fourier transform, of the
The correlation result presents the zero-order term,
Amplitude joint transform correlators (AJTC) are known as robust to embedding and additive noise, thus they are a good solution for the pattern recognition in these conditions. Also, the main disadvantage of the JTCs is that it has a very high intensity zero-order correlation peak (dc term),
It was the past issue to resolve the JTCs. To remove the zero-order term and to accomplish the non-zero order JTC (NZAJTC), many methods have been proposed: for instance, phase-shifting technique (Su & Karim, 1998; Su & Karim, 1999), joint transform power spectrum (JTPS) subtraction strategy (Cheni & Wu, 2003; Cheni & Wu, 2005; Lu et al., 1997), Mach-Zehnder method.
The (NZAJTC) based on JTPS subtraction is more time consuming than the others, as a digital operation of non-zero order term subtraction is needed (Cheni & Wu, 2003; Cheni & Wu, 2005; Lu et al., 1997 )
where
In order to achieve thin and higher correlation peaks with (NZAJTC), one must make a spatial frequency domain filtering that enhances the high order spatial frequencies which defines the reference object details.
One of the conventional methods to alleviate this issue is the use of amplitude fringe-adjusted joint transform correlator (FA-AJTC). The method applies to
where
The (FAF) creates better light diffraction efficiency because it works like inverse reference adaptive filtering. In other words, use of inverse adaptive filter increases the higher spatial frequencies and decreases the lower spatial frequencies of the
The final fringe-adjusted non-zero joint power spectrum in the Fourier plane is (Abookasis et al., 2001; Alam & Karim, 1993a; Alam & Karim, 1993b)
and generates the correlation result
According to this equation, the amplitude fringe-adjusted joint transform correlator (FA-AJTC) generates only the two anti-parallel cross-correlation “lines” ”,
All optics experimental setup of the fringe-adjusted non zero-order amplitude joint transform correlator (FA-AJTC) invokes holographic (i.e. CD media type) or high resolution photographic media. With the help of the laser coherent light, these plates have to be “written” with the joint image in the input plane and the reference fringe-adjusted filter in the Fourier plane.
Alternatively a hybrid (optical-digital) (FA-AJTC) can be built up that is implemented with amplitude spatial modulator (ASLM) and a square law image capture device (CCD camera) (Demoli et al., 1997; Sharp et al., 1999). A coherent laser beam light is projected on the (ASLM1) which is addressed by a computer with the joint image in the input plane. After the Fourier transform with the L1 lens, the joint power spectrum,
All the above steps describe the
The only drawback of the hybrid setup of (FA-AJTC) is the time consuming of the digital operations (i.e. digital addressing the (ASLM), digital image capturing with CCD camera and digital computer operations). The benefits of the hybrid experimental setup of (FA-AJTC) are the mathematical perfect filtering and processing of the digital
As follows, quantitative analysis should be done. Correlation performance criteria, that can be used to analyse the described joint transform correlators, need the definition of cross-correlation peak intensity, CPI, and the auto-correlation peak intensity, API. The ratio
A better understanding of the correlation performances of the (JTC) is accomplished by a short example. Thus, a joint image (figure 2) with the scene image gathering a reference with additive noise (50% random type – indexed F1), two morphologically distorted reference images (indexed F2 and F3) and a non-reference image (with index F4) are considered.
Correlation results, presented in figures 3 and 5, show the benefits of fringe-adjusted frequency domain filtering (figure 4). After this filtering, the output correlation peaks are sharper and the minimum value of detection efficiency coefficients calculated with CPI for F4 and APIs for F1, F2, F3, for (FA-AJTC) (DEC = 1.7501) is greater than for (NZAJTC) (DEC = 1.1944). There can be noticed that the (NZAJTC) fails to discriminate between the reference-type image, F3, and the non-reference image, F4. The F3 image generates the lowest API from the reference-image class and the DEC value is less sensible than the threshold.
3. Phase-input joint transform correlators (PiJTC)
Amplitude encoding joint transform correlators need (ASLM) with large bandwidth and thus, generate wider correlation peaks and lower light efficiency. In the holographic research, there is stated that the phase encoding needs small bandwidth of the recording media to generate higher light efficiency. These reasons have made the transition from amplitude JTC to phase-input JTC, denoted (PiJTC), worthy.
Since the appearance of phase spatial light modulators (PSLM) there were several approaches to encode the joint image in the phase domain. In early attempts direct phase encoding domain was restricted to
The (PiJTC) model is the same as the amplitude one, but it uses the phase transformation of the reference, scene and joint input image. This method, assumes that the amplitude image
where
Reference, scene and joint phase encoded images are optically Fourier transformed to obtain the reference
These steps describe the non-zero order fringe adjusted phase-input joint transform correlates (FA-PiJTC). The “phase-input” terminology comes from the fact that in the input plane of the (FA-PiJTC) a (PSLM) is used to make a direct phase projection of an amplitude input image.
In optical holography, when the object is a phase encoded one, then the whitened holographic method must be applied in order to get the best contrast and light diffraction efficiency. Similarly, in the (FA-PiJTC) the input direct projection phase encoded image must have a non-black background to achieve the theoretically predicted performances. One solution is to apply a gray level grating background at the (PSLM) pixel level. But this processing is due to the amplitude gray domain range of the input image and the phase encoding domain of the (PSLM),
Sine modulated fringe-adjusted non-zero order phase-input joint transform correlator, (FA-sinePiJTC) presents the amplitude domain pre-processing step, other than the scalar transform of the input amplitude image. In this step, as in the previous one, a scalar transformation of the input amplitude is involved. This is required by a pre-processing trigonometric function, chosen by the authors as the
The pre-processing amplitude sine function needs a definition domain within the amplitude image to be scaled in,
A better understanding of the correlation performances of the (FA-PiJTC) and (FA-sinePiJTC) is accomplished by a short example which consider the same joint image used previously for the amplitude models.
In the same way, from figure 7 and 9, it can be noticed that (FA-PiJTC) has sharper correlation peaks than (FA-AJTC) and the minimum detection efficiency coefficient value is DEC=1.5547, with
These results conclude that (FA-sinePiJTC) can be chosen for database (i.e. inter-class and intra-class) face recognition with and without additive noise (that consists in an image distortion).
4. Experimental and computer simulation aspects of phase-input fringe-adjustment joint transform correlators (PiJTC)
Previous paragraphs present the mathematical models of (FA-AJTC) and (FA-sinePiJTC) and the experimental setup for (FA-AJTC) in all optics and hybrid versions.
The following presentation emphasizes the experimental hybrid setup, the computer simulation conditions and results for (FA-sinePiJTC) model. In the
The drawbacks and benefits of the
The computer simulations were done on a face database with the sine modulated fringe-adjusted phase-input joint transform correlator, (FA-sinePiJTC). The (FA-sinePiJTC) model has a degree of freedom, the sine modulation domain limits, that can adjust the performances of the correlation process. In the computer simulations two sine modulation domains:
The face databases used are two parts from the “General Purpose Recognition and Face Recognition” section from Computer Vision Research Projects supervised by Dr. Libor Spacek from Department of Computer Science University of Essex, Colchester, UK (
The presented computer simulations use the “male” and “malestaff” directories from Faces94 (updated Friday, 16-Feb-2007 15:52:52 GMT). The subjects sit at fixed distance from the camera and are asked to speak, whilst a sequence of images is taken. The speech is used to introduce facial expression variation (see table 1).
Joint images, built up to perform the correlation process, have 512x512 pixels. The reference image is located in the upper half and two target images are located in the lower half composing the scene image.
5. Correlation performances of phase-input fringe-adjustment joint transform correlator (FA-PiJTC) in face recognition with additive noise
Computer simulations of correlation process over a database can be very time consuming. In order to avoid time loss, a step by step strategy is chosen. In this strategy a small database is tested and after that, due to the results, the decisions are applied to a larger database in order to draw final conclusions. The smaller database from directory ”malestaff” the larger the one from directory “male”.
The test face database used is organized in individual classes Figure 10. One individual class has a total of 20 images of the same person (i.e. individual). For the test database there are selected only 3 face images (figure 11) from each class of the 21 classes (figure 12).
Five additive noise conditions - with 10%, 20%, 30%, 40%, 50% random noise levels - are presented in figure 11. References, just the face parts, were subtracted from the original images to prevent the background false correlation. CIELab colour filter was used to extract the predominant green background.
Correlation performance analyse over the face database without additive noise should be done. If the pattern recognition process fails or has poor detection efficiency for the situation without additive noise, then the additive noise condition will collapse the process. At the start, autocorrelations over all class reference images were done: 21 classes x ((20x20) reference images/class). For each class only the overall (20x20=400 values) minimum value represented with diamond line - figure 13 as intra-class correlation peak intensity (ICPI) was retained.
Large domain values of ICPI can be noticed from figure 13. The main reason is that the face images were captured during a monologue. At the start of session, in the first images, the individuals take care about their posing. After a few seconds they have head motions that generate significant variations in the face images. As a consequence, the ICPI decreases very fast for 8 individual classes. In order to analyze the additive noise correlator performances, as a counter measure, the authors decide to drop the number of reference images for the test database from 20 to only 3 and the target images to just one (the first in the class).
Autocorrelations between 63 reference images (21 classes x 3 reference images) were done. The diagonal (3x3) block from the cross-correlation matrix represents the intra-class results (ICPI). For each class was retained only the overall (3x3=9 values) minimum value represented with boxed line - figure 12 - as intra-class correlation peak intensity line (ICPI). The rest of the values consist of the inter-class cross-correlations values. For each class, only the maximum values of the inter-class correlation values were retained, as the inter-class correlation peak intensity (IECPI) – represented in figure 14 with diamond line.
In the same manner, the results for cross-correlations between 63 reference images (21 classes x 3 reference images) and 105 additive noisy images (21 classes x (1 original image x 5 noise conditions)) were generated. Graphical results are presented in figure 15 as the ICPI-AN and IECPI-AN (AN from Additive Noise) correlations peak intensity lines.
Figure 14 shows that the ICPI line does not intersect with IECPI line; figure 15 shows that the ICPI-AN line does not intersect with IECPI-AN line. If the lines intersect then there is an image from an individual class that is similar with an image from different class. But, these facts prove that the (FA-sinePiJTC) is robust to random additive noise.
Figure 14/15 present gaps between the ICPI/ICPI-AN values and IECPI/IECPI-AN values along all classes. The significance of these gaps is a very good discrimination (e.g. pattern recognition) performance of the (FA-sinePiJTC). Thus, the (FA-sinePiJTC) can discriminate with high accuracy over all classes (figure 16), between faces of different classes and do the registration of the faces from the same class successfully. The accuracy (gap magnitude) decreases in additive noise conditions (figure 16) at 22.95 (a.u.), from 96.05 (a.u in the situation without additive noise.
The algorithm revealed by correlation results with the (FA-sinePiJTC) over the test face image database, can be now applied on a larger face images database. First, the (21 classes x (20x20) reference images) autocorrelations are done. The results presented in figure 17 show small values of ICPI for 10% of the classes. As consequence, only 5 reference images per individual class were considered.
With these conditions, the autocorrelations between 510 reference images (102 classes x 5 reference images) were done. The results are presented in figure 18 as the ICPI and IECPI correlations peak intensities lines.
Correlation lines, ICPI and IECPI, in figure 18 do not intersect, but the gap between them is very small. As mentioned before, the main reason is the significant variation of the faces due to the motion during the monologue.
In analogue conditions, the cross-correlations between 510 reference images (102 classes x 5 reference images) and 2550 additive noisy images (102 classes x (5 original image x 5 noise conditions)) were done. Graphical results are presented in figure 19 as ICPI-AN and IECPI-AN correlations peak intensity lines. Correlation lines ICPI-AN and IECPI-AN, intersect because no gap between them is present. This fact conclude that some individuals appear to be similar using the (FA-sinePiJTC) in the mentioned conditions.
The pattern recognition performances in this case are measured by equal error rate (EER), genuine acceptance rate (GAR), false acceptance rate (FAR) and false rejection rate (FRR) (figures 20-22). Also there must be built up the receiver operating characteristic (ROC) curve (figure 22).
6. Conclusion
This chapter presents additive noise robustness of sine modulated fringe-adjusted phase-input joint transform correlator (FA-sinePiJTC), tested over a human face images database. There were inspected two databases: a small – test database for preliminary results, and then a large database for the final results and conclusions. The architecture for the proposed JTC is the
The sine modulation function gives the correlation process a degree of freedom through the limits of the modulation domain,
Presented results of the proposed joint transform correlator algorithm were achieved with 512x512 pixels input images. The joint image has two target images in the scene section, so it works twice faster than the Vander Lugt correlator (VLC), or other single pair matching correlators. Computer simulations were done on Intel i7-870 processor at 2.93 MHz with 3 GB of RAM with an operation speed of 0.185 seconds/single reference-target correlation. The application was built up to be used as a single process (CPU thread) or as multiple processes run simultaneously in maximum 5 CPU threads. The limited number of the CPU threads used is due to RAM memory and not to the processor. In these conditions, the single reference-target correlation time drops to only 37 milliseconds and make possible the achievement of 1,560,600 single pair correlations from the large database. In this way, the statistical significance of the face recognition results is very high.
The (FA-sinePiJTC) face recognition for the test database reveals that choosing only 3 out of 20 face images at the start of capturing process provides zero ERR and FAR error rates. This is stated because of the gap between the maximum IECPI and the minimum ICPI over the entire database process, without and with random additive noise. We can conclude that the (FA-sinePiJTC) is robust to additive noise up to 50% for database face recognition.
The selection of more than 3 initial face images, to 5 face images in the individual classes, widens the variations of faces. In this situation non-zero error rates. The equal error rate appear value is EER=0.0886% for CPI value of 35.4007, and the genuine acceptance rate is GAR=99.9020% at a false acceptance rate of FAR=0.0869%. The error rates values are very low revealing high accuracy and high robustness of the (FA-sinePiJTC) at most 50% additive random noise in face recognition.
Low resolution used (200x180 pixels) in the above performance analysis validate video security systems applications.
Authors experience with the same (FA-sinePiJTC) confirm the good pattern recognition results in correlation applications involving human fingerprint and irises, mostly based on the sine modulation function, the phase-input and the fringe-adjusted filter, previously emphasized.
Future work can involve 3D face recognition systems based on (FA-sinePiJTC) correlation algorithm.
References
- 1.
Abookasis D. Arazi O. Rosen J. Javidi B. 2001 Security optical systems based on a joint transform correlator with significant output images, Opt. Eng., 40(8),1584 1589 0091-3286 - 2.
Alam M. S. Horache E. H. 2004 Optoelectronic implementation of fringe-adjusted joint transforms correlator, Journal of Optics Communications,236 1 59 67 0030-4018 - 3.
Alam M. S. Karim M. A. 1993a Fringe-adjusted joint transform correlation, Journal of Applied Optics,32 23 4344 4350 0155-9128 X (print) - 4.
Alam M. S. Karim M. A. 1993b Joint-trasform correlation under varying illumination, Journal of Applied Optics,32 23 4351 0155-9128 X - 5.
Born M. Wolf E. 1970 Principles of Optics, Pergamon Press,0-52164-222-1 York, USA - 6.
Chang H. T. Chen C. C. 2006 Fully-phase asymmetric-image verification system based on joint transform correlator, Optics Express,14 4 1458 1467 1094-4087 - 7.
Cheni C. Wu C. S. 2003 Polychromatic pattern recognition using the non-zero-order joint transform correlator with cross-correlation peak optimization, Journal of Modern Optics,50 9 1353 1364 0000-0095 M340 - 8.
Cheni C. Wu C. S. 2005 Modification of training images for the optimisation of the nonzero order joint transform correlator, Journal of Modern Optics,52 1 21 32 0000-0095 M340 print - 9.
(Cohn R.W. Liang M. 1996 ) Pseudorandom phase-only encoding of real-time spatial light modulators, Journal of Applied Optics,35 ,14 ,2488 2498 ,1559-128X - 10.
Demoli N. Dajms U. Gruber H. Wernicke G. 1997 Influence of flatness distorsion on the output of a liquid- crystal- television- based joint transform correlator system, Journal of Applied Optics,36 32 8417 8426 0155-9128 X - 11.
Huang X. Lai H. Gao Z. 1997 Multiple-target detection with use of a modified amplitude-modulated joint transform correlator, Journal of Applied Optics,36 35 9198 9204 0155-9128 X (print) - 12.
Javidi B. Kuo C. J. 1988 Joint transform image correlation using a binary spatial light modulator at the Fourier plane, Journal of Applied Optics,27 4 663 665 0155-9128 X (print) - 13.
Javidi B. Tang Q. 1994 Chirp-encoded joint transform correlators with a single input plane, Journal of Applied Optics,33 2 227 230 0155-9128 X - 14.
Jutamulia S. 1994 Phase-only Fourier transform of an optical transparency, Journal of Applied Optics,33 2 280 282 0155-9128 X - 15.
Labastida I. Carnicer A. Martin-Badosa E. Santiago V. Ignacio J. 2000 Optical correlation by use of partial phase-only modulation with VGA liquid-crystal displays, Journal of Apllied Optics,39 5 766 769 0155-9128 X - 16.
Lu G. Yu F. T. S. 1996 Performance of a phase-transformed input joint transform correlator, Journal of Applied Optics,35 2 304 313 0155-9128 X - 17.
Lu G. Zhang Z. Wu S. Yu F. T. S. 1997 Implementation of a non-zero-order joint-transform correlator by use of phase-shifting techniques, Journal of Applied Optics,36 2 470 483 0155-9128 X - 18.
Nomura T. 1998 Phase-encoded joint transform correlator to reduce the influence of extraneous signals, Journal of Applied Optics,37 17 3651 3655 0155-9128 X - 19.
Rosen J. 1998 Three- dimensional joint transform correlator, Journal of Applied Optics,37 32 7538 7544 0155-9128 X - 20.
Sharp J. H. Budgett D. M. Slack T. G. Scott B. F. 1998 Compact phase-conjugating correlator: simulation and experimental analysis, Journal of Applied Optics,37 20 4380 4388 0155-9128 X - 21.
Sharp J. K. Mackay N. E. Tang P. C. Watson I. A. Scott B. F. Budgett D. M. Chatwin C. R. Young R. C. D. Touda S. Huignard J. P. Slack T. G. Collings N. Pourzand A. R. Duelli M. Grattarola A. Braccini C. 1999 Experimental systems implementation of a hybrid optical- digital correlator, Journal of Applied Optics,38 29 6116 6127 0155-9128 X - 22.
Su H. J. Karim M. A. 1998 Performance improvement of a phase-shifting joint transform correlator by use of phase-iterative techniques, Journal of Applied Optics,37 17 3639 3642 0155-9128 X - 23.
Su J. H. Karim M. A. 1999 Phase-shifting joint transform correlation with phase- iterative algorithm: effect of the dynamic range limit, Journal of Applied Optics,38 26 5556 5559 0155-9128 X - 24.
Takahashi T. Ishii Y. 2006 Laser-Diode Phase-Shifting Joint-Transform Correlator with Multiple-Object Recognition, Optical Review,13 2 53 63 1340-6000 - 25.
Takahashi T. Ishii Y. 2010 All Optical Phase-Only Filtering Correlation with Binarized Inputs by a Ferroelectric Liquid-Crystal Spatial Light Modulator, Optical Review,17 3 195 203 1340-6000 - 26.
Vlad V. I. Zaciu R. Miron N. Maurer J. Sporea D. 1976 Prelucrarea optică a informaţiei-Optical Information Processing (in romanian language), Editura Academiei Republicii Socialiste Romania, Bucuresti, Romania - 27.
Willett P. Javidi B. Lops M. 1998 Analysis of image detection based on Fourier plane nonlinear filtering in a joint transform correlator, Journal of Applied Optics,37 8 1329 1341 0155-9128 X - 28.
Zhang S. Karim M. A. 1999 Illumination-invariant pattern recognition with joint-transform-correlator-based-morphological correlation, Journal of Applied Optics,38 35 7228 7237 0155-9128 X - 29.
Zhong S. Jiuxing J. Liu S. Li C. 1997 Binary joint transform correlator based on differential processing of the joint transform power spectrum, Journal of Applied Optics,36 8 1776 1780 0155-9128 X