Skin temperature mean in each block of different people when ambient temperature is Te =25.9
Face recognition, being straightforward, passive and non-invasive comparing with other biometrics such as fingerprint recognition (Yang et al., 2006; Yang & Park, 2008a; Yang & Park, 2008b), has a nature place in biometric technology and computer vision. Currently, most researches on face recognition focus on visual images. The reason is obvious: such sensors are cheap and visual images are widely used. The key problem in visual face recognition is to cope with different appearances due to the large variations both in intrinsic (pose, expression, hairstyle etc) and extrinsic conditions (illumination, imaging system etc). It is difficult to find the unique characteristics for each face, and it is accordingly not easy to develop a reliable system for face recognition by using visual images.
Infrared face recognition, being light-independent and not vulnerable to facial skin, expressions and posture, can avoid or eliminate the drawbacks of face recognition in visible light. Some methods (Buddharaju, et al, 2004, Chen, et al, 2005 , Kong, et al, 2005, Wu, et al, 2005A) based on thermal images are proposed for infrared face recognition in last decade. It is highlighted that the infrared images which are the character of the human skins can be affected by ambient temperature, psychological, as well as physiological conditions. Therefore, the recognition systems based on thermal images have the problem that achieving high performance when the test and train images are captured in the same ambient temperature, while the performance is poor if the test and train samples are collected under different temperature (time-lapse data).
To improve the performance on time-lapse data, it is important to normalize the training images and test images. Linear gray transform and histogram equalization are two common methods for image normalization. However, these approaches change the grayscales which represents the skin temperature so that the thermal images have no physical significance. Therefore, both methods are not suitable for normalization of infrared facial images.
In this chapter, we dedicate to provide a novel study on normalization of infrared facial images, especially resulting from variant ambient temperatures. Three normalization methods are proposed to eliminate the effect of variant ambient temperatures. The experimental results show that the proposed methods can increase the robustness of infrared face recognition system and greatly improve its performance on time-lapse data.
The organization of the chapter is as below. In section 2, effect of ambient temperatures on thermal images is analyzed. Three normalization methods are presented in Section 3. An improved Normalized Cross Correlation (NCC) for similarity measurement is proposed in Section 4. A variety of experiments on normalized images are performed in Section 5, and the conclusions are drawn in Section 6.
2. Effect of ambient temperatures on facial thermal patterns
Prokoski et al. (1992) indicated that humans are homoiotherm and capable of maintaining constant temperature under different surroundings. However, it should be highlighted that the so-called ”homoiotherm” only refers to the approximately constant temperature in deep body (i.e., the core temperature), whereas the skin temperature distribution fluctuates with ambient temperature, changes from time to time, as shown in Houdas & Ring(1982), Guyton & Hall (1996), Jones & Plassmann (2000). The infrared camera can only capture the apparent temperature instead of deep temperature. It is pointed by Housdas & Ring (1982) that the variations in facial thermograms result from not only external conditions, such as environmental temperature, imaging conditions, but also various internal conditions, such as physiological and psychological conditions. Socolinsky & Selinger (2004A, 2004B) also studied such variations. It is necessary to learn how the thermal patterns vary in different conditions.
Wu et al. (2005A, 2007) have illustrated that variations in ambient temperatures significantly change the thermal characteristics of faces, and accordingly affect the performance of recognition. It is indicated by Professor Wilder et al. (1996) that the effect of ambient temperatures on thermal images was essentially equivalent to that of external light on visible images. We only consider the effect of ambient temperatures on thermal images in the following analysis.
To study the characteristics of thermal images under different ambient temperatures Te, a sequence of images are captured and then divided into un-overlap blocks as shown in Figure 1. We can then obtain the mean values from each blocked image.
Table 1 shows the temperature mean in each block of different people in the identical ambient temperature. It is seen that different people have different thermal distributions. Table 2 indicates the means of skin temperatures in each blocked images of the same person. As is shown, the temperatures in each blocked images increase if the ambient temperatures increase. It reveals that the skin temperature variations and the ambient temperatures variations have the same tendency for the same person. In other hand, the skin temperature variations of different people have the same tendency when ambient temperature changes.
|Block||Image 1||Image 2||Image 3||Image 4|
What’s more, with the increasing of ambient temperature, the skin temperature variations of each blocked images are different. In order to study the relation between skin temperature variations of each blocked images and ambient temperatures, the maximum and minimum values in thermal images are used to analyze.
As shown in Figure 2, ten thermal images are collected in ambient temperatures 26ºC and 28.5ºC respectively. The maximal and minimal temperature in thermal images, are relatively stable. They all increase when ambient temperatures rise from 26ºC to 28.5ºC. It’s worth noting that the variations of maximums are lower than those of minimums, which implies the temperature variations are different in blocks.
In Figure 3, the input image (a) and reference image (b) (the thermal images of same person) were captured when ambient temperatures are 26ºC and 28.5ºC, which are geometrically normalized to size 80X60. The background areas are set to 0.000001ºC. Image (c) is the subtraction of image (b) and image (a). According to the characteristics of thermal images, the point which is brighter (white) has higher gray value, while the gray value of dark point is lower. Through observation of the images captured under different ambient temperatures, reference image, and their subtracted images, we find:
The temperatures of forehead and nose parts are higher than those of the facial edge, hair. Moreover, the higher the skin temperature, the smaller its temperature variations while ambient temperature changing;
In a thermal image, the difference of skin temperature among different parts is about 10ºC;
Due to the effect of breathing pattern, the temperature fluctuation in mouth part is larger than any other parts.
3. Normalization of infrared facial images
Wilder et al. (1996) illustrated that the effect of ambient temperatures on thermal images is essentially equivalent to that of external light on visible images. Therefore, analogous to the methods solving illumination in visible face recognition, the methods to eliminate ambient temperatures variations can be divided into the following three kinds: the methods based on transformation, invariant features and temperature model.
In order to eliminate the effect of ambient temperatures on infrared imaging and improve the robustness and recognition performance, we convert the image captured in unknown ambient temperature into the reference ambient temperature. This is the so-called normalization of infrared images, which is the key issue addressed in this section.
The traditional infrared image normalization methods are classified as linear gray transform methods ( Chen, et al, 2005 , Zhang, et al, 2005) and equilibrium methods (Song, et al, 2008). The former methods expand the range of gray levels to a dynamic range, with some linear methods. These methods aim to strengthen the detail characteristics of images. The second equilibrium methods are also dedicated to improve the information which people are interested in. These methods are not suitable for application to infrared image normalization, because these methods change the distribution of facial temperatures. Kakuta et al. (2002) have proposed a method for infrared image normalization which utilizes the skin temperature subtraction to convert infrared images, as shown in Figure 4.
This method can convert IR images obtained under various thermal environments into those under reference environments, in which the IR images contain foot, chest, hand and forearm. Every point in image gets the same offset. However, it should be noted that the offset of each point in face is different, as indicated in Section 2. Such processing will lead to wrong temperature variation. Therefore, we should use different normalized methods to deal with the infrared facial images. Three methods will be introduced below separately.
3.1. Normalization based on transformation (TN)
Normalization of infrared facial images based on transformation aims to reduce the effect of ambient temperatures using transformation. The proceedure using the block and least squares method is explained in details in Wu, et al, 2010. Firstly, the images collected under different ambient temperatures were divided into some block images, the values of the change of ambient temperatures and the change of corresponding temperature in face were obtained, which were used to be fitted into a function through the least squares method. Then, each
block image was normalized by the relevant function and the whole image could be normalized into the reference condition. Specific processes are as follows.
Obtaining images captured in every different ambient temperatures and blocking them;
For each image containing blocks, the means of reference images and their mean are calculated. With this calculation, the means of reference images in different ambient temperatures and their mean are obtained;
In formula (1), is the variation of ambient temperatures and is variation in facial temperature.represents theth blocked image in image under different ambient temperatures, while is the image after temperature normalization.
Each blocked image can be normalized by the same process. With these executions, the thermal image captured in unknown ambient temperature can converted into the reference environmental temperature.
3.2. Normalization based on invariable features (IFN)
Principal Component Analysis (PCA) is one of the most successful feature extraction methods, which frequently applied to various image processing tasks. The magnitudes of eigenvalues correspond to the variances accounted for the corresponding component. For example, discarding principal components with small eigenvalues is a common technique for eliminating small random noise. It has been theoretically justified from the point of spherical harmonics of view that eliminating three components with largest eigenvalues has been used to handle illumination variations (Ramamoorthi, 2002). In addition, the traditional PCA algorithm operates directly on the whole image and obtains the global features, which are vulnerable to external ambient temperatures, psychological and physiological factors. Such global procedure cannot obtain enough information because the local part of the face is quite different. It is necessary to extract more detailed local features. Xie et al. (2009) proposed a weighted block-PCA and FLD infrared face recognition method based on blood perfusion images, which highlighted the contribution of local features to the recognition and had a good performance when the test samples and training samples were captured in the same environmental temperatures.
Temperature normalization using block-PCA is proposed in this section (Lu, et al, 2010). In order to take full advantage of both the global information and the local characteristics of facial images, the images are partitioned into blocks. Then PCA is performed on each block and the component corresponding to the biggest eigenvalue of each block is discarded to eliminate the ambient effect.
3.2.1. PCA feature extraction
The main idea of PCA algorithm is using a small number of characteristics of features to describe the samples, reduce feature space dimension while the required identifying information are reserved as much as possible. These steps are briefly descripted below.
184.108.40.206. PCA algorithm
Assume that a set of N training samples is given, where is column vector which connects the rows of image matrix, u is the number of pixels.
The average face of the training set is as follow:
The distance from each face to the average face is as follow:
We assume the matrix and the corresponding covariance matrix is. The next task is to select the v largest eigenvalues with corresponding eigenvectors and form a projection space. Then the original image vector of dimension u is projected to the space of dimension v, where u>v. The eigenvector after projection is as follow:
There are usually two methods to determine the number of principal components, which is the number v of eigenvalues of the covariance matrix.
The first method is determined by the pre-specified information compression ratio ():
Where is the eigenvalues of.
The second method that directly reserves the largest eigenvalues is a common method, where.
220.127.116.11. Improved PCA algorithm
Although the eigenvalues and their corresponding eigenvectors produced by PCA take into account all the differences between the images, it cannot distinguish these differences caused by the human face or by the external factors. If the test samples and training samples are not collected at the same time, the ambient temperatures, psychological and physiological factors will affect the infrared image greatly when the images are collected. Some variations affected by these factors will arise in the principal components with the corresponding eigenvectors. In addition, each principal component represents a kind of image feature. Some of these features pertain to the subjects depicted in the thermal images, and others represent irrelevant sources of image variations. Thus, it is natural to handle these principal components to reduce the effects caused by the ambient temperatures, psychological and physiological factors on infrared images.
After determining the number of principal component, eliminating three components with largest eigenvalues has been used to handle illumination variations (Ramamoorthi, 2002). Although the effect of ambient temperatures on thermal images is essentially equivalent to that of external light on visible images, it cannot ensure the three components with the largest eigenvalues in thermal images must contain the useless information for identification. The principal components may contain feature information which distinguishing the images from different classes. Therefore, a method calculates the standard deviation of each principal component is proposed to estimate the effects of ambient temperatures, psychological and physiological factors on each principal component. The procedure describes as follows:
After obtaining the projection matrix, ten test samples, as an example, of the same person are extracted and projected into the projection matrix. The images with high dimension are mapped to the space with low dimension. Then the standard deviation of each principal component belongs to the test samples are calculated and shown as follows:
We can see from the standard deviations of principal components that the standard deviations of the first, 7th and 8th principal component are very large in comparison with other standard deviations of the ten principal components, which means the image information in these three principal component change much more strenuous with the variations of ambient temperatures, psychological and physiological factors. These factors account for the differences between test samples and training samples collected in different time. The standard deviations of the second and the third principal component are smaller than any other standard deviations of the ten principal components. The useful feature information for identification will lose if these two principal components are discarded. For a human face, the energies of first three or five components are 90% and 94% respectively. The remaining principal components contain less information comparing with the preceding principal components. Therefore, instead of discarding three components with largest eigenvalues, the component with the biggest eigenvalue is discarded in this section to reduce the effects of the ambient temperatures, psychological and physiological in infrared images.
The v largest eigenvalues and their corresponding eigenvectors are obtained by the covariance matrix S firstly. Then the eigenvector corresponding to the biggest eigenvalue is removed and the reminders of the v-1 eigenvalues with the eigenvectors constitute a new projection space. In this way, the original u-dimensional images vector project onto the (v-1)-dimensional space have better separability. The eigenvector after projection is as follows:
3.2.2. Feature extraction by block-PCA
Traditional PCA algorithm is performed on the original images directly, which obtained are the global features. For IR images, only partial areas of the temperature information on the face change obviously when ambient temperature changes. The global features extracted tend to strengthen the part changes obviously, and neglect some other information representing the image classes.
Apart from the effect of uniform illumination on visible recognition system, the infrared images affecting by ambient temperatures are an overall process; all the human faces would change with the variation of the ambient temperatures. In fact, different parts of IR images have different variations with the changes of ambient temperatures. It does not primarily eliminate the effect of environment on the local thermal images by removing the largest principal component of the whole images.
Therefore, the idea of block-PCA is proposed to extract local features. It is assumed that a set of N training samples is given. Each image is divided into blocks and the size of each sub-block is.
Using the method of information compression ratio, the eigenvectors of each block image is obtained and then the eigenvector with the largest eigenvalues of each block image is discarded. The remainders of eigenvectors of each block constitute a new eigenvector to be used for feature extraction.
The new combined eigenvector not only reflects the global features of images, but also reflects the global features. Moreover, it can reduce the effects of ambient temperatures, physiological and psychological factors, which enhance the robustness of the recognition system.
3.3. Normalization based on temperature model (TMN)
3.3.1. 0-1 normalization
As aforementioned in section 2, the higher the skin temperature, the smaller its temperature variation while ambient temperature changes. Therefore, every point in thermal image may have a weight coefficient to represent its own temperature variation weight. The high temperature corresponds to small weight coefficient while the low temperature point has a high weight coefficient. Therefore, a weighted linear normalization method of infrared image is proposed in this section (Wu, et al 2010). The weight coefficients of each point in the face are first obtained through the improved 0-1 standardization. Then each coefficient is used to temperature normalization.
Assume a sample, where is one of the skin temperature in face.
In which wk is corresponding to weight coefficient while ambient temperature changes, ,,.
As shown in the formula (8), the point whose skin temperature is high has a high gray value, which results in small weight coefficient, and vise reverse.
3.3.2. Temperature normalization based on maximin model
Suppose two thermal imagesandto be captured in different ambient temperatures, in whichis collected in reference temperature whileis gathered in unknown temperature. State parameters are extracted fromand the ambient temperatureis obtained. Then the difference of ambient temperature of the two thermal images is calculated., each point in test image has the following calculation:
The result obtained by the maximin model is anastomotic with the analysis shown in Section 2.
4. Improved Normalized Cross Correlation for similarity measurement
The purpose of temperature normalization is to convert the images captured under different environmental temperature into the images collected in reference temperature. The images after temperature normalization have the similar ambient temperature and temperature distribution. Therefore, the similarity of test image and reference image can be used to test the performances of the proposed normalization methods.
Normalized Cross Correlation (NCC) is defined as follow:
where S is the reference image, T is the test image. Both S and T have same size m×n. Obviously, higher value of NCC indicates more similar of the two images. NCC equals to 1 when the two images are completely same.
It is noted that the NCC method is pixelwised, which requires that the images have the exact location for measurement. As faces are 3D moving subjects, which poses 3D rotation and deformation, it is impossible to achieve exact one-to-one correspondence computation even the two faces images are well segmented. Namely, the point corresponding to nose in test image is not the same location of nose in reference image. Therefore, the traditional NCC method is not accurate for face measurement. An improved NCC method is proposed to test the normalized performance.
Assuming S, T are normalized reference image and normalized test image. Each image is blocked into and M is the mean of block image. The improved NCC is defined as follow:
In which is the mean temperature value of one block image, Te1 is the ambient temperature when reference image S is captured. If, It reveals that the test image has more similar temperature distribution with reference image. On the other hand, it also shows the temperature normalized method can effectively reduce the effect of external temperature on thermal facial recognition system.
5. Experimental results and discussions
5.1. Infrared database
Currently, there is no international standard infrared facial database stored in temperature. We captured the infrared images using the ThermoVision A40 made by FLIR Systems Inc. The training database comprises 500 thermal images of 50 individuals which were carefully collected under the similar conditions: environment under air-conditioned control with temperature around 25.6～26.3ºC. Each person stood at a distance of about 1 meter in front of the camera. The ten templates are: 2 in frontal-view, 2 in up-view, 2 in down-view, 2 in left-view, and 2 in right-view. As glass is opaque to long-wavelength infrared, subjects are required to remove their eyeglasses in database collection. The original resolution of each image is 240×320. The size turns to be 80×60 after face detection and geometric normalization, as illustrate in Figure 5 and Figure 6.
Time-lapse data were collected from one month to six months. There are totally 85 people involved, some are included in training subjects and some are not. This time, the subjects are allowed to wear eyeglasses. The data were acquired either in air-conditioned room from morning (24.5~24.8ºC), afternoon (25.4~26.3ºC) to night (25.4~25.8ºC) or outdoor, where the ambient temperature was 29.3~29.5ºC. It is noted that these data were collected on the condition that the subjects had no sweat. The total test images is 1780.
5.2. NCC experimental results
In Figure 7, image (c) represents the normalized image. It is shown that the test image and its normalized image look similar that it is difficult to judge the effect of the proposed normalization method.
We can see from Figure 8 that the NCC after TMN method is higher than the original NCC without normalization.
Further, we verify the performance of normalization via recognition rate. In our expreiments,,linear least squares model and polynomial least squares modedl are applied to verify the influence of different matching curves (Wu, S. Q. et al, 2010a). After temperature normalization, the thermal images are converted into simplified blood perfusion domain to get stable biological features (Wu et al, 2007).
Whereis the temperatures in images, , , are the core temperature and ambient temperature respectively. is the blood perfusion, ,,and are four constants.
To extract features, PCA and discrete wavelet transform (DWT) (Wu, S. Q. et al, 2009) are chosen to test the effect of normalization. 3NN and Euclidean distance are selected in classification and recognition. Experimental results are shown in the Table 3.
5.3. Experimental results using TN method
It is shown in Table 3 that the normalization using block size 80×60 did not improve the performance, because the skin temperature variations vary with different facial parts. However, the normalized methods using small blocks have better performance, whatever which method are chosen. Moreover, the normalized method using linear least squares achieves better performance than that using polynomial least squares. It is highlighted that smaller block size results in higher performance. But the computation increases along with the increasing of the number of block images.
|Feature extraction method||Block size||Pre-normalization||Linear normalization||Polynomial normalization|
We also using 2D linear discriminant analysis (2DLDA) (Wu, S. Q. et al, 2009), DWT+PCA (Wu, S. Q. et al, 2009), and PCA+LDA (Xie, Z. H et al, 2009) for feature extraction. The recognition rates with or without normalization are shown in Table 4.
|Feature extraction method||Block size||Pre-normalization||Polynomial normalization|
As shown is Table 4, no matter which feature extraction method is chosen, the thermal recognition system after temperature normalization has better performances.
5.4. Experimental results using IFN method
As shown in Table 5 that the recognition performance has remarkable improvement by getting rid of the principal components with the largest eigenvalues. Moreover, the performance by discarding three principal components with the largest eigenvalues is better than that discarding two principal components with the largest eigenvalues, which verifies that the second principal component is useful for identification and it cannot be discarded. These results agree with the previous work in Section 3.2.
As can be seen from Figure 10, the recognition rate withis only 3.64 percentages lower than that with. Hence, the block-PCA experiments are operated in and. Sub-block size is chosen to be 40×30，20×30，20×20 and 10×10. Since the positions in each block-image are different and their temperature changes are different with the variations of the ambient temperatures, the method that determined the number of eigenvectors in this chapter is information compression ratio. Accordingly, the total number of eigenvalues after the PCA operation on each block-image is different. After the block-PCA performed, and the component with the biggest eigenvalue of each block-image is discarded, the reminders of the eigenvectors combine into a new eigenvector by the tactic of the block-images and are used for recognition.
It is shown in Table 6 and Table 7 that the recognition performance is best when the size of the sub-block is 20×30 in the case of and. This is because the own temperature change of each sub-block is different with the change of the ambient temperatures. The different parts in the face can locate well in each sub-block when the size of sub-block is 20×30.
In order to show the contribution of the block on recognition rate, the results using block and without using block are shown in Figure 11. No matter how much is the information compression rate, the performance when use the block is better than those without using block. It is seen from all the experimental results that the recognition rates using block-PCA and discarding the principal components are significantly higher than those using PCA directly. Therefore, the method using block-PCA proposed here can fully utilize the local characteristics of the images and thus improve the robustness of the infrared face recognition system.
5.5. Experimental results using TMN method
As shown in Table 8, the normalized method without weighting coefficients does not improve the performance. This is because the skin temperature variations of some points in thermal images are lower than the ambient temperature variation. Such processing leads to great change of skin temperatures, especially in the parts with high temperatures. By using the normalized method with weighted coefficients, the performances are all improved greatly, no matter which kind of features are extracted. It illustrates the normalization method using maximin model can improve the robustness and performance of the system.
|Feature extraction method||Pre-normalization||Normalization without weigh coefficient||Normalization with weigh coefficient|
This chapter dedicates to eliminate the effect of ambient temperatures on thermal images. The thermal images of a face are severely affected by a variety of factors, such as environmental temperature, eyeglasses, hairstyle and so on. To alleviate the ambient temperatures variations, three methods are proposed to normalize thermal images. The normalization based on transformation uses the function obtained by the block and least squares method. Features which do not change with ambient temperatures are extracted. The third method aims to normalize image through the maximin model. The extensive experiments demonstrated that the recognition performances with temperature normalization are substantially better than that with no temperature normalization process.
It should be highlighted that psychological (e.g., happy, angry, and sad etc) and physiological (e.g., fever) conditions also affect the thermal patterns of faces. How to analysis and eliminate these variations will be our future work.
This work was partially supported by the National Natural Science Foundation of China (No. 61063035), and it is also supported by the Merit-based Science and Technology Activities Foundation for Returned Scholars, Ministry of Human Resources of China.