Face Recognition Using Uncorrelated, Weighted Linear Discriminant Analysis Yixiong Liang, Weiguo Gong, Yingjun Pan, and Weihong Li Key Lab of Optoelectronic Technology & Systems of Education Ministry of China, Chongqing University, Chongqing 400044, China {yxliang, wggong, pyj, weihongli}@cqu.edu.cn

Abstract. In this paper, we propose an uncorrelated, weighted LDA (UWLDA) technique for face recognition. The UWLDA extends the uncorrelated LDA (ULDA) technique by integrating the weighted pairwise Fisher criterion and nullspace LDA (NLDA), while retaining all merits of ULDA. Experiments compare the proposed algorithm to other face recognition methods that employ linear dimensionality reduction such as Eigenfaces, Fisherfaces, DLDA and NLDA on the AR face database. The results demonstrate the efficiency and superiority of our method.

1

Introduction

Face recognition has a wide range of applications, such as face-based video indexing and browsing engines, biometric identity authentication, human-computer interaction, and multimedia monitorring/surveillance. Within the last decades, numerous novel FR algorithms have been proposed [1]. A central issue to this approaches is the feature exaction. The most well-known technique for linear feature extraction is the linear discriminant analysis (LDA). Its basic idea is to seek an optimal set of discriminant vectors W = [w1 , . . . , wl ] by maximizing the Fisher criterion JF (W) = tr[(WT Sw W)−1 (WT Sb W)] , (1) c  c where, Sb = i=1 pi (mi − m)(mi − m)T and Sw = i=1 pi Si are the betweenand within-class scatter matrices, respectively; m is the mean of all samples and mi is the mean of class i with prior probability pi ; Si is the covariance matrices of class i. Uncorrelated features are usually desirable in pattern recognition because an uncorrelated feature set is likely to contain more discriminatory information than a correlated one. Recently, Jin et al. [2] proposed the uncorrelated LDA technique (ULDA), which tries to find the optimal discriminant vectors by maximizing the Fisher criterionunder the conjugated orthogonal constrains: n wjT St wi = 0, (i = j), where St = i=1 (xi − m)(xi − m)T denotes total scatter matrix. Therefore, ULDA can extract a set of statistically uncorrelated discriminant features with better discriminant power as shown experimentally in Ref. [2]. However, the ULDA technique still has some deficiencies as follows: first, it still suffers from the so-called small sample size problem (SSSP) which is S. Singh et al. (Eds.): ICAPR 2005, LNCS 3687, pp. 192–198, 2005. c Springer-Verlag Berlin Heidelberg 2005 

Face Recognition Using Uncorrelated, Weighted LDA

193

often encountered in face recognition; second, the Fisher criterion that ULDA maximized isn’t optimal for a c-class (c > 2) classification problem in that it overemphasizes the larger distance between classes and causes large overlaps of neighboring classes. In this paper, we propose an uncorrelated, weighted linear LDA (UWLDA) technique to solve the above problems. The reminder of this paper is organized as follows. The related work on LDA is described in section 2. Then our UWLDA is presented in section 3. Experiments are reported in section 4 and finally section 5 concludes this paper.

2 2.1

Related Work ULDA

Suppose that Sb is a positive semi-definite matrix and Sw is a positive definite matrix. The first ULDA discriminant vector, denoted by w1 , is calculated as the first eigenvector corresponding to the maximal eigenvalue of the generalized eigenequation Sb w = λSw w. Suppose that r ULDA discriminant vectors Wr = [w1 , . . . , wr ] have been obtained. Then the (r + 1)th discriminant vector wr+1 can be taken as the eigenvector corresponding to maximum eigenvalues of the generalized eigenequation: MSb w = λSw w, where M = −1 I − St Wr (WrT St S−1 WrT St S−1 w St Wr ) w . 2.2

NLDA

In many practical face recognition tasks, there are not enough samples to make Sw nonsingular and then both LDA and ULDA suffer from the well-known SSSP which arises whenever the number of available samples is smaller than the dimensionality of the samples [3]. The traditional solutions to this problem is to project all samples onto a subspace, as it was done for example in Fisherfaces [4], where the resulting within-class scatter matrix is no longer singular. However, Chen et al. [3] proved that the nullspace of Sw , denoted by N (Sw ), contains the most discriminant information when a SSSP takes place. Based this finding, they proposed an enhanced LDA that we refer to as NLDA, to extract the most discriminant information for recognition. Intrinsically, NLDA tries to find a transform W who satisfies WT Sw W = 0, WT Sb W = Λ . 2.3

(2)

WLDA

In Fisher criterion, as discussed in [5] and [6], all class pairs have the same weights irrespective of their separability in the original space and the resulting transformation will preserve the distances of already well-separated classes, causing a large overlap of neighboring classes. Two similarly motivated solutions to this problem have been proposed: weighted pairwise Fisher criteria [5] and fractionalstep LDA [6]. Although quite effective [6], the fractional-step LDA is iterative

194

Y. Liang et al.

and very time-consuming. The other solution, weighted pairwise Fisher criteria, is more easily to implement and we refer the resulting algorithm as weighted LDA or WLDA. In WLDA, the between-class scatter matrix Sb in Fisher criterion is replaced by the following weighted between-class scatter matrix ˆb = S

c−1  c 

pi pj w(dij )(mi − mj )(mi − mj )T ,

(3)

i=1 j=i+1 d

where w(dij ) = 2d12 erf ( 2√ij2 ) is the weighting function that depends on the ij Mahanalobis distance dij between the classes i and j.

3

UWLDA

From the discussion in the previous section, it would certainly be desirable to exploit the benefits of ULDA, NLDA and WLDA. There are, however, a potential contradiction: the primary motivation for NLDA is the preservation of the N (Sw ), while in N (Sw ) both ULDA and WLDA will break down. In the remainder of this paper, we intend to solve this problem and propose a new combining algorithm. To begin, as suggested in Ref. [7], we project all samples onto the subspace Ω adopted in NLDA where the resulting between-class scatter matrix ¯w is zero. This ¯b is full rank while the resulting within-class scatter matrix S S subspace can be easily determined by removing the nullspace of St first and then extracting the nullspace of the intermediate within-scatter matrix. Notice that in Ω the Mahanalobis distance dij between the classes i and j is undefined and ˆ b is intractable. then the calculation of weighted between-class scatter matrix S As our solutions to this question, we first alter dij so that it is equal to Euclidean distance because in Ω the distribution of each class is exactly a point and the similarity between classes can be easily measured by Euclidean distance. Then we simply set the weighting function w(dij ) = (dij )−k , k > 0 ,

(4)

where k is the parameter. The rationale behind this is that classes which are closer together are more likely to have more confusion and should therefore be more heavily weighted. Now we turn our attention to combine the WLDA and ULDA in the subspace Ω. Notice that in Ω the Fisher criterion or weighted pairwise Fisher criterion is not longer in effect for giving an arbitrary vector w ∈ Ω, the Fisher criterion will definitely reach infinite. Motivated by NLDA and WLDA, we introduce a new criterion ˆ b w, w ∈ Ω . (5) J(w) = wT S ¯b , then the Moreover, as the total scatter matrix in Ω is definitely equal to S conjugated orthogonal constraint is equal to ¯b wi = 0 (i = j) . wjT S

(6)

Face Recognition Using Uncorrelated, Weighted LDA

195

By maximizing (5) under the constrain (6), we derive a novel UWLDA technique. Likewise, in UWLDA the first optimal discriminant vector w1 is the eigenvector ˆ b . In order to obtain the other corresponding to the largest eigenvalues of S directions, we introduce the following theorem. Theorem 1. The (r + 1)th desired optimal discriminant vector wr+1 is the eigenvector corresponding to maximum eigenvalues of the following eigenequation: ˆb w = λw , (7) MS where

¯b Wr (WrT S ¯bS ¯b Wr )−1 WrT S ¯b , M = I−S

(8)

Wr = [w1 , · · · , wr ] .

(9)

Proof. It is noted that besides the constraints (6), wr+1 should be normalized, i.e. T wr+1 = 1 . (10) wr+1 Therefore, the Lagrange function can be expressed as T ˆ T T ¯ L(wr+1 ) = wr+1 Sb wr+1 − λ(wr+1 Sb Wr U , wr+1 − 1) − wr+1

(11)

where U = [u1 , · · · , ur ]T . Set the derivative of L(wr+1 ) with respect to wr+1 equal to zero, namely ˆb wr+1 − 2λwr+1 − S ¯ b Wr U = 0 . 2S

(12)

¯ b , we obtain Multiplying the left-hand side of the above equation by WrT S

Thus we have

¯ bS ˆ b wr+1 − WrT S ¯bS ¯ b WrT U = 0 , 2WrT S

(13)

¯bS ¯b Wr )−1 WrT S ¯b S ˆ b wr+1 . U = 2(WrT S

(14)

Substituting Eq. (14) into Eq. (12), we will obtain ¯ b Wr (WrT S ¯b S ¯ b Wr )−1 WrT S ¯b )S ˆb wr+1 = λwr+1 . (I − S

(15)

Therefore, Eq. (7) is obtained. ˆb with the unIt is worth noting that in our UWLDA, if we substitute S weighted one, the resulting method is the generalization of ULDA to the SSSP cases. Moreover, one can easily prove that the resulting method is definitely equal to the original NLDA because the optimal discriminant vectors derived from NLDA can satisfied the conjugated orthogonal constraint (6). However, in our UWLDA, due to integrating the weighted pairwise Fisher criterion to replace ˆb . Therefore, ¯b into another matrix S the original Fisher criterion, i.e. we change S the traditional NLDA solution based on the weighted pairwise Fisher criterion cannot guarantee the derived discriminant features are statistically uncorrelated, whereas the proposed UWLDA technique that combines the ULDA, NLDA and the weighted pairwise Fisher criterion can obtain the statistically uncorrelated features.

196

4

Y. Liang et al.

Experimental Results

In this section, we present experimental results on a subset of AR face database [8] using our method and comparing with the performance of several popular subspaces projection-based schemes such as Eigenfaces [9], Fisherfaces [4], DLDA [10] and NLDA [3]. This subset contains 1652 non-occluded images corresponding to 118 persons with changes in facial expression and illumination conditions, and images taken in two sessions two weeks apart. For illustration, some available images for one subject are shown in Fig.1. To begin, we convert the RGB images to gray scale ones by adding all threecolor channels. Later, each image is scaled, translated, rotated and cropped to a size of 53 × 56 to obtain a ”face” which includes only the middle portion of the face images. We also apply the histogram matching technology to images as photometric normalization. Subsequently we smooth them with a Gaussian filter (3 by 3 with sigma 1) for noise reduction and globally normalize them to have zero mean and unit standard deviation. As the regions on the two sides of the chin are usually not important but the magnitude of the summation vectors there may be large, we removed these regions by an elliptical mask. The preprocessed images of one person are shown in Fig.2. In our experiments, the simple nearest center classifier (NCC) is adopted to recognize the unknown face images by using L2 norm as the distance measurement. In our UWLDA technique, the optimal parameter kopt can be found by searching highest accuracy over the variation of k. We randomly select 5 images per subject for training and the remains for testing. The results of different k within the range from 1 to 10 are shown in Table 1, from which kopt = 4 is obtained for the following comparative experiments.

Fig. 1. Some face samples of one subject from the AR database

Fig. 2. The preprocessed images

Face Recognition Using Uncorrelated, Weighted LDA

197

Table 1. Recognition accuracy corresponding to different k in UWLDA (%) k

1

2

3

4

5

6

7

8

9

10

Accuracy 93.34 93.83 94.03 94.88 93.29 92.31 91.42 89.09 85.41 81.14

90

80 70 60 50 40

UWLDA NLDA DLDA Fisherfaces Eigenfaces

30 20 20

40

60

80

Dimensionality

(a)

100

120

Recognition accuracy (%)

95

90 Recognition accuracy (%)

100

85 80 75 70 65

UWLDA NLDA DLDA Fisherfaces Eigenfaces

60 55 20

40

60 80 Dimensionality

100

120

(b)

Fig. 3. Comparative recognition performance. (a) Under the condition of variations in illuminance. (b) Under the condition of variations in facial expression.

In the next experiment, our aim is to compare Eigenfaces, Fisherfaces, DLDA, NLDA and the proposed UWLDA under varying illumination conditions. For each subject, we select two images with normal lighting condition corresponding to the first column in Fig.2 for training and 6 images with varying light conditions corresponding to the rightmost 3 columns in Fig.2 for testing. In total,we have 236 training samples and 708 testing samples. Fig. 3(a) shows the recognition accuracy under a varying number of selected features. This figure indicates that the performance of those LDA-based methods (Fisherfaces, DLDA, NLDA and UWLDA)is much better than PCA-based Eigenfaces under conditions where lighting is varied. In general, UWLDA performs better than the other methods. The last experiment is performed to evaluate the performance of different methods under the condition of variations in expression. As in the previous experiment, we still select two neutral images per subject for training. The probe set comprises 4 images for each person which involve variations in facial expressions (smile and angry). Thus, the total number of training samples is 236 and the number of testing samples is 472. The experiment results are shown in Fig.3(b), with the recognition accuracy against number of selected features. Again, the proposed UWLDA obtains the best performance when more than 28 features are used.

5

Conclusions

In this short paper, we present a novel LDA-based subspace projection method for face recognition that unifies nullspace LDA (NLDA), uncorrelated LDA

198

Y. Liang et al.

(ULDA) and weighted pairwise Fisher criteria (WLDA) in a single algorithm we refer to as uncorrelated, weighted LDA (UWLDA). This approach can extract the most discriminatory features which are statistically uncorrelated. Experimental results indicate that under the conditions of varying in illuminance and facial expressions, UWLDA performs better than other state-of-the-art methods such as Eigenfaces, Fisherfaces, NLDA and DLDA in terms of classification accuracy.

Acknowledgements This work is supported by the Scientific Technology Key Project of Ministry of Education (02057) and Key Project of Chongqing Natural Science Foundation (CSTC2005BA2002, CSTC2005BB2181), China.

References 1. Zhao, W., Chellappa, R.,Rosenfeld A., Phillips P. J.: Face Recognition: A Literature Survey. ACM Computing Survey. 35 (2003) 399–458 2. Jin, Z., Yang, J. Y., Hu, Z. S., Lou, Z.: Face Recognition Based on the Uncorrelated Discriminant Transformation. Pattern Recognition. 34 (2001) 1405–1416 3. Chen, L. F., Liao, H. Y., Lin, J. C., Ko, M. T., Yu, G. J.: A New LDA-based Face Recognition System Which Can Solve the Small Sample Size Problem. Pattern Recognition. 33 (2000) 1713–1726 4. Belhumeur, P. N., Hespanha, J. P., Kriegman, D. J.: Eigenfaces vs. Fisherfaces: Recognitin Using Class Specific Linear Projection. IEEE Trans. Pattern Anal. Machine Intell. 9 (1997) 711–720 5. Loog, M., Duin, R. P. W., Haeb-Umbach, R.: Multiclass Linear Simension Reduction by Weighted Pairwise Fisher Criteria. IEEE Trans. Pattern Anal. Mach. Intell. 23 (2001) 762–766 6. Lotlikar, R., Kothari, R.: Fractional-Step Dimensionality Reduction. IEEE Trans. Pattern Anal. Mach. Intell. 22 (2000) 623–627 7. Huang, R., Liu, Q. S., Lu, H. Q., Ma, S. D.: Solving the Small Smaple Size Problem of LDA. In: Proceedings of the 16th International Conference on Pattern Recognition. 3 (2002) 29–32 8. Martinez, A. M., Benavente, R.: The AR face database. CVC Tech. Report. 34 (1998) 9. Turk, M., Pentland, A. Eigenfaces for Recognition. Journal of Cognitive Neuroscience. 3 (1991) 72–86. 10. Yu, H., Yang, J.: A Direct LDA Algorithm for High-dimensional Data – with Application to Face Recognition. Pattern Recognition. 34 (2001) 2067–2070

Face Recognition Using Uncorrelated, Weighted Linear ...

and within-class scatter matrices, respectively; m is the mean of all samples and mi is the mean of class .... For illustration, some available images for one subject ...

514KB Sizes 0 Downloads 298 Views

Recommend Documents

Face Recognition Based on Local Uncorrelated and ...
1State Key Laboratory for Software Engineering, Wuhan University, Wuhan, ... of Automation, Nanjing University of Posts and Telecommunications, 210046, ...

Rapid Face Recognition Using Hashing
cal analysis on the recognition rate of the proposed hashing approach. Experiments ... of the images as well as the large number of training data. Typically, face ...

Rapid Face Recognition Using Hashing
cal analysis on the recognition rate of the proposed hashing approach. Experiments ... of the images as well as the large number of training data. Typically, face ...

Face Recognition Using Eigenface Approach
which the person is classified by comparing its position in eigenface space with the position of known individuals [1]. The advantage of this approach over other face recognition systems is in its simplicity, speed and .... The experiment was conduct

Face Recognition using Local Quantized Patterns
by OSEO, French State agency for innovation and by the ANR, grant reference ANR-08- ... cessing, 2009:33, 2009. [18] H. Seo and P. Milanfar. Face verification ...

Face Recognition in Surgically Altered Faces Using Optimization and ...
translation and scale invariance [3]. Russell C.Eberhart (2006) emphasis that the ... Each particle in the search space evolves its candidate solution over time, making use of its individual memory and knowledge gained by the swarm as a ... exchange

Face Recognition Using Composite Features Based on ...
Digital Object Identifier 10.1109/ACCESS.2017.DOI. Face Recognition Using Composite. Features Based on Discriminant. Analysis. SANG-IL CHOI1 ... ing expressions, and an uncontrolled environment involving camera pose or varying illumination, the recog

Face Recognition Using Sparse Approximated Nearest ...
The authors are with the School of Computer Science & Software. Engineering ..... adapting the Accelerated Proximal Gradient (APG) method to optimize (4) ...

Survey on Face Recognition Using Laplacian faces
1Student, Pune University, Computer Department, K J College Of Engineering and Management Research. Pune .... Orlando, Florida, 2002, pp.3644-3647.

Survey on Face Recognition Using Laplacian faces - International ...
Abstract. The face recognition is quite interesting subject if we see in terms of security. A system can recognize and catch criminals and terrorists in a crowd. The proponents of large-scale face recognition feel that it is a necessary evil to make

Multithread Face Recognition in Cloud
Oct 12, 2016 - biometrics systems have proven to be an essential security tools, ... showcase cloud computing-enabled face recognition, which utilizes ...

Multithread Face Recognition in Cloud
Oct 12, 2016 - this responsibility to third-party vendors who can maintain cloud ... showcase cloud computing-enabled face recognition, which utilizes ...

SURF-Face: Face Recognition Under Viewpoint ...
A grid-based and dense extraction of local features in combination with a block-based matching ... Usually, a main drawback of an interest point based fea- .... navente at the Computer Vision Center of the University of Barcelona [13].

SURF-Face: Face Recognition Under Viewpoint ...
Human Language Technology and ..... In CVPR, Miami, FL, USA, June. 2009. ... In International Conference on Automatic Face and Gesture Recognition,. 2002.

Face Recognition Based on SVM ace Recognition ...
features are given to the SVM classifier for training and testing purpose. ... recognition has emerged as an active research area in computer vision with .... they map pattern vectors to a high-dimensional feature space where a 'best' separating.

Appearance-Based Automated Face Recognition ...
http://sites.google.com/site/jcseuk/. Appearance-Based Automated Face. Recognition System: Multi-Input Databases. M.A. Mohamed, M.E. Abou-Elsoud, and M.M. Eid. Abstract—There has been significant progress in improving the performance of computer-ba

Markovian Mixture Face Recognition with ... - Semantic Scholar
cided probabilistically according to the probability distri- bution coming from the ...... Ranking prior like- lihood distributions for bayesian shape localization frame-.

Face Authentication /Recognition System For Forensic Application ...
Graphic User Interface (GUI) is a program interface item that allows people to interact with the programs in more ways than just typing commands. It offers graphical icons, and a visual indicator, as opposed to text-based interfaces, typed command la

Face Recognition in Videos
5.6 Example of cluster containing misdetection . .... system which are mapped to different feature space that consists of discriminatory infor- mation. Principal ...

Handbook of Face Recognition - Research at Google
Chapters 9 and 10 present methods for pose and illumination normalization and extract ..... Photo-sharing has recently become one of the most popular activities on the ... in social networking sites like Facebook, and special photo storage and ...

Undersampled Face Recognition via Robust Auxiliary ...
the latter advanced weighted plurality voting [5] or margin ... Illustration of our proposed method for undersampled face recognition, in which the gallery set only contains one or ..... existing techniques such as Homotopy, Iterative Shrinkage-.