A robust non-rigid point set registration method based ...

Viewer
Transcript

ARTICLE IN PRESS

JID: YCVIU

[m5G;June 24, 2015;15:19]

Computer Vision and Image Understanding 000 (2015) 1–14

Contents lists available at ScienceDirect

Computer Vision and Image Understanding journal homepage: www.elsevier.com/locate/cviu

A robust non-rigid point set registration method based on asymmetric gaussian representation✩ Gang Wang, Zhicheng Wang∗, Yufei Chen, Weidong Zhao CAD Research Center, Tongji University, NO. 4800, Cao’an Highway, Shanghai 201804, China

a r t i c l e

i n f o

Article history: Received 7 January 2015 Accepted 27 May 2015 Available online xxx Keywords: Point matching Point set registration Asymmetric Gaussian distribution Kernel method

a b s t r a c t Point set registration problem confronts with the challenge of large degree of degradations, such as deformation, noise, occlusion and outlier. In this paper, we present a novel robust method for non-rigid point set registration, and it includes four important parts are as follows: First, we used a mixture of asymmetric Gaussian model (MoAG) Kato et al. (2002) [1], a new probability model which can capture spatially asymmetric distributions, to represent each point set. Second, based on the representation of point set by MoAG, we used soft assignment technique to recover the correspondences, and correlation-based method to estimate the transformation parameters between two point sets. Point set registration is formulated as an optimization problem. Third, we solved the optimization problem under regularization theory in a feature space, i.e., Reproducing Kernel Hilbert Space (RKHS). Finally, we chose control points to build a kernel using low-rank kernel matrix approximation. Thus the computational complexity can be reduced down to O(N) approximately. Experimental results on 2D, 3D non-rigid point set, and real image registration demonstrate that our method is robust to a large degree of degradations, and it outperforms several state-of-the-art methods in most tested scenarios. © 2015 Elsevier Inc. All rights reserved.

1. Introduction Point set registration plays an important role in both computer vision and pattern recognition, and it frequently arises in many applications, such as image registration, medical imaging, structure from motion, 3D reconstruction, image stitching, image retrieval, and object tracking. Formally, a general point set registration method contains two parts, the ﬁrst one is to recover the correspondences between two point sets, and the second one is to estimate the best transformation which can align the corresponding point pairs. However, point set registration becomes increasingly diﬃcult, because of the challenge of large degree of degradations which make the distribution of point set more complex. In this paper, we mainly concentrate on four cases of degradation, i.e., deformation, noise, occlusion, and outlier. Brieﬂy, noisy data means the feature points cannot be matched precisely, and the data with occlusion and outlier mean some points cannot ﬁnd their correspondences in the corresponding point set. Generally, the point set registration can be categorized into rigid and non-rigid depends on the transformation pattern. A rigid ✩

This paper has been recommended for acceptance by Longin Jan Latecki. Corresponding author. Fax: +86 02165983989. E-mail addresses: [email protected] (G. Wang), [email protected] (Z. Wang), [email protected] (Y. Chen), [email protected] (W. Zhao). ∗

transformation, which allows the translation, rotation and scaling, is relatively easy to estimate. By contrast, a non-rigid transformation is very diﬃcult to estimate since the true transformation model is usually unknown and diﬃcult to approximate. Moreover, the non-rigid transformation is a main element in point set registration, because it exists in numerous applications, including hand-written character recognition, facial-expression recognition, and medical image registration. Recently, numerous methods exist for rigid and non-rigid point set registration. The Iterative Closest Point (ICP) algorithm [2] is one of the best known algorithms for point set registration, because of its simplicity and low computational complexity. The nearest-neighbor distance criterion is used to assign a binary correspondence at each step of the ICP. Then the transformation is reﬁned by the estimated correspondence. However, it requires a good initial position, i.e., adequately close distance between two point sets. In order to address the limitations of ICP and improve its performance, many interesting methods are proposed. Granger et al. [3] proposed a fast and robust method to align point sets under the expectation maximization (EM) framework, namely EM-ICP. Chui et al. [4] introduced a soft assignment technique and the deterministic annealing to construct a general framework to estimate the transformation and recover the correspondences for non-rigid matching. The core of their work is the transformation model which is built by thin-plate splines (TPS). They present a robust point set registration

http://dx.doi.org/10.1016/j.cviu.2015.05.014 1077-3142/© 2015 Elsevier Inc. All rights reserved.

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

JID: YCVIU 2

ARTICLE IN PRESS

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

Fig. 1. Density functions of Gaussian and asymmetric Gaussian model (AG).

algorithm named TPS-RPM, and it is more robust than ICP to deformation, noise, occlusion, and outlier, but it has high computational complexity. Zheng et al. [5] proposed a robust point set registration method for non-rigid shapes, and it preserves local neighborhood structures. Moreover, the point set registration problem is interpreted as a graph matching which outperforms the shape context [6] method and TPS-RPM. Huang et al. [7] proposed a shape registration method, which measures the similarity between free form deformations and the shape in the distance transform space. Tsin et al. [8] proposed a correlation-based method named kernel correlation (KC) for point set registration, where the correlation of two kernel density estimates are used to formulate the cost function, and maximizing the correlation between kernels (as an M-estimator) can get the best transformation parameters. Based on the theory of kernel correlation, Jian et al. [9] presented a robust point set registration approach using Gaussian mixture models (GMM), namely L2-TPS, they leverage the closed-form expression for the L2 distance between two Gaussian mixtures which represent the given point sets. Alternatively, Ma et al. [10] introduced L2-minimizing estimate (L2E) [11], a robust estimator in statistics, to estimate the non-rigid transformation, and it needs putative correspondences. Then they use shape context as the feature descriptor to update the correspondences iteratively for point set registration. Li et al. [12] proposed an asymmetric shape representation and a new high-peak-fat-tail Gaussian mixtures kernel method to align two shapes, and they use particle swarm optimization (PSO) instead of the gradient-based methods to ﬁnd the optimal transformation parameters. Theoretically speaking, all above point set registration methods use distance-based similarity measure to formulate energy functions, and they do not need to build a more complex model which includes outliers. In another idea, more complex model, which includes outliers, is built for point set registration. For example, Myronenko et al. [13] proposed an eﬃcient algorithm, namely coherence point drift (CPD), based on the motion coherence theory (MCT) [14,15]. They use the Gaussian radial basis function (GRBF) to build the transformation model instead of the TPS, and consider the points in a point set which needs to move as the GMM centroids. Most importantly, EM framework is used to estimate the unknown transformation parameters. CPD can get good results in a very short time when handling a large number of points. Ma et al. [16] proposed an interesting method for point set registration by vector ﬁeld consensus [17], namely RPMVFC, and it is very similar in the transformation model to CPD. However, point set registration methods still have many unsolved problems: 1) they are sensitive to large degree of degradations, e.g., deformation, noise, occlusion and outlier, 2) the numerical optimization often falls into local minima, 3) their computational complexity are challenging to reduce when handling large data sets. In this paper, we focus on the above unsolved problems of nonrigid point set registration, and present a novel robust point set

registration method. Brieﬂy, the core of our method is based on a asymmetric Gaussian representation method, i.e., a mixture of asymmetric Gaussian model (MoAG) to represent the density of the given point set, in particular, assuming that the point set often satisﬁes asymmetric distribution. Then we use the improved soft assignment technique to ﬁt the correspondences. A robust correlation-based method is used to estimate the parameters of the transformation model, which has been well studied in [11,18]. Moreover, we use the kernel method to build a feature space (RKHS) to solve our cost function in the style of a regularized least square, and we consider the non-rigid transformation as a functional in RKHS. Local minima often exists in gradient based numerical optimization methods, so we use annealing framework to escape the trap of local minima. Finally, we use low-rank kernel matrix approximation to reduce the computational complexity. In our previous work [19], it has been shown that the MoAG model can get better performance in most degradations on 2D point set. Here, we add more method details and experimental analysis. The experimental results demonstrate that our proposed method is robust against a large degree of degradations, and it outperforms several existing methods in most tested scenarios on 2D, 3D point set, and real image data. The rest of the paper is organized as follows: in Section 2, we present the method to represent a point set using mixture of asymmetric Gaussian model. In Section 3, we formulate point set registration based on correlation based method as an optimization problem, and search the optimal solution in RKHS by kernel method. In Section 4, we show the experimental results on 2D, 3D synthetic data and 2D real images. In Section 5, we present a conclusion.

2. Asymmetric Gaussian representation method Single Gaussian and Gaussian mixture models (GMM) have been proven the popular models in computer vision and pattern recognition. However, they do not always adapt and ﬁt any distribution of patterns, Kato et al. [1] introduced a new probability model named asymmetric Gaussian model (AG) which can capture spatially asymmetric distributions. Asymmetric Gaussian model is another form extending from Gaussian model. It is shown that Gaussian distribution has a symmetric distribution while asymmetric Gaussian representation has an asymmetric distribution, as shown in Fig. 1, where the density functions are plotted. The distribution of asymmetric Gaussian model is given by

A x|μ, σ 2 , r =

|x − μ|2 exp − γ D/2 2σ 2 2π σ 2 ((r + 1 )/2 )D |x − μ|2 + (1 − γ ) exp − , (1) 2r2 σ 2 1

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

JID: YCVIU

ARTICLE IN PRESS

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

3

Fig. 2. Registration examples on 2D non-rigid ﬁsh point set. From top to bottom are the four largest degradation scenarios: deformation (0.08), noise (0.05), occlusion (0.5), and outlier (2.0). Note that we align the Model point set (blue crosses) onto the Scene point set (red circles). (For interpretation of the references to color in this ﬁgure legend, the reader is referred to the web version of this article).

Fig. 3. Performances of registration methods on 2D non-rigid ﬁsh point set under four largest degradation scenarios: deformation (0.08), noise (0.05), occlusion (0.5), and outlier (2.0). Recall-accuracy curves are used to evaluate TPS-RPM, L2-TPS, CPD, RPM-L2E, and our method.

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

ARTICLE IN PRESS

JID: YCVIU 4

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

Fig. 4. Performances of registration methods on 2D non-rigid ﬁsh point set under twenty degradation scenarios: deformation (from 0.02 to 0.08), noise (from 0.01 to 0.05), occlusion (from 0.1 to 0.5), and outlier (from 0 to 2.0). The error bars are used to evaluate TPS-RPM, L2-TPS, CPD, RPM-L2E, and our method, which indicate the registration error means and standard deviations over 100 random trials.

where D is the dimension of data, γ ∈ (0, 1) denotes the weighting between two Gaussian components of an asymmetric Gaussian model, γ = 1 for x ≤ μ, and γ = 0 for x > μ. σ 2 and r2 σ 2 are the standard deviations of an asymmetric Gaussian model. Note that r = 1 means asymmetric Gaussian model is equivalent to Gaussian model. Based on the deﬁnition of a single asymmetric Gaussian model, it is easy to construct a mixture of asymmetric Gaussian model which may be well approximate almost any density with a linear combination of local asymmetric Gaussian model. The overall density of the J-component mixture is given by

p( x ) =

J

w j A x|μ, σ , r , 2

(2)

j=1

where wj is the weight of each component, and {w j } j = 1 are mixing J proportions satisfying 0 ≤ wj ≤ 1 and j = 1 w j = 1. J

In this paper, we deﬁne the ﬁrst point set XM×D = (x1 , . . . , xM )T as the moving Model set, and the second point set YN×D = (y1 , . . . , yN )T as the ﬁxed Scene set, X, Y ∈ R2 or X, Y ∈ R3 . We can represent both point sets by a single mixture of asymmetric Gaussian model

respectively where the number of asymmetric Gaussian components is equivalent to the number of points in the point set. Note that all asymmetric Gaussian components are weighted equally, and both point sets are normalized as distributions with zero mean and unit variance ﬁrst. 3. Robust non-rigid point set registration 3.1. Objective function We motivated by the reason that the error of L2-minimizing estimator is less than the error of maximum likelihood estimation (MLE) [11]. Then, in this paper, we use correlation-based method to estimate the similarity between the input point sets, where the point sets are represented by mixture of asymmetric Gaussian model. L2 Euclidean distance is widely used in multiple applications, and many registration methods [2,8–10] based on it. Thus, the problem of point set registration can be well formulated by minimizing the L2 Euclidean distance between two point sets. Note that the structure of the Model set is transformed as v(X, θ ) after each step of registration. The correlation between two point sets based on L2 Euclidean distance is

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

ARTICLE IN PRESS

JID: YCVIU

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

5

Fig. 5. Registration examples on 2D non-rigid Chinese character point set. From top to bottom are the four largest degradation scenarios: deformation (0.08), noise (0.05), occlusion (0.5), and outlier (2.0). Note that we align the Model point set (blue crosses) onto the Scene point set (red circles). (For interpretation of the references to color in this ﬁgure legend, the reader is referred to the web version of this article).

deﬁned as follows:

dist (X, Y, θ ) =

3.2. Estimation for correspondence

(MoAG(Y ) − MoAG(v(X, θ )))2 dx,

(3)

where MoAG( · ) denotes mixture density of asymmetric Gaussian model constructed from a point set. Thus the parameter θ of the nonrigid transformation can be solved by minimizing Eq. (3)

θ = arg min

θ

MoAG(Y )2 dx +

MoAG(v(X, θ )) dx 2

MoAG(Y )MoAG(v(X, θ ))dx.

−2

(4)

Note that the ﬁrst term ∫MoAG(Y)2 dx is a constant, because it is independent of θ . The second term does not require to estimate, since it can be evaluated exactly for any value of θ . The last term of Eq. (4), ∫MoAG(Y)MoAG(v(X, θ ))dx is the key quantity to estimate. Formally, this estimation is equivalent to the Integrated Square Error (ISE or L2E) [10] which is a robust estimator to estimate densities between two point sets. Rewriting Eq. (4), we obtain

θ = arg min

θ

−

MoAG(v(X, θ )) dx

ϕ ji = √

2πσ 2

(5)

where v is a parameterized spatial transformation family in point set registration, and MN is a normalization term. Under the registration framework, the ﬁrst term of Eq. (5) is only dependent on Model set, and it is not a key component to estimate. Ignoring the constant independent of parameter θ , σ 2 , and r, so we can deﬁne the cost function of the registration algorithm as follows: M N 2 Cost θ , σ 2 , r = C − MoAG yi − v x j , θ , MN

i=1 j=1

1

(2π σ 2 )D/2 ((r+1)/2)D

is a constant.

(6)

yi − v x j , θ 2

exp −

2σ 2

.

(7)

Then for outliers, we use xo , yo to denote the outlier cluster centers as discussed in [4]

ϕoi = √

1

ϕ jo = √

1

2πξ 2

2πξ 2

i=1 j=1

where C =

1

2

N M 2 MoAG yi − v x j , θ , MN

Note that the corresponding points can be estimated by minimizing the last term of Eq. (6), then the newly estimated point position of point yi that corresponds to xj is deﬁned as: t j = N i = 1 ϕ ji yi , where ϕ is a soft assignment [20–22] correspondence matrix of size M × N in the interval [0, 1], and satisﬁes N i = 1 ϕ ji = 1 for j ∈ 1, 2, . . . , M. {yi , i ∈ 1, 2, . . . , N} and Mathematically, for points {x j , j ∈ 1, 2, . . . , M} of two point sets, the distance between arbitrary two points (∀yi ∈ Y, ∀xj ∈ X) satisﬁes a Gaussian distribution, we obtain

exp −

y i − x o 2 , 2ξ 2

yo − v x j , θ 2

exp −

2ξ 2

(8)

,

(9)

where ξ denotes a very large scale, in our method, we set ξ = max (yi )2 . Thus we can obtain the soft assignment correspondence ϕ , and matrix after row and column normalization ϕ ji = M ji j=1 ϕ ji +ϕoi ϕ ji ϕ ji = N , respectively. Following the soft assignment, the i=1 ϕ ji +ϕ jo

correspondences can be updated iteratively.

3.3. Transformation updating The correspondence set is used to update the transformation between Model and Scene point set. After estimating the correspondence in the Section 3.2, we can obtain the newly set of point correspondences S = {(t j , x j )}M in each iteration. So the objective j=1

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

ARTICLE IN PRESS

JID: YCVIU 6

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

Fig. 6. Performances of registration methods on 2D non-rigid Chinese character point set under four largest degradation scenarios: deformation (0.08), noise (0.05), occlusion (0.5), and outlier (2.0). Recall-accuracy curves are used to evaluate TPS-RPM, L2-TPS, CPD, RPM-L2E, and our method.

where β is a constant. Thus, we can obtain the kernel matrix K

function (6) can be rewritten as follows:

Cost

M 2 MoAG t j − v x j , θ . θ , σ 2, r = C −

M

⎡

(10)

j=1

3.4. Searching for an optimal solution in feature space The motivation to use reproducing kernel Hilbert spaces (RKHS) and the representation theorem: 1) the representation theorem which states that every function in an RKHS is a linear combination of the kernel term evaluated at the training points, and 2) the empirical risk minimization problem from an inﬁnite dimensional to a ﬁnite dimensional optimization problem can be effectively simpliﬁed by the representation theorem. Then the functional form of the non-rigid transformation family v can be found by calculus of variation in an RKHS. Given the Model set X ∈ RD , the Scene set Y ∈ RD (where D = 2 or D = 3), and the estimated correspondences S, so we can deﬁne an RKHS H with a positive deﬁnite kernel function k. In this paper, we choose Gaussian radial basis function (GRBF) as the kernel function 2 of our method, which is deﬁned as: k(xi , x j ) = exp(−βxi − x j ),

k ( x1 , x1 ) .. K=⎣ . k ( xM , x1 )

... .. . ···

⎤

k ( x1 , xM ) .. ⎦, . k ( xM , xM )

(11)

where K : RD × RD → RD×D is an M × M matrix. Note that the kernel matrix associated with a positive deﬁnite kernel is positive semideﬁnite. The transformation function v ∈ H can be found by minimizing the following regularized least-squares [23,24]:

λ ε (v ) = min Cost θ , σ 2 , r + v2K , v∈H

2

(12)

where the ﬁrst term is the empirical risk, and the second term is the Tikhonov regularization [25], λ > 0 is a trade-off parameter, · K denotes a norm in the RKHS. Tikhonov regularization form smoothly determines the trade-off between v2K and the Cost(θ , σ 2 , r), most importantly, it solves the ill-posed problems in point set registration. According to the representation theorem [26], the solution of Eq. (12) to the Tikhonov regularization can be written in the

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

ARTICLE IN PRESS

JID: YCVIU

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

7

Fig. 7. Performances of registration methods on 2D non-rigid Chinese character point set under twenty degradation scenarios: deformation (from 0.02 to 0.08), noise (from 0.01 to 0.05), occlusion (from 0.1 to 0.5), and outlier (from 0 to 2.0). The error bars are used to evaluate TPS-RPM, L2-TPS, CPD, RPM-L2E, and our method, which indicate the registration error means and standard deviations over 100 random trials.

following form:

v (· ) =

M

h jK x j, ·

(13)

j=1

for some h j ∈ RM . For such a non-rigid transformation, the norm in the RKHS can be rewritten as follows:

v2K = tr H T KH ,

(14)

)T

where H = (h1 , . . . , hM is a coeﬃcient matrix of size M × D. Substituting Eqs. (13) and (14) into the cost function (10), we can therefore rewrite it with the Tikhonov regularization as follows:

E H, σ 2 , r = C −

2 λ MoAG(T − KH ) + tr H T KH , M 2

(15)

where tr( · ) denotes the trace of a matrix, and T is the estimated correspondence set of the point set Y that corresponds to X. 3.5. Optimization and deterministic annealing The aforementioned cost function is convex in the neighborhood of the optimal position and, most importantly, always

differentiable. Thus, the numerical optimization problem can be solved by employing some gradient-based optimization methods, such as Quasi–Newton method [27], or global optimization methods, such as Particle Swarm Optimization (PSO). Taking the partial derivative of Eq. (15) with respect to H, we obtain:

2K ∂E = D/2 ∂H Mσ 2 π σ 2 (r + 1 )2 /2 1 ⊗ 1 + λKH, × γ V ◦ (B ⊗ 1 ) + (1 − γ ) 2 V ◦ B r

(16)

= exp(diag(VV T )/ where V = KH − T , B = exp(diag(VV T )/2σ 2 ), B 2r2 σ 2 ), 1 is an 1 × D row vector of all ones, γ = 1 for tj ≤ (KH)j , and γ = 0 for tj > (KH)j . ◦ denotes the Hadamard product and ⊗ denotes the tensor product. However, there often exists local minima which may trap the numerical optimization. Since the large global motion or the non-rigid transformation model has high degrees of freedom. In order to escape from the trap of local minima, we adopt a coarse-to-ﬁne manner, i.e., deterministic annealing optimization framework, which is a useful heuristic method introduced by [4]. The initial temperatures of our method is σ 2 . Then we reduce the temperatures according to

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

ARTICLE IN PRESS

JID: YCVIU 8

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

Fig. 8. Registration examples on 3D non-rigid face point set. From top to bottom are the three degradation scenarios: deformation, noise, and occlusion. Note that we align the Model point set (blue crosses) onto the Scene point set (red circles). (For interpretation of the references to color in this ﬁgure legend, the reader is referred to the web version of this article).

σ 2 = α × σ 2 , where α is an annealing rate in the interval [0.90, 0.99], and we set α = 0.93 in our experiments. Note that the temperature should be set to a relatively large value to make the annealing process slow enough, in this way, we can obtain robust experimental results. Finally, the cost function will converge to an optimal solution after several iterations. 3.6. Low-rank Kernel matrix approximation The kernel matrix [26,28] plays an important role in the regularization theory, and it provides an easy way to choose an RKHS. However, it is time consuming with the original kernel matrix in registration method, in particular, handling large amount of points. The computational complexity of our method is approximately O(M3 + M2 + MN ). Motivated by the principle component analysis, we have an idea to use a low-rank kernel matrix to approximate the original one, because it can choose several principle feature vectors of the kernel matrix, and effectively reduce the computational complexity. Hopefully, low-rank kernel matrix approximation [29] can yield a large increase in speed with little loss in accuracy. Then the low-rank kernel matrix approximation of K can be deﬁned as follows:

, min K − K K

∗

(17)

is the closest where · ∗ denotes L2 and Frobenius norms. K τ -rank matrix approximation to K, and it satisﬁes both L2 and Frobe-

in place of the original nius norms. Thus, we can use the matrix K kernel matrix K. Using eigenvalue decomposition of K, we obtain the approximation matrix

= Q Q T , K

(18)

where is a diagonal matrix of size τ × τ with τ largest eigenvalues, and Q is an N × τ matrix with the corresponding eigenvectors. we can calculate parameter Based on the approximation matrix K, matrix H of size τ × D instead of the original matrix H. Thus, Eq. (13)

can be rewritten as follows:

v = Q H.

(19)

Substituting Eqs. (18) and (19) into Eq. (15), the cost function of our method therefore can be rewritten as follows:

T H, σ 2 , r = C − 2 MoAG T − U H UH + λ tr H , E M 2

(20)

where UM×τ = Q . Thus, in the optimization processing, we use the partial derivative instead of the Eq. (16) of Eq. (20) with respect to H

∂ E 2U = D/2 2 2 ∂H Mσ π σ (r + 1 )2 /2 ◦ (P ⊗ 1 ) + (1 − γ ) 1 V ◦ P ⊗ 1 + λU H, × γV 2 r

(21)

= UH − T , P = exp(diag(V V T )/2σ 2 ), P V T )/ = exp(diag(V where V ) , and γ = 0 for t > (U H ) . 2r2 σ 2 ), γ = 1 for t j ≤ (U H j j j In practice, τ M and we set τ = 15 in our experiments. Therefore, the computational complexity of our method will be reduced to O(τ 3 + τ 2 + τ N ) approximately, then it can be expressed as O(N) if the number of point set is largish. In other words, low-rank kernel matrix approximation enables our method to be applied to large point sets. 3.7. Parameter setting and implementation details There are mainly six free parameters: the scale parameter σ 2 , the asymmetric Gaussian weighting parameter r, the GRBF parameter β , the regularization parameter λ, the number of low-rank feature vectors τ , and the annealing parameter α , in our method. Scale parameter: σ 2 denotes a capture range for each mixture of asymmetric Gaussian model, and we set the initial scale σ 2 = 2 1 ( MND ) Ni= 1 Mj= 1 (yi − x j ) .

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

ARTICLE IN PRESS

JID: YCVIU

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

Algorithm 1 MoAGREG. Require: The Model point set: X, and the Scene point set: Y . Ensure: The optimal transformation parameter θ , and the optimal trans formed point set X. 1: Begin 2: Initialize parameters: σ 2 , r, β , λ, α , τ , σ 2 , and θ = 0τ ×D . f inal 3: Repeat 4: Update correspondence set S = { (t j , x j )}M using Eqs. (7)–(9). j=1 using 5: Construct kernel matrix K and its approximation matrix K

10:

Eqs. (11) and (18), respectively. Optimize the cost function (20) using the BFGS Quasi–Newton method. Update the transformation parameters: θ ← θ. Annealing: σ 2 = α × σ 2 . Until reach a termination condition: σ 2 < σ f2inal . ← (v(x , θ ), . . . , v(x , θ ))T . Update the Model point set: X

11:

End

6: 7: 8: 9:

1

M

Asymmetric Gaussian weighting parameter: r controls the asymmetric scale of each asymmetric Gaussian model, and r is set to 1.5. GRBF parameter: β controls the structural strength of the moving point set. β produces locally smooth transformation with small values, while it produces globally translation transformation with large values. Thus, β is set to 0.8. Regularization parameter: λ plays an important role to trade off the empirical risk and smoothness regularization, and λ is set to 0.1. The number of low-rank feature vectors: τ is a small integer, and denotes the degree of approximation between low-rank kernel matrix and the original one. τ M, and it is set to 15. Annealing parameter: α is annealing rate in the deterministic annealing framework. In order to get robust results, the annealing process is need to slow enough for the method. Due to α is normally between [0.90, 0.99], thus α is usually set to 0.93. Point set registration algorithms usually contain iteration process, we can estimate the non-rigid transformation parameters to update and move the Model point set iteratively, until reach a given termination condition. In the deterministic annealing framework, the termination condition σ f2inal is set to 0.001 according to the initial scale σ 2 . The pseudo code of our non-rigid point registration method is outlined in Algorithm 1 (referred to as MoAGREG).

9

dence on wide-baseline real images [30]. Note that the point sets are normalized to zero mean and unit variance at the beginning of the experiments. Comparison methods. Point set registration methods: TPS-RPM [4], L2-TPS [9], CPD [13], and RPM-L2E [10]. Graph-based matching methods: integer projected ﬁxed point method (IPFP) [31], reweighted random walks for graph matching (RRWM) [32], and factorized graph matching (FGM) [33,34]. Feature correspondence method: random sample consensus (RANSAC) [35]. All methods are implemented in Matlab, and tested on an I5-2450 CPU with 8GB RAM. 2 = 0.5, iteration number = Parameter setting. TPS-RPM: σinitial T

1/2D

2 500, and annealing rate: α = 0.93. L2-TPS: σinitial = det ( XMX ) , β = 1.0, λ = 0, and the max iteration number is set to 103 . 2 1 2 CPD: σinitial = ( MND ) Ni= 1 Mj=1 (yi − x j ) , β = 2.0, λ = 3.0, outlier weighting parameter is set to 0.7 just for outlier scenarios, and the termination condition is σ f2inal = 10−8 or iteration number = 100, note that it does not use the fast Gauss transform (FGT) [36] by 2 default. RPM-L2E: σinitial = 0.05, β = 0.8, λ = 0.1, iteration number =20, and annealing rate: α = 0.5, note that it uses shape context (SC) [6] to estimate the correspondences iteratively. The graph matching methods keep the default parameters which are implemented by Zhou [34]. We use the RANSAC implemented by Vedaldi [37] in Matlab, and its iteration number is set to 104 . Evaluation criterions. Recall-precision, as well known in statistics and pattern recognition, is used to measure the relevance between results and ground-truth. Recall-accuracy curve, as used in [9,38], which denotes the ability of a registration method ﬁnding correct correspondences as many as possible with low accuracy errors. F1 measure (i.e. F1 score) is used to evaluate the balance between recall and precision. Root mean-squared error (RMSE) is used to measure the registration error.

Recall =

TP , TP + FN

Precision = F1 =

TP , TP + FP

2 · Precision · Recall , Precision + Recall

(22)

(23)

(24)

where TP denotes true positive, FP denotes false positive, and FN denotes false negative. 0 ≤ Recall ≤ 1, 0 ≤ Precision ≤ 1, and 0 ≤ F1 ≤ 1.

M 1 Error = t j − v x j, θ , M

(25)

j=1

4. Experiments The algorithm is implemented in Matlab 2012b, and tested on a Pentium Core I5-2450 CPU with 8GB RAM.

where M is the total number of the estimated correspondences, and tj is the ground-truth that corresponds to xj . 4.2. Results on 2D synthetic point set

4.1. Experimental setup Experimental data. In order to evaluate the performance of our method, we design ﬁve main experiments to demonstrate the eﬃciency of our method: 1) non-rigid point set registration on 2D data under degradations 1 , e.g., deformation, noise, occlusion, and outlier, 2) non-rigid point set registration on 3D face data 2 , 3) point matching on CMU house image dataset 3 , 4) point registration on Oxford aﬃne covariant regions datasets 4 , and 5) point set correspon1 2 3 4

http://www.cise.uﬂ.edu/˜anand/students/chui/tps-rpm.html https://sites.google.com/site/myronenko/research/cpd http://vasc.ri.cmu.edu/idb/html/motion/house/ http://www.robots.ox.ac.uk/˜vgg/data/data-aff.html

In this section, we ﬁrst present the results of point set registration methods, and then evaluate them via recall-accuracy and registration error between the transformed Model set and the ﬁxed Scene set. Meanwhile, we compared the performance of our method against four state-of-the-art methods: TPS-RPM [4], L2-TPS [9], CPD [13], and RPM-L2E [10], which are implemented by their publicly available codes in Matlab. We tested the performance of registration method on the same data sets as used in [4,5,10] named Chui–Rangarajan synthesized data sets. Four group point sets are selected and designed to evaluate the robust performance of registration methods under degradations, such as deformation, noise, occlusion and outlier. Speciﬁcally speaking, noise is deﬁned as white Gaussian noise with a zero mean and

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

JID: YCVIU 10

ARTICLE IN PRESS

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

Fig. 9. Registration results of our method on 3D non-rigid face point set with regard to outlier (outlier to data ratio: 1.0, 1.5, and 2.0). The top row denotes the initialization of point set. The bottom row denotes the registration results. Note that we align the Model point set (blue crosses) onto the Scene point set (red circles). (For interpretation of the references to color in this ﬁgure legend, the reader is referred to the web version of this article).

Fig. 10. Performances of registration methods on 3D non-rigid face point set under six degradation scenarios: deformation (default), noise (0.6), occlusion (missing 100 points), and outlier (1.0, 1.5, 2.0). Recall-accuracy curves are used to evaluate TPS-RPM, L2-TPS, CPD, and our method.

Fig. 11. The matching result of our method between frame 1 and frame 111 of the CMU house sequence. (For interpretation of the references to color in this ﬁgure legend, the reader is referred to the web version of this article).

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

JID: YCVIU

ARTICLE IN PRESS G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

[m5G;June 24, 2015;15:19] 11

Fig. 12. Examples of the Oxford aﬃne covariant regions datasets. Each group has ﬁve image pairs from 1 to 2 to 1 to 6.

standard deviation from 0.01 to 0.05, which disturbs the true position of each point. Occlusion, i.e. missing points, denotes some points of one set have no corresponding points in the other set. Outlier, likes occlusion, may affect matching and registration results signiﬁcantly. Note that each group of point set also has some degree of rotation ([−π , π ] approximately). Non-rigid Fish and Chinese character shape point sets are used for point set registration, respectively. There are 100 samples in each degradation level, 4000 samples in all. 4.2.1. Fish 16 examples of qualitative experimental results on 2D ﬁsh point set, as shown in Fig. 2, under the largest degradations of deformation (degree 0.08), noise (level 0.05), occlusion (ratio 0.5), and outlier (ratio 2.0). Deformation: the Model point set is aligned to the Scene point set, all tested methods performs well in this example. Noise: the point positions of the Scene point set are disturbed by white Gaussian noise, and all tested methods cannot align these point set pairs perfectly. Occlusion: the Model point set misses some points, and TPSRPM, CPD, and our method perform well, while the performance of L2-TPS is affected by the missing points. Outlier: a lot of outliers are added in the Scene point set, which satisfy the Gaussian distribution, and our method aligned better than the other tested methods. Quantitative experimental comparison results using recallaccuracy on 2D ﬁsh point set are shown in Fig. 3 under the four largest degradation scenarios. In registration methods, it is better to ﬁnd all true positive correspondences with lower error. Note that all 100 examples of each degradation are tested. The comparison curves of Fig. 3 show that our method can get better recall values in most tested accuracy threshold than the other methods. RPM-L2E performs well

Fig. 14. Examples to recover correspondences on wide-baseline image pairs. From the top row to the bottom row (Mex, Tree, Wash), the number of correspondences are 82, 94, and 47. Note that the yellow lines denote the recovering correspondences, while the green lines denote the missing ones. (For interpretation of the references to color in this ﬁgure legend, the reader is referred to the web version of this article).

in the case of noise, because of the robustness of shape context to some degree of noise. Quantitative experimental comparison results using error bars on 2D ﬁsh point set are shown in Fig. 4 under all degradations. The performances of all tested methods are recognized clearly. Generally, the registration errors of all methods become large as increasing the degree of degradations. L2-TPS obtained poor performances when faced with large degradations, while the other tested methods performs well. Moreover, shape context is not robust to outliers, and then RPML2E obtained large errors under some degree of outliers. Relatively speaking, our method gives the best performances over all degradations in the experiment. 4.2.2. Chinese character This point set is different from the above one (ﬁsh), and its shape point set is not well clustered. Fig. 5 shows the examples of qualitative experimental results under four degradations. Our method aligns both point sets perfectly under deformation, occlusion and outlier degradations. Although the large degree of noise also affects the accuracy of registration, our method can get better alignment than TPSRPM, L2-TPS, and CPD. Fig. 6 shows the recall-accuracy curves of TPS-RPM, L2-TPS, CPD, RPM-L2E, and our method when facing with deformation (0.08), noise (0.05), occlusion (0.5) and outlier (2.0). Those curves also

Fig. 13. Performances of registration methods on the Oxford aﬃne covariant regions datasets. Bars denote the average of registration error and the standard deviations over 100 random trials.

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

ARTICLE IN PRESS

JID: YCVIU 12

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

Fig. 15. F1 -measure scores of RANSAC, RPM-L2E and our method on wide-baseline image pairs under outlier.

demonstrate that our method gives better recall values with lower errors than the other tested methods. Follow the above experiment, we tested 2000 pairs of Chinese character point sets, and the error bars are shown in Fig. 7. The whole tendency of error bars is similar with the results of ﬁsh point set registration, and the registration error of our method when facing with outlier are slightly higher than the above ﬁsh experiment, because those point sets are not well clustered. Fig. 7 demonstrates that our method gives the lower registration errors than the other tested methods under all degradation scenarios.

Table 1 Matching rate on the CMU house sequence. Comparison methods: IPFP [31] initialized with spectral matching (IPFP-SM), RRWM [32], and FGM [33,34] (FGM-D for directed graphs and FGM-U for undirected graphs). Method

30 points (%)

25 points (%)

IPFP-SM RRWM FGM-U FGM-D Ours

100 100 100 100 100

55.95 ± 24.11 69.85 ± 19.81 74.94 ± 16.56 64.78 ± 27.16 92.08 ± 6.68

4.3. Results on 3D face point set In this experiment, we use the 3D face point set as used in [13] with 392 points to test the registration method. Based on the original 3D face data structure, we constructed several point set pairs with respect to some degree of noise, occlusion and outlier. Here, we compare our method against TPS-RPM, L2-TPS, and CPD. Qualitative experimental results are shown in Fig. 8. Our method shows accurate alignments under deformation, noise, and occlusion. The degree of deformation is small, and all tested methods obtain well performances. The performances slightly decreased when adding some degree of white Gaussian noises, while our method gets the better result. Although 100 points of the Model point set are occluded, the remaining points are also aligned perfectly to their corresponding points. Furthermore, we added 392, 588, and 784 outliers to the Scene point set randomly, and the registration results of our method are shown in Fig. 9. Registration results show that our method aligns perfectly when the Model set is contaminated with a large number of outliers. Quantitative experimental results using recall-accuracy curves are shown in Fig. 10. The ﬁrst two ﬁgures (deformation and noise) show that all tested methods get the best performances, while the other tested methods get poor performances under occlusion and outlier degradations. Our method shows the best performances under these degradations. Observe that our method is not sensitive to outliers as increasing the outlier to data ratio. Almost true corresponding points of two point set are aligned quite well and, most importantly, our method takes less than 10 s, while the naive method without lowrank approximation takes more than 20 min for about 1000 points in the experiment. 4.4. Results on CMU house sequence We use the CMU house sequence, which consists of 111 frames of a toy house with different viewpoints, to test our method on point matching. 30 landmarks are used to label each of the house images, in other words, all correspondences between any two frames are obtained. Fig. 11 shows a matching example using our method between

frame 1 and frame 111, where the red points are the landmarks. Here, we evaluate our method against several graph matching methods: IPFP [31] initialized with spectral matching (IPFP-SM), RRWM [32], and FGM [33,34] (FGM-D for directed graphs and FGM-U for undirected graphs) under all possible frame pairs. We selected all 30 landmark points to test the matching rate between frame pairs, then we selected 25 points randomly to evaluate the matching performances of the comparison methods. Note that 25 points experiment means that consists of some outliers. The matching results are shown in Table 1. All methods get the same performance under all 30 landmarks, while the graph based methods are sensitive to outliers when selecting 25 landmarks randomly. Our method performs better than the tested methods. 4.5. Results on oxford aﬃne covariant regions datasets The Oxford aﬃne covariant regions datasets consist of eight group of real images, and each group consists of six images. We constructed ﬁve image pairs from image 1 to image 2, 3, 4, 5, 6, respectively. Varying blur, viewpoint, scale, rotation, light, and JPEG compression between image pairs (Fig. 12). In this experiment, we ﬁrst extracted feature points from each image using SIFT (implemented by Vedaldi [37] in Matlab), then the correspondences are correctly matched by the best-bin-ﬁrst (BBF) method [39]. Thus, the Model and the Scene point sets are constructed to test the registration methods: L2-TPS, CPD, and our method. Note that we randomly selected 100 points (when the number of correspondences is greater than 100), and ran for 100 times. Fig. 13 shows the performances of registration methods. Most average registration errors of our method are less than 1pixel, and our method performs better than L2-TPS and CPD in most tested scenarios. 4.6. Results on wide-baseline real image pairs In this experiment, we use the wide-baseline image pairs [30] to test the performance of our method on point set correspondence recovering. First, SIFT [39] feature points are extracted to

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

ARTICLE IN PRESS

JID: YCVIU

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

construct the input point sets: the Model set and the Scene set. In addition, we use precision-recall pairs and F1 measure criterion to evaluate the performances of the tested methods: RANSAC [35] and Ma et al. [10]. The well known RANSAC is widely used to ﬁt subspace and reject outliers, which is implemented by Vedaldi [37] in Matlab. RPM-L2E (Ma et al. [10]) based on L2E which is a very robust method to eliminate outliers in the application of mismatch removal. The threshold of our method is deﬁned as: (γ exp(− Y −2 X ) + (1 − γ ) exp(− 2Y −2X )) ≥ L , where γ = 1 for 2σ

f inal

2r σ

[m5G;June 24, 2015;15:19] 13

Acknowledgment This work was supported by National Natural Science Foundation of China (NSFC, No. 61103070), and Program for Young Excellent Talents in Tongji University (2013KJ008). The authors wish to acknowledge H. Chui, A. Myronenko, B. Jian, F. Zhou and J. Ma for providing their implemented source codes and test data sets.

f inal

y j ≤ xˆ j , and γ = 0 for y j > xˆ j , and our method performs well for threshold in [0.1, 0.8]. Then we set L = 0.5 for our method to remove outliers, and the threshold of RPM-L2E is also set to 0.5. Note that the threshold decides whether a point is an inlier or not. Fig. 14 shows the experimental results on 3 pairs of widebaseline images without outliers. Our method recovers almost all correspondences, and its performance is better than RANSAC and RPM-L2E. Then we added outliers to those image pairs from outlier to data ratio 0.1–0.9 to test the robustness of the method to outliers. In our study, the results of RPM-L2E are instable, and for instance, we run it for ﬁve times on the image pairs Tree with 43.71% outliers, and its results are (92.77%, 81.91%), (93.97%, 82.97%), (93.40%, 90.42%), (91.57%, 92.55%) and (93.25%, 88.29%), respectively. By contrast, our method can obtain well stable result (95.83%, 97.87%) and have better trade-off between precision and recall evaluation criterions. The comparison curves are shown in Fig. 15. Observed that the F1 measure curves are decreased as increasing outlier to data ratio. Our method still gives higher F1 scores than RANSAC and RPM-L2E in most tested scenarios. 5. Conclusion Point set registration problem is still unsolved in computer vision, because it confronts with the challenge of degradations, such as deformation, noise, occlusion, and outlier. In this paper, we focus on the degradations, and present a novel robust point set registration method based on asymmetric Gaussian representation. We assume that the distribution of point set satisﬁes asymmetric Gaussian model, and a mixture of asymmetric Gaussian model (MoAG) can capture spatially asymmetric distributions. It is different from the methods, e.g., CPD and L2-TPS, which use the Gaussian mixture models (GMM) to represent the point set. We iteratively estimate the closest distance between two MoAGs by minimizing the correlationbased cost function. It is different from the methods, e.g., KC and L2TPS, because they estimate the transformations without updating the correspondences. Both our method and TPS-RPM, a robust point set registration method, perform well with respect to degradations, but the performance of our method is better than TPS-RPM, because of the robust estimator for non-rigid transformation of our proposed method. Our method includes three main aspects: for the point set representation, we use the mixture of asymmetric Gaussian model (MoAG) which is more accuracy than Mixture of Gaussian model [1]. For robust estimation of non-rigid transformation, we formulate point set registration as an optimization problem by the regularized least square. In RKHS, we use the Tikhonov regularization theory and the representation theorem to get the solution form of the non-rigid transformation. Gradient based numerical optimization combines the deterministic annealing scheme to search for an optimal solution, and escape the potential local minima. For reducing the computational complexity of the method, we use low-rank kernel matrix approximation as choosing the principle component to reduce the computational time. Extensive experimental results demonstrate that our method outperforms several state-of-the-art methods on 2D, 3D point sets and real image data.

References [1] T. Kato, S. Omachi, H. Aso, Asymmetric gaussian and its application to pattern recognition, in: Structural, Syntactic, and Statistical Pattern Recognition, Springer, 2002, pp. 405–413. [2] P.J. Besl, N.D. McKay, A method for registration of 3-d shapes, IEEE Trans. Pattern Anal. Mach. Intell. 14 (2) (1992) 239–256. [3] S. Granger, X. Pennec, Multi-scale em-icp: a fast and robust approach for surface registration, in: Computer Vision-ECCV 2002, Springer, 2002, pp. 418–432. [4] H. Chui, A. Rangarajan, A new point matching algorithm for non-rigid registration, Comput. Vis. Image Und. 89 (2) (2003) 114–141. [5] Y. Zheng, D. Doermann, Robust point matching for nonrigid shapes by preserving local neighborhood structures, IEEE Trans. Pattern Anal. Mach. Intell. 28 (4) (2006) 643–649. [6] S. Belongie, J. Malik, J. Puzicha, Shape matching and object recognition using shape contexts, IEEE Trans. Pattern Anal. Mach. Intell. 24 (4) (2002) 509–522. [7] X. Huang, N. Paragios, D.N. Metaxas, Shape registration in implicit spaces using information theory and free form deformations, IEEE Trans. Pattern Anal. Mach. Intell. 28 (8) (2006) 1303–1318. [8] Y. Tsin, T. Kanade, A correlation-based approach to robust point set registration, in: Computer Vision-ECCV 2004, Springer, 2004, pp. 558–569. [9] B. Jian, B.C. Vemuri, Robust point set registration using gaussian mixture models, IEEE Trans. Pattern Anal. Mach. Intell. 33 (8) (2011) 1633–1645. [10] J. Ma, W. Qiu, J. Zhao, Y. Ma, Y. Alan, Z. Tu, Robust L2E estimation of transformation for non-rigid registration, IEEE Trans. on Signal Process 63 (5) (2015) 1115–1129. [11] D.W. Scott, Parametric statistical modeling by minimum integrated square error, Technometrics 43 (3) (2001) 274–285. [12] H. Li, T. Shen, X. Huang, Global optimization for alignment of generalized shapes, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2009, pp. 856–863. [13] A. Myronenko, X. Song, Point set registration: Coherent point drift, IEEE Trans. Pattern Anal. Mach. Intell. 32 (12) (2010) 2262–2275. [14] A.L. Yuille, N.M. Grzywacz, The motion coherence theory, in: IEEE 2nd International Conference on Computer Vision (ICCV), IEEE, 1988, pp. 344–353. [15] A.L. Yuille, N.M. Grzywacz, A mathematical analysis of the motion coherence theory, Int. J. Comput. Vis. 3 (2) (1989) 155–175. [16] J. Ma, J. Zhao, J. Tian, A.L. Yuille, Z. Tu, Robust point matching via vector ﬁeld consensus, IEEE Trans. Image Process. 23 (4) (2014) 1706–1721. [17] J. Ma, J. Zhao, Y. Ma, J. Tian, Non-rigid visible and infrared face registration via regularized Gaussian ﬁelds criterion, Pattern Recognit. 48 (3) (2015) 772–784. [18] A. Basu, I.R. Harris, N.L. Hjort, M. Jones, Robust and eﬃcient estimation by minimising a density power divergence, Biometrika 85 (3) (1998) 549–559. [19] G. Wang, Z. Wang, W. Zhao, Q. Zhou, Robust point matching using mixture of asymmetric gaussians for nonrigid transformation, in: Computer Vision-ACCV 2014, Part IV, Springer, 2015, pp. 433–444. [20] A. Rangarajan, H. Chui, F.L. Bookstein, The softassign procrustes matching algorithm, in: Information Processing in Medical Imaging, Springer, 1997, pp. 29–42. [21] S. Gold, A. Rangarajan, C.-P. Lu, S. Pappu, E. Mjolsness, New algorithms for 2d and 3d point matching: pose estimation and correspondence, Pattern Recognit. 31 (8) (1998) 1019–1031. [22] H. Chui, J. Rambo, J. Duncan, R. Schultz, A. Rangarajan, Registration of cortical anatomical structures via robust 3d point matching, in: Information Processing in Medical Imaging, Springer, 1999, pp. 168–181. [23] L. Baldassarre, L. Rosasco, A. Barla, A. Verri, Vector ﬁeld learning via spectral ﬁltering, in: Machine Learning Knowledge Discovery in Database, Springer, 2010, pp. 56–71. [24] R. Rifkin, G. Yeo, T. Poggio, Regularized least-squares classiﬁcation, Nato Sci. Ser. Sub Ser. III Comput. Syst. Sci. 190 (2003) 131–154. [25] A.N. Tikhonov, V.Y. Arsenin, Solutions of ill-posed problems, V.H. Winston, 1977. [26] C.A. Micchelli, M. Pontil, On learning vector-valued functions, Neural Comput. 17 (1) (2005) 177–204. [27] J. Nocedal, S.J. Wright, Conjugate Gradient Methods, Springer, 2006. [28] D. Tschumperle, R. Deriche, Vector-valued image regularization with pdes: a common framework for different applications, IEEE Trans. Pattern Anal. Mach. Intell. 27 (4) (2005) 506–517. [29] I. Markovsky, K. USEVICH, Low Rank Approximation, Springer, 2012. [30] T. Tuytelaars, L. Van Gool, Matching widely separated views based on aﬃne invariant regions, Int. J. Comput. Vis. 59 (1) (2004) 61–85. [31] M. Leordeanu, M. Hebert, R. Sukthankar, An integer projected ﬁxed point method for graph matching and map inference, in: Advances in Neural Information Processing Systems, 2009, pp. 1114–1122. [32] M. Cho, J. Lee, K.M. Lee, Reweighted random walks for graph matching, in: Computer Vision–ECCV 2010, Springer, 2010, pp. 492–505.

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

JID: YCVIU 14

ARTICLE IN PRESS

[m5G;June 24, 2015;15:19]

G. Wang et al. / Computer Vision and Image Understanding 000 (2015) 1–14

[33] F. Zhou, F. De la Torre, Factorized graph matching, in: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2012, pp. 127–134. [34] F. Zhou, F. De la Torre, Deformable graph matching, in: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2013, pp. 2922–2929. [35] M.A. Fischler, R.C. Bolles, Random sample consensus: a paradigm for model ﬁtting with applications to image analysis and automated cartography, Commun. ACM 24 (6) (1981) 381–395. [36] L. Greengard, J. Strain, The fast gauss transform, SIAM J. Sci. Stat. Comput. 12 (1) (1991) 79–94.

[37] A. Vedaldi, B. Fulkerson, Vlfeat: An open and portable library of computer vision algorithms, in: Proceedings of the International Conference on Multimedia, ACM, 2010, pp. 1469–1472. [38] J. Starck, A. Hilton, Correspondence labelling for wide-timeframe free-form surface matching, in: IEEE 11th International Conference on Computer Vision (ICCV), IEEE, 2007, pp. 1–8. [39] D.G. Lowe, Distinctive image features from scale-invariant keypoints, Int. J. Comput. Vis. 60 (2) (2004) 91–110.

Please cite this article as: G. Wang et al., A robust non-rigid point set registration method based on asymmetric gaussian representation, Computer Vision and Image Understanding (2015), http://dx.doi.org/10.1016/j.cviu.2015.05.014

A robust non-rigid point set registration method based ...

The algorithm is implemented in Matlab 2012b, and tested on a. Pentium Core I5-2450 CPU ..... their implemented source codes and test data sets. References.

Download PDF

5MB Sizes 2 Downloads 176 Views

Report

A robust non-rigid point set registration method based ...

Recommend Documents