Affine dense matching for wide baseline stereo Zoltán Megyesi and Dmitry Chetverikov MTA SZTAKI and ELTE H-1111 Hungary, Budapest, Kende u.13-17

Abstract Reconstructing the model of a real-world scene from images is a widely applicable and challenging computer vision task. Dense matching is an image processing problem that has always been a crucial step in the reconstruction. It locates projections of visible points on the two or more different image planes, thus setting the depth level of the points. A variety of efficient methods are available to solve the dense matching problem in the case of traditional short baseline stereo, when the two views are close and similar. However, when wide baseline conditions apply, when the viewpoints are substantially different, the matching becomes much more difficult. So far, this problem has not been addressed properly. In this paper, we propose a region growing based dense matching algorithm, that uses affine transformations to coop with wide baseline stereo images. We demonstrate that, given the epipolar geometry and adequate seed points, the proposed method effectively deals with wide baseline images. Categories and Subject Descriptors (according to ACM CCS): I.4.8 [Image Processing and Computer Vision]: Scene Analysis

1. Introduction Reconstructing a real-world scene has been a dedicated computer vision task for decades. The ability of sensing, understanding, rebuilding a 3D object or environment from visual information is invaluable for navigation tasks, architecture, photogrammetry, virtual reality applications, even medicine. However, the way from the images to any kind of model is far from straightforward. To obtain a realistic, rich model, we need to solve problems in different fields, from geometric modelling through image processing to graphics. The general, widely used process1 2 3 takes many steps, and each step has its own hard-points and limitations that affect the whole. This process is based on building the epipolar geometry from a number of initially matched points among the images. Although this geometry is the basis for any further step, to complete the reconstruction we need additional information regarding the cameras. Multiple images give us a way to automatically reconstruct a scene up to an unknown similarity transformation4 . To acquire a 3D model from different views, we need to find the projections on the image planes of as many points as possible. Reliable 3D information can only be assigned to points with at least two recognised projections. Dense matching is the image processing task to find these projec-

tions. Using the epipolar geometry and other constraints, we search the images for pixels of similar neighbourhood. Unfortunately some natural phenomena (occlusions, object borders, shadows, periodicity, projective distortion, un-textured regions) make this costly search ambiguous and impossible in many cases. Algorithms has been proposed to reduce ambiguity, and to address some of these phenomena3 5 8 .

Wide Baseline Stereo As already mentioned, dense matching is both costly and ambiguous. Some of the ambiguity can be derived back to the very foundation of the reconstruction: different viewpoints. Occlusions, projective distortion, matching difficulties near object borders are innate problems of any stereo algorithm. When the viewpoints are close, these problems are not critical, since the occlusions and distortions are small. This is not the case when wide baseline conditions apply. Here the viewpoints are far from each other, creating large occlusions and distortions. Same physical regions look, very different between views. When similarity fails, classical methods lose their ground, so special methods are required to solve the dense matching problem under wide baseline conditions.

Megyesi, Chetverikov / Affine Dense Matching

Assumptions and Contributions We propose a region growing based dense matching algorithm for wide baseline stereo. For simplicity, we work on image pairs, but with proper geometric considerations the method can easily be extended to multiple images. The following basic assumptions are used:

coincide4 . In a rectified image pair, if a pixel in the first image has a visible corresponding point in the other image, it can only be of the same row (See figure 1.) This not only utilises the epipolar constraint, but also gives us a way for simplifying our search for the best affine transformation.

1. The projective distortion between views can locally be approximated by an affine transformation. 2. On a smooth surface, this affine transformation and the disparity change smoothly. First we calculate the epipolar geometry to create rectified images, on which we locate reliably matchable seed points. For these seeds, we search for the best fitting affine transformation and disparity. These transformations are used as the basis for a region growing algorithm. We try to match smooth areas around the seed points, eventually segmenting the area into connected regions. We experimentally show that in case of successful seed selection our method yields reliable result on wide baseline images. 2. Preparations Images for matching For a successful reconstruction, we need effective dense matching. To achieve this, we may need some preprocessing steps to enhance the images, making them more alike and matchable. First of all, we may need radiometric corrections. Lighting conditions may change with the viewpoint, complicating the matching immensely. In this case, contrast stretching or adaptive histogram equalisation can improve the results significantly. Often sharpening2 can help by emphasising well matchable structural information in the images. Epipolar geometry The epipolar geometry is the geometric model behind the reconstruction. It models the relative position and orientation of the image planes. For each point in one image, it defines a line on the other image, where its corresponding point can possibly be. This can be used to constrain the search in dense matching. The epipolar geometry can be estimated from a sufficient number of sparse, initially matched points. For wide baseline images special methods are needed to automatically build the epipolar geometry6 10 11 . Although this is a matching problem too, its different aim and limitations make these methods inappropriate for dense matching. Rectified images In our method we deal with rectified image pairs. In rectification, 2D projective transformations are applied on the images. Given the epipolar geometry, the image planes are transformed so that the corresponding epipolar lines

Figure 1: A wide baseline image pair (top row) and result of rectification (bottom row).

Seed points Region growing is a versatile image processing method used for image segmentation. A region growing algorithm requires seed pixels, whose properties are propagated iteratively over neighbouring pixels as long as a homogeneity condition is satisfied. This propagation eventually grows the seeds into regions, which can later be merged to segment the image into homogeneous areas14 . We need many widely scattered, reliably matchable points as seeds for our region growing method. These are usually either highly textured areas or object corners. In our implementation, we used the KLT corner detector9 to automatically select well-textured seed points. It is important to have at least one reliable seed on each continuous surface for the surface to be reconstructed. 3. Affine Matching Generally if we look at two distinct views of a scene, the same area around a point might look significantly different, due to arbitrary projective transformation. Should we match these areas under wide baseline conditions, the results would be rather inaccurate if at all possible, except perhaps for the best of the cases. To compare these views we either compare features that are invariant to certain transformations6 10 11 , or try to approximate a transformation that makes one view look similar to the other 12 , and match with a conventional

Megyesi, Chetverikov / Affine Dense Matching

similarity function like Sum of Absolute Differences (SAD), Sum of Squared Differences (SSD), or Normalised CrossCorrelation (NCC)13 . The latter category of methods is better suited for dense matching purposes, since it can be applied over most parts of the image. Finding the best fitting affine transformation is costly, so we need to apply certain constraints in order to make the matching feasible. First of all we use our first assumption, if a region is on a smooth surface, its view on an other image can be locally approximated by an affine transformation. We can find this affine transformation by distorting the matched window, calculating the similarity and selecting the best resulting affine transformation and position. However, this creates ambiguity. By matching affinely distorted windows we may find false positive matches. To avoid this we must limit the affine transformations and trust that we will find the best affine transformation for the true positive as well, and it will overrule the false ones. The other problem is that this procedure requires heavy computations. Fortunately, the prior knowledge of the epipolar geometry grants us further constraints. If we take a rectified image pair, any pixel in the first image has its match in the same row of the other image. This not only limits the location of the corresponding point, but if we consider an area, there is no stretch and skew in the y direction. (See figure 2.) This removes two parameters from the four parameters of a 2D affine transformation matrix. An example on selected seed-points is shown in figure 3. Finally, if we add our second assumption (smoothness), we may not need thorough search in a large search space since possible parameter values for both disparity and affine transformation are constrained.

Figure 2: Example on affine distortion. Samples from the original image pair (top row), samples from the rectified image pair (middle row) and the first rectified image transformed with affine transformation, to match the second one (bottom row).

Pseudo Code Region Growing Indoor and outdoor scenes can often be divided into surface regions of different depth and orientation. In wide baseline matching, different regions are best treated differently. However, quite similar conditions apply inside each region. In our case both disparity and affine transformation changes smoothly inside a smooth region. Considering these and the heavy computation cost of finding the right transformation, the implementation of a region growing algorithm seems to give a feasible solution. If we manage to find a good seedpoint on a smooth surface, we can propagate both the disparity and the best affine transformation over the entire region. We may define different homogeneity conditions for the growing. Since our aim was to precisely reconstruct smooth surfaces, we assign pixels to a region where the best affine transformation and disparity is changing smoothly during the propagation.

Here we outline our algorithm in pseudo code. Let us assume we have a rectified image pair with suitable seed points selected in the first image. S set of seed point coordinates Il left image Ir right image R Region parameter array of size S L Label image B Border list D Disparity map //Initialization for each s 

S, Rs : 

empty_region;

Megyesi, Chetverikov / Affine Dense Matching

Figure 3: Seed-points and the best matching affinely distorted windows

B:

;

 x  y



for each unde f ined;



  

Il , L x y



  

candidate,D x y

b : B, B : 

//Finding best disparity and affine parameters for the seedpoints while S



{



s : S, S :

l: 

11



s





12





} else {

 

Lb :

} }

//Region growing {

12

l

        L  b  : label_o f  l  ; B : B candidate_borders_o f  b  ; D  b  : d; Rl add a11 a12 d c ;





11

if (l) {







11



while B

 

labelled_borders_o f b

 a  a  d  c   

is_homogeneous   a  a  d  c  R  ;



11



arg maxc l



 

S s;

12



B b;

best_params b region_limitations l

 a  a  d  c best_params  s  limitations ; R  add   a  a  d  c   ; L  s  : label_o f  s  ; D  s  : d; B : B candidate_borders_o f  s  ; 



}



candidate

12



Megyesi, Chetverikov / Affine Dense Matching

Function best_params does the searching part by checking all possible disparities – the whole epipolar line for the seed-points and a few pixels for the others – and selecting the best possible affine transformation. In our case we search the affine parameters by stepping through a limited domain, and calculating SAD difference. Function is_homogeneous decides whether two regions are close enough when considering the affine parameters and disparity. After growing seed-points into regions, we can easily merge regions with similar properties, segmenting the image into continuously smooth regions. 4. Experimental results In our implementation, we worked on gray-scale images. After preprocessing the images with adaptive histogram equalisation, we calculated a fairly accurate epipolar geometry based on manually given points. After the rectification step, we selected feature points by the KLT feature detector9 . After calculating the disparity levels with our method, we interpreted them into 3D depth, to help visualisation. We have compared our results with a conventional dense matching algorithm using SSD and left right consistency check1 . Examining the results, one can see that our method performs better on flat and textured regions. Un-textured regions and regions without good seed-points are lost. 3D visualisation shows that the ratio of good and erroneous matches is significantly higher in our case. See figure 4 for examples. 5. Conclusions We have developed a region growing based stereo matching algorithm, that is capable of matching wide baseline images. Although the algorithm heavily depends on good seed-points, with their help it provides reliable matching on smooth surfaces of different depth and orientation. The running time of the algorithm depends on the number of seed-points and the size of the matchable regions. Indexing techniques can be used to implement a fast affine parameter search. Though in our method we used the SAD difference for simplicity, SSD or NCC can also be applied for better results. In our experiments we did not use other methods for reducing ambiguity, or for refining matches near 3D edges. Future work will include integration of these methods. Acknowledgements This work is supported by OTKA under grants T038355 and M28078. The authors thank Zsolt Jankó for providing programs for rectification and epipolar geometry. References 1.

E. Trucco, A. Verri. Introductory Techniques for 3-D Computer Vision. 150-175, Prentice Hall International (UK) Ltd, 1998.

Figure 4: Examples: Rectified images (upper rows), disparity maps with proposed method (lower left) and standard dense matching using SSD (lower right).

Megyesi, Chetverikov / Affine Dense Matching

2.

M. Sonka, V. Hlavac, R. Boyle. Image Processing, Analysis, and Machine Vision. Second Edition.Brooks/Cole Publishing Company, 1999.

3.

Z. Zhang, R. Deriche, O. Faugeras, Q. T. Luong. A Robust Technique for Matching Two Uncalibrated Images Through the Recovery of the Unknown Epipolar Geometry. INRIA No2273 1994.

4.

R. Hartley, A Zisserman. Multiple View Geometry in computer vision. Cambridge University Press, 2000.

5.

A. Fusiello, V. Roberto, E. Trucco. Symmetric Stereo with Multiple Windowing. International Journal of Pattern Recognition and Artificial Intelligence, Vol.14, No 8. 2000.

6.

A. Baumberg. Reliable Feature Matching Across Widely Separated Views. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2000.

7.

M. Lhuillier. Efficient Dense Matching for Textured Scenes Using Region Growing. Proc. 9th British Machine Vision Conference, Southampton, 1998.

8.

D.Chetverikov, Z.Megyesi, Zs.Janko and J.Matas. Using Periodic Texture as a Tool for Wide-Baseline Stereo. Proc. 26th Workshop of the Austrian Association for Pattern Recognition, Graz, 2002, pp.37-44. ISBN 3-85403-160-0

9.

J. Shi, C. Tomasi. Good Features to Track. IEEE Conference on Computer Vision and Pattern Recognition (CVPR94) Seattle, 1994.

10. T. Tuytelaars, L. Van Gool: Wide Baseline Stereo Matching based on local, affinely invariant regions. British Machine Vision Conference, pp. 412-422, 2000. 11. T. Tuytelaars, L. Van Gool: Content-Based Image Retrieval Based on Local Affinely Invariant Regions. International Conference on Visual Information Systems, pp. 493-500, 1999. 12. A. W. Gruen. Adaptive Least Squares Correlation: A Powerful Image Matching Technique. South African Journal of Photogrammetry, Remote Sensing, and Cartography, 1985, Vol. 14, No. 3 13. D. Scharstein and R. Szeliski. A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms. IJCV 47(1/2/3):7-42, 2002. 14. I. Pitas. Digital Image Processing Algorithms. Prentice Hall International (UK) Ltd, 1993.

Affine dense matching for wide baseline stereo - CiteSeerX

This process is based on building the epipolar geom- etry from a number of ... First we calculate the epipolar geometry to create rectified images, on which we .... heavy computation cost of finding the right transformation, the implementation of ...

613KB Sizes 3 Downloads 240 Views

Recommend Documents

Affine dense matching for wide baseline stereo - CiteSeerX
Categories and Subject Descriptors (according to ACM CCS): I.4.8 [Image Processing and Computer Vision]: Scene. Analysis. 1. Introduction. Reconstructing a ...

Affine dense matching for wide baseline stereo
example on selected seed-points is shown in figure 3. Finally, if we ... original image pair (top row), samples from the rectified image pair .... Second Edi- tion.

Locally Affine Sparse-to-Dense Matching for Motion ...
age caused by camera motion or the moving objects in the scene, and has been ..... i of Ki is found (S3) and the current wi is updated. (S4). Eq. 5 guarantees that ...

Fast wide baseline matching for visual navigation - Computer Vision ...
should support their identification from different, but nev- ... Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern ...

Dense point cloud acquisition via stereo matching ...
and laser scanner technologies and it is now clear that DStM offers more than an alternative. Lately ... calibrated the camera (body and optics). The knowledge of.

Dense point cloud acquisition via stereo matching applied to: the ...
N.N. and N.N. (Editors). Dense point cloud acquisition via stereo matching applied to: the Kilwa archaeological site and the Gallo-Roman theatre of Mandeure.

A Fast Line Segment Based Dense Stereo Algorithm ...
problem stems from the pixel-based tree construction. Many edges in ... Stereo correspondence has been one of the most important problems in computer vision ...

a foveal architecture for stereo matching - Laboratorio de Señales
[1] Martin D. Levine, Vision in man and machine. Addison,. 1984. [2] Santiago Ramón y Cajal, Recuerdos de mi vida: Historia de mi labor científica. Alianza, Madrid, 1981. [3] R. E.Cummings, J. Van der Spiegel, P. Mueller, and M. Z.. Zhang, "A fovea

A Fast Line Segment Based Dense Stereo Algorithm ...
Intitute of HCI and Media Integration, Key Lab of Pervasive Computing(MOE). 3-524, Fit building, Tsinghua University, Beijing 100084, P.R. China ... survey by Scharstern and Szeliski [1] and the one by Brown et al. [2]. ..... Two registers stores.

a foveal architecture for stereo matching
[3] R. E.Cummings, J. Van der Spiegel, P. Mueller, and M. Z.. Zhang, "A foveated silicon retina for two-dimensional tracking" IEEE trans. on CAS-II, vol. 47, no. 6, pp. 504-517,. 2000. [4] S.S. Young, P.D. Scott and C Bandera, “Foveal automatic tar

a foveal architecture for stereo matching - Laboratorio de Señales
INTRODUCTION. The idea of a fovea as a spot of higher resolution in computer images is inspired in biological vision [1], [2]. For vertebrates, the distribution of photoreceptors in the retina is not uniform but concentrated on a small area centralli

A Fast Line Segment Based Dense Stereo Algorithm Using Tree ...
correspondence algorithm using tree dynamic programming (LSTDP) is ..... Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame.

Using Fuzzy Logic to Enhance Stereo Matching in ...
Jan 29, 2010 - Stereo matching is generally defined as the problem of discovering points or regions ..... Scheme of the software architecture. ..... In Proceedings of the 1995 IEEE International Conference on Robotics and Automation,Nagoya,.

Dynamic Surface Matching by Geodesic Mapping for 3D ... - CiteSeerX
point clouds from scanner data are registered using a ran- domized feature matching ..... tion Technology for Convivial Society”. References. [1] N. Ahmed, C.

Enforcing System-Wide Control Flow Integrity for Exploit ... - CiteSeerX
Adobe Reader. 8.1.1 .... Adobe flash, IE, Google Chrome and so on), no more than. 1000 files were ..... machine introspection via online kernel data redirection.

Enforcing System-Wide Control Flow Integrity for Exploit ... - CiteSeerX
of whitelist based and shadow call stack based approaches to monitor ... provide a proof-of-concept implementation of Total-CFI on. DECAF ... trospection, Vulnerability Detection, Software Security .... volving multiple processes/services. ... Knowin

On Global Controllability of Affine Nonlinear Systems with ... - CiteSeerX
We may call the function g1(x)f2(x) − g2(x)f1(x) as the criterion function for global ..... Here, we make a curve γ2 connecting (n − 1)-time smoothly γ1 and the ..... Control Conference, South China University of Technology Press, 2005, pp.

On Global Controllability of Affine Nonlinear Systems with ... - CiteSeerX
We may call the function g1(x)f2(x) − g2(x)f1(x) as the criterion function for global controllability of the system (2.2) ..... one side of the straight-line which passes through the point x0 with direction g(x0. ); ..... bridge: MIT Press, 1973. 1

Consultancy Service for the Stakeholder Satisfaction Baseline ...
2.The schedule for Submission and Opening of Technical Bids shall be moved from August 12,2015 to Auqust 19,. Very truly yours,. Bids and Awards Committee. Signature over printed name. Page 1 of 1. Bid Bulletin 2 - Consultancy Service for the Stakeho

Optimal Subpixel Matching of Contour Chains and ... - CiteSeerX
contours and does not call for any other constraint, so that it is particularly suitable ... mented in the same way (di erent number of segments with their endpoints at di ..... In Conference on Computer Vision and Pattern. Recognition, pages 202 ...

Distinct, genome-wide, gene-specific selectivity patterns of ... - CiteSeerX
Nov 4, 2014 - Yu EJ, Kim S-H, Kim MJ, Seo W-Y, Song K, Kang. M-S, Yang CK, Stallcup MR, Kim JH: SUMOylation of. ZFP282 potentiates its positive effect ...

Baseline Study - Pioneering, Connecting & Empowering Voices for ...
Baseline Study - Pioneering, Connecting & Empowering Voices for Change.pdf. Baseline Study - Pioneering, Connecting & Empowering Voices for Change.pdf.