A FOVEAL ARCHITECTURE FOR STEREO MATCHING Alfredo Restrepo Palacios and Javier Villegas Plazas [email protected], [email protected], Laboratorio de Señales, Dpt. Ing. Eléctica y Electrónica Universidad de los Andes, Bogotá, Colombia

Abstract. We propose a method for the matching of characteristics of stereo images based on the concepts of an initial global match and the foveation of the images. We present several macular tilings that can be used in the foveation process. Given a pair of cameras that image a static scene, vergence is achieved at a minimun of the mse difference between the two corresponding macular images. 1. INTRODUCTION The idea of a fovea as a spot of higher resolution in computer images is inspired in biological vision [1], [2]. For vertebrates, the distribution of photoreceptors in the retina is not uniform but concentrated on a small area centrallis, also called macula because of its yellowish appearance in mammals. The macula contains an even smaller area on a depresion called the fovea. In the macula, the concentration of photoreceptors for humans is about 150.000 per mm2 while for birds of pray, it can be as high as 1,000,000 per mm2 . This macular or foveal distribution of photoreceptors may have a bearing in the stereo perception of depth in vertebrates. In computer vision, a macular image can be obtained in several ways: an optical possibility uses a lens while, at acquisition level, the CCD device could be foveated [3], etc. We foveate the images after they have been acquired. The problem of correspondence is central to the field of computer vision, as it appears in the cases of motion estimation, target tracking and stereo vision. It is a hard problem, with multiple solutions. We implement a solution to the problem of stereo matching using macular (i.e. foveated, see Fig. 3) images and the concept of vergence for a global match. The macularization of the images speeds up both an initial global match, and the matchings of the intensity profiles by diminishing the number of pixels involved.

Figure 1. Image showing the right eye (camera) moving until vergence

When both eyes are looking at the same thing it is natural to think that the corresponding images are very similar. We define a function of dissimilarity between images, with variable the angle of the triangle at the right eye. The criterion of similarity is given by the nearness to zero of the sum of the pixelwise squared difference of the images: D(θl , θr) :=

2

Keeping the angle θl fixed, as θr varies, different degrees of dissimilarity are obtained. For most common scenes, when both eyes are focused on the same point of a scene the dissimilarity is minimal, as shown in Fig. 2 where a normalized dissimilarity is plotted. More important, because of the reduction in the number of pixels involved, the angle for a minimum of dissimilarity of the images is the same whether the images are macular or not. In turn, a minimum of dissimilarity corresponds to a maximum of the correlation of the images since xy = 0.5(x2 + y 2 - (x-y) 2 ). and the intensity values are nonnegative (disregarding border effects).

2. INITIAL GLOBAL MATCH Consider the angles at the base of the triangle formed by the pupils and the object of fixation (Fig. 1). More specifically, assume that the left eye is fixed on an object in the scene and that the right eye moves until vergence.

0-7803-7622-6/02/$17.00 ©2002 IEEE

∑ ( I θl − I θt )

II - 521

IEEE ICIP 2002

Figura 3.b. Radially blurred image . Figure 2. Dissimilarity graph .

Depending among other things on the relative numbers of pixels in the fovea and in the periphery, global similarity may correspond to either macular correspondence or peripheral correspondence. 3. MACULAR IMAGES If, as we move away from the center of an image (Figs. 3.a and 3.c), regions of increasing size are replaced by their average, we get a radially blurred image as in Fig. 3.b or Fig 3.d. If each of the regions averaged is then replaced by a single pixel, as in Fig. 3.e we get a macular (or foveated) image. The shape, size and position of the regions that are averaged in order to get a macular version of an image, determine a macular tiling in image space . See Fig. 4.

Fig. 3.c. Log original image

Figura 3.d. Log Radially blurred image .

Fig. 3.a. Original image

Figura 3.e. macular and log macular images

II - 522

A common choice [4], [5] for the shape of the tiles, is a square, with a side that doubles each time they increase in size, see Fig. 4.a. At the central region, where the tiles have a minimal size, the resolution is like that of the original image. This region is called the macula, or fovea, the remaining pixels are said to form the periphery. For a generating tile consisting of a "cross" of 12 pixels, the macular tiling of Fig. 4b results. Given the more circular nature of these tiles, the macularization process is considered to be more natural. Also, the way they tesselate the plane is topologically the same as the way hexagons do. On a tiling based on squares, it is possible to vary the rate of loss of resolution and to keep horizontal uniformity, see Fig. 5c. Also, if one is interested in decreasing the size of the tiles with the square of the radial distance, a logarithmic tiling may be used. See Fig. 6

-cFig. 5. Rectangular macular tilings with horizontal compatibility.

-a-

-b-

Fig. 4. Two posible macular tilings and their generating tiles.

Several macular versions of an image may be obtained by slightly translating or rotating the macular tiling. This allows for an overlapped multiresolution analysis of the image which retains some of the continuity lost when down sampling the image. 4. STEREO MATCHING Using a macular tiling such as that in Fig. 5.c, both, the left and the right images are foveated. The left camera images the scene of interest and is kept fixed while the right camera is rotated until vergence (as indicated in Section II) is reached. On the resulting pair of stereo images, the macular tiling determines a collection of pairs of horizontal lines, with corresponding intensity profiles, on which the process of matching pixels is made. Each pair of intensity profiles is used as input to the matching algorithm, which is of the Viterbi type [6]. Initially, a robust detection of local maxima and local minima is made on the profiles. A pixel is a local maximum (resp. minimum) if it minus (resp. plus) a threshold, is larger (resp. smaller) than the remaining pixels in the window. The cost of matching two pixels is

II - 523

Fig 6. Logarithmic tiling map

set low if both are minima or if both are maxima; otherwise, the cost is proportional to the difference in gray levels. The output of this matcher is given as an input to a depth reconstruction algorithm for vergent cameras. After all rows are considered, a 3D signal is obtained (Fig. 8). It is useful to know how different two corresponding profiles can be. Suppose there are two points in 3D space that when projected onto the retinae produce two image points on one retina and one image point on the other: this is caused by occlusion with an opaque object. If the object is transparent a reversal in the ordering of three image points results (Fig. 7). In order to have a simpler matching algorithm it is assumed here that no transparent objects are present in the scene and that no occlusion occurs .

1

3 3

a

2

2

1

β

b

a

γ

β

c

α

c

α

b

-a-

γ

-b-

Fig. 7. In case b there is inversion of the ordering of the image points, note that for this to occurr, either the points belong to different objects, as in a cloud of flies, or the points belong to a single solid but transparent object.

Fig. 8.b. 3D reconstruction corresponding to previous image

Fig. 8.a. Original image

REFERENCES [1] Martin D. Levine, Vision in man and machine. Addison, 1984. [2] Santiago Ramón y Cajal, Recuerdos de mi vida: Historia de mi labor científica. Alianza, Madrid, 1981. [3] R. E.Cummings, J. Van der Spiegel, P. Mueller, and M. Z. Zhang, "A foveated silicon retina for two-dimensional tracking" IEEE trans. on CAS-II, vol. 47, no. 6, pp. 504-517, 2000 [4] S.S. Young, P.D. Scott and C Bandera, “Foveal automatic target recognition using a multiresolution neural network,” IEEE trans on IP . vol 7, no. 8, pp. 1122-1135, 1998. [5] Marc Bolduc and Martin D. Levine, “A review of biologically-motivated space-variant data reduction models for robotic vision,” TR-CIM-95-05. Centre for Intelligent Machines. McGill University, 1996. [6] H. L. Lov, "Implementing the Viterbi algorithm" IEEE Signal processing magazine vol 12 no 5, pp 42-52 1995.

II - 524

a foveal architecture for stereo matching

[3] R. E.Cummings, J. Van der Spiegel, P. Mueller, and M. Z.. Zhang, "A foveated silicon retina for two-dimensional tracking" IEEE trans. on CAS-II, vol. 47, no. 6, pp. 504-517,. 2000. [4] S.S. Young, P.D. Scott and C Bandera, “Foveal automatic target recognition using a multiresolution neural network,”. IEEE trans on IP . vol 7, ...

811KB Sizes 0 Downloads 198 Views

Recommend Documents

a foveal architecture for stereo matching - Laboratorio de Señales
[1] Martin D. Levine, Vision in man and machine. Addison,. 1984. [2] Santiago Ramón y Cajal, Recuerdos de mi vida: Historia de mi labor científica. Alianza, Madrid, 1981. [3] R. E.Cummings, J. Van der Spiegel, P. Mueller, and M. Z.. Zhang, "A fovea

a foveal architecture for stereo matching - Laboratorio de Señales
INTRODUCTION. The idea of a fovea as a spot of higher resolution in computer images is inspired in biological vision [1], [2]. For vertebrates, the distribution of photoreceptors in the retina is not uniform but concentrated on a small area centralli

Affine dense matching for wide baseline stereo - CiteSeerX
This process is based on building the epipolar geom- etry from a number of ... First we calculate the epipolar geometry to create rectified images, on which we .... heavy computation cost of finding the right transformation, the implementation of ...

Affine dense matching for wide baseline stereo - CiteSeerX
Categories and Subject Descriptors (according to ACM CCS): I.4.8 [Image Processing and Computer Vision]: Scene. Analysis. 1. Introduction. Reconstructing a ...

Affine dense matching for wide baseline stereo
example on selected seed-points is shown in figure 3. Finally, if we ... original image pair (top row), samples from the rectified image pair .... Second Edi- tion.

A Program Behavior Matching Architecture for ...
program such as e2fsck to restore consistency. Journaling has three well-known modes of operation: •. Journal Mode: Both file system data and metadata are.

Using Fuzzy Logic to Enhance Stereo Matching in ...
Jan 29, 2010 - Stereo matching is generally defined as the problem of discovering points or regions ..... Scheme of the software architecture. ..... In Proceedings of the 1995 IEEE International Conference on Robotics and Automation,Nagoya,.

Dense point cloud acquisition via stereo matching applied to: the ...
N.N. and N.N. (Editors). Dense point cloud acquisition via stereo matching applied to: the Kilwa archaeological site and the Gallo-Roman theatre of Mandeure.

Dense point cloud acquisition via stereo matching ...
and laser scanner technologies and it is now clear that DStM offers more than an alternative. Lately ... calibrated the camera (body and optics). The knowledge of.

A Stereo Audio Σ∆ ADC Architecture with Embedded ...
INTERNATIONAL TEST CONFERENCE. 1 ... A Stereo Audio Σ∆ ADC Architecture with Embedded SNDR Self-Test ... out in order to find BIST solutions for ADCs.

Foveal Algorithm for the Detection of Microcalcification ... - Springer Link
Ewert House, Ewert Place, Summertown, Oxford OX2 7BZ, UK ... present in about a quarter of the total number of screening mammograms. For these reasons ... microcalcifications are expected to appear bright in a mammogram, indeed to be.

A faster algorithm for finding optimal semi-matching
Sep 29, 2007 - CancelAll(N2). Figure 2: Divide-and-conquer algorithm. To find the min-cost flow in N, the algorithm use a subroutine called CancelAll to cancel.

Computational Stereo
Another advantage is that stereo is a passive ... Computing Surveys, VoL 14, No. 4, December ...... conditions, cloud cover present in one im- age and not in the ...

Computational Stereo
For correspondence: S. T. Barnard, Artificial Intelligence Center, SRI ... notice is given that copying is by permission of the Association for Computing Machinery. To ... 3.2 Control Data CorporatJon ...... conditions, cloud cover present in one im-

A Semantic-Based Ontology Matching Process for PDMS
and Ana Carolina Salgado1. 1 Federal University of ..... In: International Conference on Management of Data (SIGMOD), Software. Demonstration (2005). 7.

A Tool for Matching Ontology-based Schemas
matching techniques have been used to determine schema mappings between .... the semantic correspondence's weight, database credentials, and linguistic- ...

A Declarative Framework for Matching Iterative and ...
effective pattern matching in modern applications. A language for de- ..... “tick-shape” pattern is monitored for each company symbol over online stock events, see rules (1). ..... graphcq: Continuous dataflow processing for an uncertain world.

Satellite matching for a federated ground station network
May 29, 2008 - there is an effort to develop a federated ground station network which will ..... approach to monitor and control for deep space communications.

A New Point Pattern Matching Method for Palmprint
Email: [email protected]; [email protected]. Abstract—Point ..... new template minutiae set), we traverse all of the candidates pair 〈u, v〉 ∈ C × D.

Uncoupled Antenna Matching for Performance ...
Department of Electrical and Information Technology. Lund University ... Such systems perform best ... First, we increase the degrees of freedom for uncoupled.

Clustering and Matching Headlines for Automatic ... - DAESO
Ap- plications of text-to-text generation include sum- marization (Knight and Marcu, 2002), question- answering (Lin and Pantel, 2001), and machine translation.

Uncoupled Antenna Matching for Performance ...
Optimization in Compact MIMO Systems using. Unbalanced ... For example, a large capacity gain ... for multiple antenna systems since strong coupling results in.

Fast Global Labeling for Real-Time Stereo Using ...
Aug 26, 2008 - an appropriate directed graph and determining the minimum cut, a whole class of ... message passing [10, 22, 9]. ... metric, and a different solution is required. ..... Performing local stereo with cost windows aligned with an ...

Underwater Stereo Vision and vSLAM Support for the ...
Digital Compass- OS5000-USD. • Cameras- Allied Vision Prosilica GC750 (x3) two front-facing, one down-facing. • Batteries- Thunder Power RC G4 Pro Power 45C. • Hydrophones- Aquarian H1a. • Depth Sensor- SSI part P51-15-S-B-I36-4.5V-R. Stingra