ENVIRONMETRICS Environmetrics 2008; 19: 347–368 Published online 7 August 2007 in Wiley InterScience (www.interscience.wiley.com) DOI: 10.1002/env.881

Improved likelihood inference for the roughness parameter of the GA0 distribution Michel Ferreira da Silva1 , Francisco Cribari-Neto2 and Alejandro C. Frery3∗,† 1 Departamento

de Estat´ıstica, ICEX, Universidade Federal de Minas Gerais, Brazil de Estat´ıstica, CCEN, Universidade Federal de Pernambuco, Brazil 3 Instituto de Computa¸ ca˜ o, Universidade Federal de Alagoas, BR 104 Norte km 97, 57072-970 Macei´o, AL, Brazil 2 Departamento

SUMMARY This paper presents adjusted profile likelihoods for α, the roughness parameter of the GA0 (α, γ, L) distribution. This distribution has been widely used in the modeling, processing and analysis of data corrupted by speckle noise, e.g., synthetic aperture radar images. Specifically, we consider the following modified profile likelihoods: (i) the one proposed by Cox and Reid, and (ii) approximations to adjusted profile likelihood proposed by Barndorff–Nielsen, namely the approximations proposed by Severini and one based on results by Fraser, Reid and Wu. We focus on point estimation and on signalized likelihood ratio tests, the parameter of interest being the roughness parameter that indexes the distribution. As far as point estimation is concerned, the numerical evidence presented in the paper favors the Cox and Reid adjustment, and in what concerns signalized likelihood ratio tests, the results favor the approximation to Barndorff–Nielsen’s adjustment based on the results by Fraser, Reid and Wu. An application to real synthetic aperture radar imagery is presented and discussed. Copyright © 2007 John Wiley & Sons, Ltd. key words: adjusted profile likelihood; image understanding; likelihood ratio test; profile likelihood; speckle noise; synthetic aperture radar

1. INTRODUCTION Imagery obtained with coherent illumination suffers from a noise known as speckle. This is the case of laser, sonar, ultrasound-B, and synthetic aperture radar (SAR) images. The noise does not follow the classical Gaussian additive structure, being multiplicative in nature. Classical techniques for image analysis are thus inefficient for extracting information from speckled data (see, for instance, Allende et al., 2001, 2006; Bustos et al., 2002; Medeiros et al., 2003). In particular, SAR sensors are becoming progressively more used in all areas that employ remotely sensed data, since they are active and therefore do not require external sources of illumination. They can image the environment in a wavelength that is little or not at all affected by weather conditions ∗ Corresponding to: A. C. Frery, Instituto de Computac ¸ a˜ o, Universidade Federal de Alagoas, BR 104 Norte km 97, 57072-970 Macei´o, AL, Brazil. † E-mail: [email protected]

Copyright © 2007 John Wiley & Sons, Ltd.

Received 27 April 2006 Accepted 6 July 2007

348

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

and provide complementary information to other sensors (optical, infrared etc.). The information obtained from SAR sensors is relevant for all remote sensing applications, including environmental studies, anthropic activities, oil spill monitoring, disaster assessment, reconnaissance, surveillance, and targeting, among others. Since speckle noise hampers the ability to identify objects, many techniques have been proposed to reduce such a noise, but a successful approach consists of devising techniques that can cope with it, mainly statistical procedures (see, for instance, Gambini et al., 2006). Only univariate signals will be discussed here; the reader interested in multivariate SAR statistical modeling is referred to Freitas et al. (2005). Goodman (1985) provided one of the first rigorous statistical frameworks, known as the ‘Multiplicative Model,’ for dealing with speckle noise in the context of laser imaging. The use of such a framework has led to successful techniques for SAR data processing and analysis. This phenomenological model states that the observation in every pixel is the outcome of a random variable Z :  → R+ which is, in turn, the product of two independent random variables: X :  → R+ , the ground truth or backscatter, related to the intrinsic dielectric properties of the target, and Y :  → R+ , the speckle noise, which follows a square root of gamma law. The distribution of the return, Z = XY , is completely specified by the distributions of X and Y. The univariate multiplicative model began as a single distribution, namely the Rayleigh law, was extended by Yueh et al. (1989) to accommodate the K law and was later further improved by Frery et al. (1997) to the G distribution, which generalizes the previous probability distributions. The GA0 law is an important particular case of the more general G distribution. It can successfully model a wide range of targets through their roughness. If Z is a GA0 (α, γ, L)-distributed random variable, then its probability density function is p(z; α, γ, L) = p(z) =

2LL (L − α)z2L−1 γ α (L)(−α)(γ + Lz2 )L−α

with −α, γ, z ≥ 0 and L ≥ 1. Its cumulative distribution function is given by F (z) = ϒ2L,−2α (−αz2 /γ), where ϒ2L,−2α (·) is the cummulative distribution function of a F2L,−2α -distributed random variable. The parameter α is directly related to the roughness of the target. For typical sensors and scenes, if α ≤ −10 then the area is homogeneous (usually crops or pastures), if −10 < α ≤ −5 then the region is heterogeneous (usually forests or undulated relief), and −5 < α < 0 is associated with extremely heterogeneous targets (usually urban areas). The scale parameter γ can be viewed as a nuisance parameter, and L, the number of looks, is directly related to the signal-to-noise ratio (the smaller L, the noisier the image). The latter can be controlled to some extent either in the early stages of the raw data processing or through filters, but at the expense of losing spatial resolution. Airborne SAR systems can achieve resolutions of the order of centimeters, which partially explains their large impact in contemporary remote sensing. Regarded as a parameter, L can be estimated using homogeneous targets, the estimate being valid for the entire image. It will be assumed known in our study. Relevant information can be extracted by estimating α and γ as, for instance, thematic maps (see Mejail et al., 2003) and maximum a posteriori filters (Moschetti et al., 2006). Recent research has focused on improved estimation through data resampling (Cribari-Neto et al., 2002) and via secondorder bias correction (Vasconcellos et al., 2005). Robust estimators have also been proposed for the parameter estimation of speckled data (Allende et al., 2006; Bustos et al., 2002).

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

349

1.5

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

1.0 0.0

0.5

p(z)

(−1, 0.405, 1) (−8, 7.882, 3) (−15, 14.704, 8)

0

1

2

3

4

z

Figure 1.

Probability density functions of the GA0 (−1, 0.405, 1) (solid), GA0 (−8, 7.882, 3) (dashed), and GA0 (−15, 14.704, 8) (dotted) distributions

The GA0 law does not belong to the exponential family, and maximum likelihood estimators are not minimal sufficient statistics for (α, γ). The rth order moment of a random variable obeying the GA0 (α, γ, L) law is E{zr } =

 γ r/2 (−α − r/2)(L + r/2) L (−α)(L)

if −r/2 > α, and ∞ otherwise. Figure 1 shows three GA0 (α, γ, L) densities: (α, γ, L) = (−1, 0.405, 1), (−8, 7.882, 3), and (−15, 14.704, 8). It illustrates, for different numbers of looks, extremely heterogeneous, heterogeneous, and homogeneous targets, respectively. The parameters were chosen so that the distribution means are equal to 1. A complete account of this distribution and its properties can be found in Frery et al. (1997) and Mejail et al. (2003). Mejail et al. (2001) provide details about its relationship to other distributions. This paper presents two new results regarding inference under the GA0 model, namely, we obtain analytically improved parameter estimators and develop improved one-sided likelihood ratio inference. Improved parameter estimation is achieved by maximizing an adjusted profiled likelihood function (Cox and Reid, 1987, 1989; Fraser and Reid, 1995; Fraser et al., 1999). We also develop one-sided improved likelihood ratio inference for the GA0 roughness parameter. We follow Sartori et al. (1999)

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

350

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

and consider tests based on the Barndorff-Nielsen (1980, 1983) adjusted profile likelihood function. The chief goal of such inference lies in identifying whether a given scanned region is extremely heterogeneous, heterogeneous, or homogeneous. This kind of analysis that turns data into valuable information for decision making is one of the ultimate goals of environmental studies. The remainder of the paper unfolds as follows. Section 2 presents the profile likelihood function and its properties, and it discusses the Barndorff-Nielsen (1980, 1983) adjustment and some alternative adjusted profile likelihoods, such as the Cox and Reid (1987) adjustment. In Section 3 we derive such adjustments for inference on the roughness parameter α. Monte Carlo results are presented in Section 4, and numerical examples with real data sets are presented in Section 5. Finally, Section 6 concludes the paper.

2. PROFILE AND MODIFIED PROFILE LIKELIHOOD Let Y = (y1 , . . . , yn ) be an n-vector of independent random variables, each following a distribution that is indexed by 2 (possibly vector-valued) parameters: ν and µ. Suppose that the interest lies in performing inference on µ in the presence of the nuisance parameter ν. It is sometimes possible to perform inference on µ using a marginal or a conditional likelihood function. Nevertheless, oftentimes such functions cannot be obtained. The standard approach is to use the profile likelihood function, which is defined as Lp (µ) = L(ˆνµ , µ), where L(·) is the usual likelihood function and νˆ µ is the maximum likelihood estimate of ν for a given, fixed µ. The usual likelihood ratio statistic,     LR(µ) = 2 (ˆν, µ) ˆ − (ˆνµ , µ) = 2 p (µ) ˆ − p (µ) is based on the profile likelihood function. Here, µ ˆ and νˆ are the maximum likelihood estimates of µ and ν, respectively, (·) is the log-likelihood function, and p (·) is the profile log-likelihood function. It is important to note, however, that Lp (·) is not a genuine likelihood and that the profile score and information biases are only guaranteed to be O(1). Several different adjustments to the profile likelihood function were proposed; see, e.g., Severini (2000, Chapter 9). Barndorff-Nielsen (1983) modified profile likelihood is obtained as an approximation to a marginal or to a conditional likelihood for µ, if either exists. In both cases, one uses the p∗ formula (Barndorff-Nielsen, 1980) to approximate the probability density function of the maximum likelihood estimator conditional on an ancillary statistic. The corresponding modified profile likelihood is    ∂ˆνµ −1    jνν (ˆνµ , µ)−1/2 Lp (µ) LBN (µ) =   ∂ˆν where jνν (ν, µ) = −∂2 /∂ν∂ν is the observed information for ν. The score and information biases are of order O(n−1 ), and LBN (µ) is invariant under reparameterizations of the form (ν, µ) → (λ, ξ), where λ = λ(ν, µ) and ξ = ξ(µ). The main difficulty in computing LBN (µ) lies in obtaining |∂ˆνµ /∂ˆν|. There is an alternative expression for LBN (µ) that does not involve this term, but it involves a sample space derivative of the log-likelihood function and the specification of an ancillary a such that (ˆν, µ, ˆ a) is a minimal sufficient statistic. It can

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

351

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

be shown that ∂ˆνµ = jνν (ˆνµ , µ; νˆ , µ, ˆ a)−1 ν;ˆν (ˆνµ , µ; νˆ , µ, ˆ a) ∂ˆν where ˆ a) = ν;ˆν (ˆνµ , µ; νˆ , µ,

∂ ∂ˆν



∂(ˆνµ , µ; νˆ , µ, ˆ a) ∂ν



Here, (ˆνµ , µ; νˆ , µ, ˆ a) and jνν (ˆνµ , µ; νˆ , µ, ˆ a) are the log-likelihood function and the observed information for ν, respectively; they depend on the data only through the minimal sufficient statistic. An approximation to ν;ˆν (ˆνµ , µ; νˆ , µ, ˆ a) can be obtained based on the population covariance between ν (ν, µ) and ν (ν0 , µ0 ). Severini (1998) proposed the following approximation to the modified profile log-likelihood function: BN (µ) = p (µ) +

    1 log jνν (ˆνµ , µ) − logIν (ˆνµ , µ; νˆ , µ) ˆ  2

where Iν (ν, µ; ν0 , µ0 ) = E(ν0 ,µ0 ) {ν (ν, µ)ν (ν0 , µ0 ) }, with ν (ν, µ) = ∂/∂ν. Note that ˆ does not depend on the ancillary statistic a and that Iν (ν, µ; ν0 , µ0 ) is the covariance Iν (ˆνµ , µ; νˆ , µ) between ν (ν, µ) and ν (ν0 , µ0 ). An alternative approximation to Barndorff-Nielsen (1983) modified profile likelihood function, say ˘ BN , was proposed by Severini (1999); it was obtained replacing I(ν, µ; ν0 , µ0 ) by ˘ µ; ν0 , µ0 ) = I(ν,

n 

ν (ν, µ)ν (ν0 , µ0 ) (j)

(j)

j=1 (j)

(j)

(j)

where θ (θ) = (ν (θ), µ (θ)) is the score function for the jth observation. This approximation is particularly useful when the computation of expected values of products of log-likelihood derivatives is cumbersome. A third approximation to ν;ˆν (ˆνµ , µ; νˆ , µ, ˆ a) can be obtained through an approximately ancillary statistic (Fraser and Reid, 1995; Fraser et al., 1999; Severini, 2000). The resulting log-likelihood function, ˜ BN , can be written as 1 ˜ BN (µ) = p (µ) + log |jνν (ˆνµ , µ)| − log |ν;Y (ˆνµ , µ)Vˆ ν | 2 where ν;Y (ν, µ) = ∂ν (ν, µ)/∂Y and  ν,µ)/∂ˆ ˆ ν 1 ;ˆ Vˆ ν = − ∂Fp1 (y ν,µ) ˆ 1 (y1 ;ˆ

ˆ ν · · · − ∂Fpn n(y(yn n;ˆν;ˆν,µ)/∂ˆ ,µ) ˆ



pj (y; ν, µ) being the probability density function of yj and Fj (y; ν, µ) being the cumulative distribution ˜ˆ BN . The construction of the matrix function of yj . The corresponding estimator shall be denoted as µ ˆ Vν is based on an approximately ancillary statistic (see Severini, 2000, p. 216). Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

352

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

Suppose that the parameters are orthogonal, that is, that the elements of the score vector, ∂/∂µ and ∂/∂λ, are uncorrelated, where λ = λ(µ, ν). Cox and Reid (1987) proposed an adjustment that can be applied to the profile likelihood function in this setting. It is an approximation to a conditional probability density function of the observations given the maximum likelihood estimator of λ and can be written as LCR (µ) = |jλλ (λˆ µ , µ)|−1/2 Lp (µ). The modified profile log-likelihood function is CR (µ) = p (µ) − 21 log |jλλ (λˆ µ , µ)|. The maximizer of CR (µ) shall be denoted as µ ˆ CR . The score bias is O(n−1 ) but, in general, the information bias remains O(1). Cox and Reid (1987) assumed orthogonality between µ and λ. It is not always possible, however, to find an orthogonal parameterization. Additionally, their adjustment is not invariant under reparameterizations of the form (λ, µ) → (η, ξ), where η = η(λ, µ) and ξ = ξ(µ), unlike LBN (µ), for which the invariance property is guaranteed by the term |∂ˆνµ /∂ˆν|. Note that if νˆ µ = νˆ for all µ, then LBN (µ) = LCR (µ). In this case, µ and ν are orthogonal parameters: Cox and Reid (1987). Also, it is possible to show that BN (µ) − BN (µ) ˆ = CR (µ) − CR (µ) ˆ + Op (n−1 ). Cox and Reid (1989) suggested that one should obtain an orthogonal parameterization (µ, λ) under which the difference between λˆ µ and λˆ (the restricted and unrestricted maximum likelihood estimators of the nuisance parameter, respectively) is Op (n−3/2 ), instead of Op (n−1 ). When that holds, the modified profile likelihood function of Cox and Reid (1987) is equivalent to that of BarndorffNielsen (1983) to order Op (n−3/2 ) and, hence, the information bias of CR is O(n−1 ) (DiCiccio et al., 1996). It is not always possible to find such an orthogonal parameterization, and sometimes more than one parameterization is available. Cox and Reid (1989) have proposed a criterion for choosing a parameterization amongst several alternative orthogonal parameterizations. We have used it to obtain a version of CR whose maximum likelihood estimator of the roughness parameter proved to be more accurate than the usual maximum likelihood estimator. The choice is made for each given µ0 requiring that λ∗ = c i02 (µ0 , λ)/ i21 (µ0 , λ)dλ, where c is a constant suitably chosen and irs (µ, λ) = E{n−1 ∂r+s (µ, λ)/∂µr ∂λs }. We can also define λ∗ = h(λ) based on a non-orthogonal parameterization (µ, ν). Here, it is necessary to solve two equations, namely: ∂ν(µ, λ) = −i11 , ∂µ

  ∂ν(µ, λ) ∂ν(µ, λ) ∂2 ν(µ, λ) ∂ν(µ, λ) 2 i02 = c i21 + 2i12 − i02 + i03 ∂λ ∂µ ∂µ ∂µ2 i02

(1)

where, for simplicity, we have omitted the argument (µ, ν) from irs (µ, ν).

3. LIKELIHOODS FOR THE ROUGHNESS PARAMETER The log-likelihood function based on a random sample of size n, (z1 , . . . , zn ), from the GA0 (α, γ, L) distribution, apart from an unimportant constant, is (α, γ) = n log (L − α) − nα log γ − n log (−α) − (L − α)

n 

log(γ + Lz2i )

i=1

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

353

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

ˆ respectively, solve the following system of The maximum likelihood estimators of α and γ, αˆ and γ, non-linear simultaneous equations, where ψ(·) denotes the digamma function:

ˆ − ψ(L − α)] ˆ + n[ψ(−α)

n  i=1



γˆ + Lz2i log γˆ

=0

 nαˆ 1 ˆ =0 − (L − α) γˆ ˆ γ + Lz2i i=1 n



The maximum likelihood estimators do not have closed form and need to be obtained using a non-linear optimization method, such as Newton-Raphson, Fisher’s scoring, BHHH, or BFGS, the latter being a quasi-Newton method. The profile log-likelihood function is

p (α) = n log (L − α) − nα log γˆ α − n log (−α) − (L − α)

n 

log(γˆ α + Lz2i )

i=1

where γˆ α is the root of the equation  ∂(α, γˆ α ) nα 1 =− − (L − α) =0 ∂γ γˆ α γˆ α + Lz2i n

i=1

Following the definitions and notation of previous sections, we shall now obtain, in closed-form, the three approximations to BN (α) as a function of the pair (α, γˆ α ). Such functions do not require the use of an ancilar statistic nor an orthogonal parameterization. They are given by

    1 αˆ γˆ  γˆ  ˆ ˆ BN (α) = p (α) + log jγγ (α, γˆ α ) − log − H 1, L; α; H 1, L; 1 + α; 2 γˆ γˆ α γˆ α



α−L αˆ

ˆ ˆ (L − α) γˆ γˆ α L−α ˆ 1− π csc(απ) − log ˆ (L)(−α) γˆ α γˆ γˆ α 

 n ˆ (L − α)(L − α) 1 nααˆ  + ˘ BN (α) = p (α) + log jγγ (α, γˆ α ) − log − 2 γˆ γˆ α 2 (γˆ α + Lzi )(γˆ + Lz2i ) αˆ L − αˆ + − γˆ γˆ α − γˆ

i=1

and 

2  n  1 z i  ˜ BN (α) = p (α) + log jγγ (α, γˆ α ) − log(L − α) − log  2 γˆ α + Lz2i i=1

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

354

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

where  nα 1 − (L − α) 2 γˆ α (γˆ α + Lz2i )2 n

jγγ (α, γˆ α ) = −

i=1



H(a, b; c; t) =

(c)  (a + k)(b + k)t k (a)(b) (c + k)k! k=0

is the hypergeometric function and γˆ α satisfies ∂(α, γˆ α )/∂γ = 0. Due to numerical problems, in what follows we shall only use one of these approximations, namely, ˜ BN (α). We shall now consider the adjustment proposed by Cox and Reid (1987). To that end, an orthogonal parameterization was obtained following Cox and Reid (1989). The parameters α (interest) and γ (nuisance) that index the GA0 distribution are not orthogonal. By solving Equation (1), we obtain  γ = γ(α, λ) = −α

α α−L



1/L exp

c (L − α)2 α2



  2(α − 1) − (1 + 3L) λ L−α+2

As noted before, c is any constant conveniently chosen independently of λ. Thus, a family of parameterizations can be obtained by taking  c = α (L − α) 2

K

−1

2(α − 1) − (1 + 3L) L−α+2

where K ≥ 2 is a constant that should be determined empirically. As a consequence,   nα α (α, λ; K) = n log (L − α) − nα log(−α) − n log (−α) − log − nαλ(L − α)K−2 L α−L

 1/L n    α K−2 2 − (L − α) + Lzi log −α exp λ(L − α) α−L i=1

For a fixed value of the parameter of interest (α), the restricted maximum likelihood estimator of the nuisance parameter (λˆ α ) satisfies ∂(α, λˆ α ; K)/∂λ = 0. It does not have closed-form. The observed information relative to the parameter λ is 

1/L   α (L − α)2K−3 exp λ(L − α)K−2 α−L

−2  1/L n    α 2 K−2 2 × zi −α exp λ(L − α) + Lzi α−L

jλλ (α, λ; K) = −αL

i=1

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

355

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

Therefore, CR (α; K) = (α, λˆ α ; K) −

1 log jλλ (α, λˆ α ; K) 2

where λˆ α satisfies ∂(α, λˆ α ; K)/∂λ = 0.

4. MONTE CARLO RESULTS Images are richly structured data revealing the underlying classes, that turn into more or less discernible groups of values; these values can be displayed as shades of gray or as colors. A digital image is a function f : S → Kp , where S ⊂ Z2 is the (finite) support of the data, p ∈ N is the number of bands, and K ⊂ R is the set of possible values. Neighborhoods are usually squares of odd side, called ‘windows’, centered on the pixel being processed. The size of the neighborhood plays an important role in image processing; as a general rule, the bigger the window the more precise the estimation but, at the same time, the technique will be more prone to undesirable effects caused by contamination. In this context, contamination is the use of information from more than one class. Smaller windows are thus preferred in order to reduce contamination. The smallest non-trivial odd window is of size 3 × 3, but odd windows up to side 11 are frequently used. This defines the sample sizes that will be used in the following Monte Carlo experiments, namely, 25, 49, 81, and 121. The following values were used for (−α; L): (1; 1), (1; 3), (5; 3), (5; 8), (8; 3), (8; 8), (10; 3), (10; 8), (15; 3), (15; 8). We have then considered different degrees of target homogeneity (ranging from homogeneous to extremely heterogeneous targets), and also typical numbers of looks (1, 3, and 8). The value of the nuisance parameter was chosen as 

(−α)(L) γ=L (−α − 1/2) (L + 1/2)

2

so that the resulting GA0 -distributed random variable has unit mean. In what follows we shall present numerical results related to the point estimation of the roughness parameter α. All Monte Carlo results are based on 10 000 replications. Maximum likelihood estimators obtained from , ˜ BN and CR are considered. The numerical maximizations of  and ˜ BN were performed using the alternated algorithm proposed by Frery et al. (2004). The tables contain the following measures: true value; n; L; mean value; estimated bias; variance; mean squared error (MSE); relative bias (r.b. = 100 × (bias/parameter value)%); asymmetry; kurtosis. We have also performed one-sided signalized likelihood ratio tests on the roughness parameter using the test statistics   √  BN and signal(αˆ CR − α) LRCR ˆ signal(αˆ − α) LR, signal(α˜ BN − α) LR  BN , and LRCR are the likelihood ratio test statistics based on the profile likelihood and where LR, LR on the adjusted profile likelihoods ˜ BN and CR , respectively. We performed two tests, namely:

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

356

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

1. homogeneous and heterogeneous regions × extremely heterogeneous region: H0 : α ≤ −5

versus

H1 : −5 < α < 0

2. homogeneous region × heterogeneous, and extremely heterogeneous regions: H0 : α ≤ −10

versus

H1 : −10 < α < 0

The main goal here is to compare the finite-sample behavior of the different tests. The asymptotic null distribution of all test statistics is standard normal. The numerical results regarding the tests comprise: null rejection rates at the 10% and 5% nominal levels, mean and variance of the test statistics (and their respective asymptotic values), and nonnull rejection rates at the 5% nominal level. We also present quantile discrepancy plots where the differences between exact and asymptotic quantiles of the test statistics are plotted against the asymptotic quantiles, i.e., against standard normal quantiles. For instance, for the test based √ on  (profile likelihood), we denote the qth sample quantile of the corresponding test statistics as ± LR(q) and √the respective standard normal quantile as N(0, 1)(q), then the quantile discrepancy is computed as ± LR(q) − N(0, 1)(q). We note that the results relative to CR (α; K) were obtained by setting K = 2.5 and K = 4. These values were selected empirically and proved to deliver the most accurate inference. At the outset, we consider the situation where α = −1, i.e., we simulate observations on the return signal amplitude of an extremely heterogeneous region, e.g., an urban area. Table 1 presents descriptive statistics on different estimators of α, the roughness parameter. It is noteworthy that αˆ CR,K=4 , obtained from the maximization of CR (α) with K = 4, displayed the best finite-sample behavior, both in terms of bias and mean squared error. For a window of size 7 × 7 (n = 49) and number of looks (L) equal to 1, the least favorable situation, the relative bias of the estimator αˆ CR,K=4 (0.973%) was approximately 20 times smaller than that of the usual maximum likelihood estimator αˆ (20.349%). The mean squared errors of these estimators were 0.097 and 1.421, respectively, that is, the mean squared error of αˆ CR,K=4 ˆ the usual likelihood estimator of α. The skewness was over 14 times smaller than that of α, and kurtosis of αˆ CR,K=4 were −1.803 and 8.835, respectively, being closest to the corresponding asymptotic values (0 and 3); the skewness and kurtosis of αˆ were, respectively, −40.724 and 2467.124. Table 1.

Descriptive analysis of estimators for α = −1

(n, L)

Estimator

Mean

Variance

Bias

MSE

r.b. (%)

Skewness

Kurtosis

(49, 1)

αˆ α˜ˆ BN

−1.203 −1.156 −1.010 −1.105

1.380 0.431 0.097 0.193

−0.203 −0.156 −0.010 −0.105

1.421 0.455 0.097 0.204

20.349 15.556 0.973 10.471

−40.724 −12.049 −1.803 −3.344

2467.124 303.737 8.835 24.822

−1.198 −1.151 −1.066 −1.126

0.358 0.285 0.145 0.220

−0.198 −0.151 −0.066 −0.126

0.397 0.308 0.150 0.236

19.768 15.116 6.643 12.627

−6.553 −5.268 −2.120 −3.422

96.423 6.652 12.085 29.257

αˆ CR,K=4 αˆ CR,K=2.5 (25, 3)

αˆ α˜ˆ BN αˆ CR,K=4 αˆ CR,K=2.5

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

357

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

Table 2. (n, L) α = −5 (25, 8)

Estimator

Mean

Variance

Bias

MSE

r.b. (%)

Skewness

Kurtosis

αˆ α˜ˆ BN

−8.440 −7.998 −5.100 −6.326

426.668 424.218 4.909 101.089

−3.440 −2.998 −0.100 −1.326

438.502 433.203 4.919 102.849

68.802 59.950 2.001 26.530

−17.992 −18.215 −1.873 −40.517

415.553 422.298 8.536 2087.235

−6.921 −6.815 −4.763 −5.663

162.281 162.472 2.590 9.200

−1.921 −1.815 0.237 −0.663

165.970 165.766 2.647 9.639

38.415 36.295 −4.749 13.251

−23.567 −23.566 −1.777 −4.813

755.522 755.209 8.989 50.625

−10.606 −10.324 −7.828 −9.255

126.190 126.096 7.968 63.263

−2.606 −2.324 0.172 −1.255

132.983 131.496 7.997 64.838

32.580 29.049 −2.148 15.687

−16.621 −16.714 −1.661 −32.205

494.639 497.405 7.812 1700.708

−11.792 −11.648 −7.195 −9.471

318.949 319.535 6.063 137.024

−3.792 −3.648 0.805 −1.471

333.328 332.840 6.711 139.189

47.400 45.594 −10.065 18.389

−12.154 −12.151 −2.754 −21.475

217.989 217.739 41.184 654.171

αˆ CR,K=4 αˆ CR,K=2.5 (81, 3)

αˆ α˜ˆ BN αˆ CR,K=4 αˆ CR,K=2.5

α = −8 (49, 8)

αˆ α˜ˆ BN αˆ CR,K=4 αˆ CR,K=2.5

(121, 3)

Descriptive analysis of estimators for α = −5 and α = −8

αˆ α˜ˆ BN αˆ CR,K=4 αˆ CR,K=2.5

We shall now consider the case where α = −5, a value of the roughness parameter that is borderline between heterogeneous and extremely heterogeneous regions. Table 2 presents descriptive statistics related to the different estimators of α. Again, αˆ CR,K=4 displayed the best finite-sample behavior, both in terms of bias and mean squared error. For instance, when (n, L) = (81, 3), the mean squared error of αˆ CR,K=4 was 2.647, thus being 62 times smaller than that of the usual profile maximum likelihood estimator (165.970). Also, the absolute relative bias of the former (αˆ CR,K=4 ), 4.749%, is eight times ˆ Additionally, the skewness (−1.777) and kutosis (8.989) of αˆ CR,K=4 smaller than that of the latter (α). are closest to their asymptotic counterparts (0 and 3). Values of the roughness parameter (α) between −5 and −10 are typical of heterogeneous areas. Table 2 also presents numerical results related to point estimation of α when its true value is −8. Again, αˆ CR,K=4 was the best perfoming estimator. For example, when (n, L) = (49, 8), its mean squared error and relative bias were equal to 7.997 and −2.148%, respectively; the next best performing estimator was αˆ CR,K=2.5 (64.838 and 15.687%). The skewness (−1.661) and kurtosis (7.812) of αˆ CR,K=4 were, again, closest to the respective asymptotic values. We shall now move to the situation where α = −10, a value of the roughness parameter on the borderline between heterogeneous and homogeneous areas. Table 3 contains the numerical results relative to the point estimation of α. Once again the best performing estimator was αˆ CR,K=4 . When ˆ α˜ˆ BN , αˆ CR,K=2.5 were, respectively, 565.638, 566.338, and 229.064, (n, L) = (121, 3), the variances of α, ˆ α˜ˆ BN , αˆ CR,K=2.5 were considerably larger than that of αˆ CR,K=4 , 8.398. The absolute biases of the α, equal to 6.601, 6.584, and 2.390, respectively, whereas the absolute bias of αˆ CR,K=4 was 1.578. Table 3 presents simulation results corresponding to α = −15, which is a value of the roughness ˆ parameter typical of homogeneous regions. When (n, L) = (81, 8), the absolute relative biases of α, α˜ˆ BN , αˆ CR,K=4 and αˆ CR,K=2.5 were equal to 41.515%, 40.759%, 8.907%, and 17.413%, respectively. Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

358

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

Table 3. (n, L) α = −10 (81, 8)

Estimator

Mean

Variance

Bias

MSE

r.b. (%)

Skewness

Kurtosis

αˆ α˜ˆ BN

−12.112 −11.856 −9.742 −11.021

93.041 91.807 10.278 24.471

−2.112 −1.856 0.258 −1.021

97.502 95.253 10.345 25.513

21.122 18.562 −2.584 10.209

−18.525 −18.942 −1.710 −3.686

565.235 582.709 8.472 32.524

−16.601 −16.584 −8.422 −12.390

565.638 566.338 8.398 229.064

−6.601 −6.584 1.578 −2.390

609.214 609.686 10.886 234.774

66.012 65.839 −15.775 23.895

−6.731 −6.725 −1.218 −11.351

66.176 66.102 5.002 190.176

−21.227 −21.114 −13.664 −17.612

700.326 702.603 26.184 279.904

−6.227 −6.114 1.336 −2.612

739.104 739.982 27.969 286.727

41.515 40.759 −8.907 17.413

−10.996 −10.960 −3.142 −18.027

178.476 177.639 43.471 526.996

−32.439 −32.238 −10.252 −15.564

2543.590 2551.771 10.128 72.390

−17.439 −17.238 4.748 −0.564

2847.704 2848.905 32.670 72.709

116.259 114.917 −31.652 3.762

−3.867 −3.861 −0.674 −1.477

19.710 19.657 2.930 5.059

αˆ CR,K=4 αˆ CR,K=2.5 (121, 3)

αˆ α˜ˆ BN αˆ CR,K=4 αˆ CR,K=2.5

α = −15 (81, 8)

αˆ α˜ˆ BN αˆ CR,K=4 αˆ CR,K=2.5

(121, 3)

Descriptive analysis of estimators for α = −10 and α = −15

αˆ α˜ˆ BN αˆ CR,K=4 αˆ CR,K=2.5

The corresponding mean squared errors were 739.104, 739.982, 27.969, and 286.727. It is noteworthy that the best performing estimator was again αˆ CR,K=4 , and that the second best performing estimator displayed mean squared error over 10 times larger than that of αˆ CR,K=4 . When (n, L) = (121, 3), αˆ CR,K=2.5 displayed the smallest absolute relative bias (3.762%); however, αˆ CR,K=4 had the smallest mean squared error (32.670). Overall, the estimator with the best finite-sample performance was the modified profile maximum likelihood estimator of Cox and Reid (1987, 1989). The usual maximum likelihood estimator and the modified profile likelihood estimator obtained from ˜ BN displayed similar finite-sample behavior. We shall next consider hypothesis testing on the roughness parameter: heterogeneous and homogeneous vs. extremely heterogeneous regions, so H0 : α ≤ −5 versus H1 : −5 < α < 0. The null rejection rates (expressed as percentages) and the exact quantiles of the test statistics were obtained from 10 000 Monte Carlo replications. The powers of the tests (rejection rates, expressed as percentages, when H0 is false) were estimated from 5000 replications. Table 4 presents the null rejection rates of the different signalized likelihood ratio tests at the following significance levels: 5% and 10%. The value of α is −5 and we consider the following pairs (n, L): (25, 8) and (81, 3). Table 4.

Rejection rates under the null hypothesis, α = −5

(n, L)

Nominal level



˜ BN

CR,K=4

CR,K=2.5

(25, 8)

10% 5%

6.513 3.150

8.037 3.925

12.725 6.400

8.925 4.375

(81, 3)

10% 5%

7.920 3.880

9.080 4.570

16.320 8.770

10.500 5.500

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

359

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

Table 5.

Test statistics sample means and variances and their assymptotic values, α = −5

(n, L)

Moment

N(0, 1)



˜ BN

CR,K=4

(25, 8)

Mean Variance

0 1

−0.245 1.035

−0.113 1.014

0.237 0.829

−0.019 0.953

(81, 3)

Mean Variance

0 1

−0.143 1.017

−0.069 1.007

0.385 0.825

0.052 0.946

CR,K=2.5

0.8

The figures in Table 4 show that the tests based on ˜ BN and on CR,K=2.5 displayed the smallest size distortions; for instance, at the 10% nominal level and for (n, L) = (81, 3), the null rejection rates of the tests based on , ˜ BN , CR,K=4 , and CR,K=2.5 were equal to 7.920%, 9.080%, 16.320%, and 10.500%, respectively. Note that the test based on CR,K=4 was considerably liberal, i.e., it overrejects the null hypothesis when such a hypothesis is true. Table 5 contains the means and variances of the different test statistics and also their asymptotic counterparts. Note that the test statistic based on CR,K=4 displayed the poorest agreement between exact and asymptotic moments; for instance, when (n, L) = (81, 3), its mean and variance were equal to 0.385 and 0.825, respectively. The test statistics based on ˜ BN and CR,K=2.5 had first two moments closest to the corresponding asymptotic values; their means were equal to −0.069 e 0.052, and their variances were equal to 1.007 and 0.946, respectively.

0.4 0.2 −0.2

0.0

quantile discrepancy

0.6

original CR_K=4 CR_K=2.5 approx BN

−3

−2

−1

0

1

2

3

asymptotic quantile

Figure 2.

Copyright © 2007 John Wiley & Sons, Ltd.

Quantile discrepancy, α = −5, (n, L) = (25, 8)

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

360

0.8

1.0

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

0.4 −0.2

0.0

0.2

quantile discrepancy

0.6

original CR_K=4 CR_K=2.5 approx BN

−3

−2

−1

0

1

2

3

asymptotic quantile

Figure 3.

Quantile discrepancy, α = −5, (n, L) = (81, 3)

Figures 2 and 3 show quantile discrepancy plots where the differences between exact and asymptotic quantiles of the test statistics are plotted against the corresponding asymptotic quantiles, i.e., against standard normal quantiles. The closer to zero the relative quantile discrepancy, the better the approximation of the exact null distribution of the test statistic by the limiting normal distribution. When (n, L) = (25, 8), we note from Figure 2 that the null distribution function of the test statistic based on CR,K=2.5 is the one best approximated by the standard normal distribution. When (n, L) = (81, 3) (Figure 3), the best agreement between exact and asymptotic null distributions occurs for the test statistic obtained from ˜ BN (‘approx BN’ in the figure). The poor approximation of the exact null distribution of the test statistic based on CR,K=4 by the asymptotic null distribution (standard normal) in Figures 2 and 3 is noteworthy. Recall that it was by maximizing this modified profile likelihood that we obtained the most accurate point estimate of the roughness parameter (see Tables 1–3). Score function bias affects the bias of the corresponding maximum likelihood estimator, which is defined as the zero of the score estimating function. On the other hand, information matrix bias affects the distributional properties of the corresponding maximum likelihood estimator, thus affecting the finite-sample behavior of interval estimates and hypothesis tests based on such an estimator. For details, see McCullagh and Tibshirani (1990). This explains why the best performing point estimator does not lead to the most accurate associated test.

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

361

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

Table 6.

Rejection rates at 5% significance level and at the null hypothesis, α = −5

(n, L)

α



˜ BN

CR,K=2.5

(25, 8)

−4.9 −4.5 −4.0 −3.5 −3.0 −2.5 −2.0 −1.0

3.160 5.540 9.840 19.300 33.660 53.620 77.060 99.660

3.940 6.680 12.080 22.060 37.500 57.740 79.620 99.680

4.420 7.580 13.380 23.740 39.420 59.520 80.920 99.680

(81, 3)

−4.9 −4.5 −4.0 −3.5 −3.0 −2.5 −2.0 −1.0

4.540 7.580 14.560 27.560 48.980 74.320 93.780 100.000

5.280 8.420 15.980 29.860 51.340 75.640 94.580 100.000

6.160 9.540 17.700 32.600 54.400 77.760 95.200 100.000

Table 6 contains the rejection rates of the null hypothesis of the tests based on , ˜ BN , and CR,K=2.5 , at the 5% nominal level, when such a hypothesis is false. The rejection rates (powers) of the test based on CR,K=2.5 were always greater than those of the other tests; the next best performing test was that based on ˜ BN . Table 7 presents the null rejection rates of the different tests when α = −10, a value of the roughness parameter on the borderline between heterogeneous and homogeneous regions. We consider the pairs (n, L): (81, 8) and (121, 3). At the 10% nominal level and for (n, L) = (81, 8), the null rejection rates of the tests based on , ˜ BN , CR,K=4 , and CR,K=2.5 were 8.060%, 9.410%, 15.300%, and 10.770%, whereas for (n, L) = (121, 3) the corresponding rejection rates were equal to 8.660%, 9.640%, 23.400%, and 12.050%, respectively. The best performing test was that based on ˜ BN . Table 8 presents the means and variances of the different test statistics and their asymptotic counterparts. The test statistic that displayed the best agreement between exact and asymptotic first two moments was that based on ˜ BN . For instance, when (n, L) = (121, 3), its mean and variance were −0.054 and 1.022, respectively, the corresponding figures for the profile likelihood ratio statistic being −0.118 and 1.026. Table 7.

Rejection rates at 5% significance level and at the null hypothesis, α = −10

(n, L)

Nominal level



˜ BN

CR,K=4

CR,K=2.5

(81, 8)

10% 5%

8.060 4.070

9.410 4.650

15.300 7.950

10.770 5.250

(121, 3)

10% 5%

8.660 4.350

9.640 4.960

23.400 12.770

12.050 6.450

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

362

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

Table 8.

Test statistics sample means and variances and their assymptotic values, α = −10

(n, L)

Moment

N(0, 1)



˜ BN

CR,K=4

CR,K=2.5

(81, 8)

Mean Variance

0 1

−0.125 1.031

−0.050 1.025

0.320 0.885

0.047 0.982

(121, 3)

Mean Variance

0 1

−0.118 1.026

−0.054 1.022

0.647 0.742

0.147 0.905

Figures 4 and 5 present quantile discrepancy plots. When (n, L) = (81, 8), Figure 4, the tests with best finite-sample behavior were those based on ˜ BN and CR,K=2.5 . When (n, L) = (121, 3), Figure 5, the test statistic whose null finite-sample distribution is best approximated by the limiting N(0, 1) was the statistic based on ˜ BN , followed by the usual profile likelihood ratio test statistic. Table 9 contains the rejection rates of the null hypothesis when such a hypothesis is false, i.e., it contains the estimated powers of the tests. The nominal level of all tests is 5%. The results indicate that the test based on CR,K=2.5 is the most powerful, followed by the test based on ˜ BN .

Figure 4.

Copyright © 2007 John Wiley & Sons, Ltd.

Quantile discrepancy, α = −10, (n, L) = (81, 8)

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

363

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

Figure 5.

Table 9.

Quantile discrepancy, α = −10, (n, L) = (121, 3)

Rejection rates at 5% significance level and at the null hypothesis, α = −10

(n, L)

α



˜ BN

CR,K=2.5

(81, 8)

−9.5 −9.0 −8.0 −7.0 −6.0 −5.0 −3.0

5.780 8.260 15.760 31.440 52.980 77.940 99.880

6.680 9.300 17.380 33.920 55.680 80.160 99.880

7.500 10.460 19.140 36.000 58.260 81.700 99.920

(121, 3)

−9.5 −9.0 −8.0 −7.0 −6.0 −5.0 −3.0

5.680 7.240 11.520 20.720 35.440 57.760 98.160

6.560 7.820 12.480 22.240 37.480 59.660 98.460

8.020 9.380 15.460 26.100 41.840 63.900 98.880

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

364

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

Figure 6.

Single look E-SAR image

5. APPLICATION TO REAL DATA Figure 6 shows a single look image obtained by the E-SAR airborne sensor over surroundings of M¨unchen, Germany, originally of 1024 × 600 pixels with a resolution of the order of 1 m. Several types of land use are visible in this image, markedly crops (where little or no texture is visible), forest (where there is some texture) and urban areas (where the texture is intense). Representative areas are outlined and marked ‘C’, ‘F’, and ‘U’, respectively. The NW-SE arrow shows the flight path during which the data were collected. At each coordinate, a window of size 7 × 7 pixels was recorded, so we have over a thousand samples of size 49 with overlapping data. The data in each sample are assumed to be independent and identically distributed GA0 (αi , γi , 1) draws, i denoting the position. Figure 7 shows the pairs of estimates as computed in every coordinate of the image (the solid lines show the identity relationship). One can notice the different relationships among them: αˆ and αˆ BN behave similarly, as do αˆ CR,K=4 and αˆ CR,K=2.5 . The estimator αˆ CR,K=2.5 assumes smaller values than αˆ in most situations, whereas αˆ CR,K=4 usually exceeds αˆ CR,K=2.5 . The quantitative analysis we present consists of estimating α at each coordinate using profile and modified profile maximum likelihood methods. Four areas corresponding to well-defined classes were identified, namely two from the urban spot, one from forest, and one from pasture; 21 samples were taken from each area. Table 10 shows the mean, variance, skewness, and kurtosis of the estimates in each area. The first and second urban areas can be qualified as ‘pure’, in the sense that they mostly consist of buildings and houses; all estimators yield accurate point estimates, with a noticeable difference, ˆ however, in their variances (the observed variance of αˆ CR,K=2.5 is larger than that of α). Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

365

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

Figure 7.

Table 10.

First Urban Mean Variance Skewness Kurtosis Second Urban Mean Variance Skewness Kurtosis Forest Mean Variance Skewness Kurtosis Pasture Mean Variance Skewness Kurtosis

Pairs of estimates in all the positions

Descriptive analysis of estimates in four areas

αˆ

αˆ CR,K=4

αˆ CR,K=2.5

αˆ BN

−1.363 0.262 1.202 0.271

−1.220 0.662 −0.912 −0.316

−1.590 2.018 −1.298 0.298

−1.341 0.290 0.880 −0.231

−1.530 0.066 1.269 0.396

−1.222 0.142 −0.596 −0.277

−1.425 0.309 −0.766 −0.128

−1.500 0.086 0.923 −0.373

−2.232 0.092 −0.690 −0.273

−5.724 1.217 0.392 −0.783

−19.056 29.109 −0.187 −0.918

−2.457 0.115 −0.674 −0.334

−2.556 0.249 0.326 −1.293

−6.347 6.045 0.790 −0.613

−21.947 121.362 0.703 −0.828

−2.796 0.356 0.444 −1.119

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

366

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

Table 11. αˆ

Estimator Level Decision

Results of applying likelihood ratio tests to identified samples αˆ CR,K=2.5

5%, 10% +He

He

10%

Ho

N

First Urban 95.24 0.00 0.00 Second Urban 100.00 0.00 0.00 Forest 100.00 0.00 0.00 Pasture 100.00 0.00 0.00

+He

He

4.76 71.43 4.76 0.00 76.19 14.29 0.00 0.00 0.00 0.00 9.52 4.76

5% Ho

19.05 9.52 100.00 85.71

N

+He

He

4.76 66.67 9.52 0.00 61.90 14.29 0.00 0.00 0.00 0.00 9.52 4.76

Ho

N

19.05 21.81 100.00 85.71

0.00 0.00 0.00 0.00

The main differences arise when estimation is performed using data from forest and pasture areas. In both cases αˆ and αˆ BN yield values that are larger than expected; αˆ CR,K=4 and αˆ CR,K=2.5 yield better estimates, at the expense more variability. Table 11 presents the results of applying the likelihood ratio test based on the maximum likelihood and Cox–Reid (K = 2.5) estimators to these samples at the 10% and 5% significance levels; since the results at the two significance levels are the same when αˆ is used, they are presented in the same columns. For each type of target (First Urban, Second Urban, Forest, and Pasture) 21 samples of size 49 were used and the results of the tests are presented as percentages of the following: extremely heterogeneous (+He), heterogeneous (He), homogeneous (Ho), and numerical problems (N). It is noticeable that the test based on αˆ consistently classifies the samples as extremely heterogeneous, regardless of the ground truth; this decision is correct for areas labeled as Urban, but incorrect in Forest and Pasture. On the other hand, the test based on αˆ CR,K=2.5 is able to detect the homogeneity of the Pasture area and, to a lesser extent, the extreme heterogeneity of the two Urban spots. Samples labeled as Forest were not identified as such by any test, a result which is consistent with the values presented in Table 10: the mean of the maximum likelihood estimates in these samples (−2.232) suggests that they were obtained from an extremely heterogeneous area, which is wrong, whereas the mean of the corrected estimates (−19.056) suggests homogeneity of the region. A few samples labeled as Urban were classified as heterogeneous by the tests based on the Cox–Reid estimators; this is possibly due to the suburban nature of the area and the presence of trees near the houses.

6. CONCLUDING REMARKS In this paper we obtained adjustments to the profile likelihood function for the GA0 (α, γ, L) distribution in the context of modeling synthetic aperture radar images. The interest lies in performing inference on the roughness parameter of this distribution, which is used to determine whether an imaged region is homogeneous, heterogeneous, or extremely heterogeneous. The results are encouraging. Cribari-Neto et al. (2002, p. 816, Table 2) proposed bootstrap-adjusted estimators that require resampling of the observations and are, thus, computer intensive. When (n, L) = (49, 1), their least biased estimator displays absolute bias equal to 0.033, and the smallest mean squared error of their bootstrap estimators was equal to 0.186. The estimators proposed in this paper clearly outperform those proposed by Cribari-Neto et al. (2002). Table 1 (Section 4) shows that the absolute bias of αˆ CR,K=4 was equal to 0.010, the mean squared error of this estimator being equal to 0.097. Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

367

IMPROVED INFERENCE FOR THE GA0 DISTRIBUTION

We have also considered hypothesis tests. Overall, the test that displayed the smallest size distortions was that based on ˜ BN ; the next best performing test was that based on CR,K=2.5 . The adjusted tests proved to be more powerful than the usual profile likelihood ratio test. Finally, we have analyzed real data obtained from a SAR image. We selected samples from the three typical regions present in the image, namely, Urban, Forest, and Pasture. The adjusted profile maximum likelihood estimators proved to be more capable of providing useful information about the nature of the ground truth than the usual maximum likelihood estimator. Profile and adjusted profile (Cox–Reid, K = 2.5) likelihood ratio test statistics were computed using the same data. Decisions based on the former always suggested that the imaged area was urban, even when that was clearly not so, whereas the adjusted profile likelihood test yielded much more sensible inference. Future work should focus on improving the detection of forest, since this type of area was not correctly identified by any procedure. We strongly encourage practitioners to use the adjusted profile likelihood inference developed in this paper when analyzing speckled data. ACKNOWLEDGEMENTS

The authors gratefully acknowledge partial financial support from CNPq and FAPESP.

REFERENCES Allende H, Frery AC, Galbiati J, Pizarro L. 2006. M-estimators with asymmetric influence functions: the GA0 distribution case. Journal of Statistical Computation and Simulation 76: 941–956. Allende H, Galbiati J, Vallejos R. 2001. Robust image modeling on image processing. Pattern Recognition Letters 22: 1219–1231. Barndorff-Nielsen OE. 1980. Conditionality resolutions. Biometrika 67: 293–310. Barndorff-Nielsen OE. 1983. On a formula for the distribution of the maximum likelihood estimator. Biometrika 70: 343–365. Bustos OH, Lucini MM, Frery AC. 2002. M-estimators of roughness and scale for GA0-modelled SAR imagery. EURASIP Journal on Applied Signal Processing 2002: 105–114. Cox DR, Reid N. 1987. Parameter orthogonality and approximate conditional inference. Journal of the Royal Statistical Society B 49: 1–39. Cox DR, Reid N. 1989. On the stability of maximum-likelihood estimators of orthogonal parameters. The Canadian Journal of Statistics 17: 229–233. Cribari-Neto F, Frery AC, Silva MF. 2002. Improved estimation of clutter properties in speckled imagery. Computational Statistics and Data Analysis 40: 801–824. DiCiccio TJ, Martin MA, Stern SE, Young GA. 1996. Information bias and adjusted profile likelihoods. Journal of the Royal Statistical Society B 58: 189–203. Fraser DAS, Reid N. 1995. Ancillaries and third-order significance. Utilitas Mathematica 47: 33–53. Fraser DAS, Reid N, Wu J. 1999. A simple formula for tail probabilities for frequentist and Bayesian inference. Biometrika 86: 655–661. Freitas CC, Frery AC, Correia AH. 2005. The polarimetric G distribution for SAR data analysis. Environmetrics 16: 13–31. Frery AC, Cribari-Neto F, Souza MO. 2004. Analysis of minute features in speckled imagery with maximum likelihood estimation. EURASIP Journal on Applied Signal Processing 2004: 2476–2491. Frery AC, M¨uller H-J, Yanasse CCF, Sant’Anna SJS. 1997. A model for extremely heterogeneous clutter. IEEE Transactions on Geoscience and Remote Sensing 35: 648–659. Gambini J, Mejail M, Jacobo-Berlles J, Frery AC. 2006. Feature extraction in speckled imagery using dynamic B-spline deformable contours under the G0 model. International Journal of Remote Sensing 27: 5037–5059. Goodman JW. 1985. Statistical Optics. Pure and Applied Optics. Wiley: New York. McCullagh P, Tibshirani R. 1990. A simple method for the adjustment of profile likelihoods. Journal of the Royal Statistical Society B 52: 325–344. Medeiros FNS, Mascarenhas NDA, Costa LF. 2003. Evaluation of speckle noise MAP filtering algorithms applied to SAR images. International Journal of Remote Sensing 24: 5197–5218. Mejail ME, Frery AC, Jacobo-Berlles J, Bustos OH. 2001. Approximation of distributions for SAR images: proposal, evaluation and practical consequences. Latin American Applied Research 31: 83–92.

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

368

M. F. DA SILVA, F. CRIBARI-NETO AND A. C. FRERY

Mejail ME, Jacobo-Berlles J, Frery AC, Bustos OH. 2003. Classification of SAR images using a general and tractable multiplicative model. International Journal of Remote Sensing 24: 3565–3582. Moschetti E, Palacio MG, Picco M, Bustos OH, Frery AC. 2006. On the use of Lee’s protocol for speckle-reducing techniques. Latin American Applied Research 36: 115–121. Sartori N, Bellio R, Salvan A, Pace L. 1999. The directed modified profile likelihood in models with many parameters. Biometrika 86: 735–742. Severini TA. 1998. An approximation to the modified profile likelihood function. Biometrika 85: 403–411. Severini TA. 1999. An empirical adjustment to the likelihood ratio statistic. Biometrika 86: 235–247. Severini TA. 2000. Likelihood Methods in Statistics. Oxford University Press: Oxford. Vasconcellos KLP, Frery AC, Silva LB. 2005. Improving estimation in speckled imagery. Computational Statistics 20: 503–519. Yueh SH, Kong JA, Jao JK, Shin RT, Novak LM. 1989. K-distribution and polarimetric terrain radar clutter. Journal of Electromagnetic Waves and Applications 3: 747–768.

Copyright © 2007 John Wiley & Sons, Ltd.

Environmetrics 2008; 19: 347–368 DOI: 10.1002/env

Improved likelihood inference for the roughness ...

Aug 7, 2007 - 4, and numerical examples with real data sets are presented in ..... was over 14 times smaller than that of ˆα, the usual likelihood ..... estimators that require resampling of the observations and are, thus, computer intensive.

882KB Sizes 4 Downloads 201 Views

Recommend Documents

Bayesian Optimization for Likelihood-Free Inference
Sep 14, 2016 - There are several flavors of likelihood-free inference. In. Bayesian ..... IEEE. Conference on Systems, Man and Cybernetics, 2: 1241–1246, 1992.

An Improved Likelihood Model for Eye Tracking
Dec 6, 2005 - This property makes eye tracking systems a unique and effective tool for disabled people ...... In SPIE Defense and Security Symposium,. Automatic ... Real-time eye, gaze, and face pose tracking for monitoring driver vigilance.

Empirical likelihood based inference in conditional ...
This paper proposes an asymptotically efficient method for estimating models with conditional moment restrictions. Our estimator generalizes the maximum empirical likelihood estimator (MELE) of Qin and Lawless (1994). Using a kernel smoothing method,

low-cost road roughness machine
vehicle maintenance, the extent of tyre damage and ... The second class of instrument is the dynamic profile .... the mark at its closest proximity to the road.

Properties of the Maximum q-Likelihood Estimator for ...
variables are discussed both by analytical methods and simulations. Keywords ..... It has been shown that the estimator proposed by Shioya is robust under data.

A maximum likelihood method for the incidental ...
This paper uses the invariance principle to solve the incidental parameter problem of [Econometrica 16 (1948) 1–32]. We seek group actions that pre- serve the structural parameter and yield a maximal invariant in the parameter space with fixed dime

likelihood
good sales people, what is the probability that most psychologists make good sales ... If the explanations are identical, we would expect the premise to increase.

An Improved Algorithm for the Solution of the ...
tine that deals with duplicate data points, a routine that guards against ... numerical algorithm has been tested on several data sets with duplicate points and ...

Improved Online Algorithms for the Sorting Buffer Problem
still capture one of the most fundamental problems in the design of storage systems, known as the disk ... ‡School of Mathematical Sciences, Tel-Aviv University, Israel. ... management, computer graphics, and even in the automotive industry.

Alexandropoulou_S._et_al. The Likelihood of Upper-Bound ...
The Likelihood of Upper-Bound Construals among Different Modified Numerals.pdf. Alexandropoulou_S._et_al. The Likelihood of Upper-Bound Construals ...

Blind Maximum Likelihood CFO Estimation for OFDM ... - IEEE Xplore
The authors are with the Department of Electrical and Computer En- gineering, National University of .... Finally, we fix. , and compare the two algorithms by ...

Fast maximum likelihood algorithm for localization of ...
Feb 1, 2012 - 1Kellogg Honors College and Department of Mathematics and Statistics, .... through the degree of defocus. .... (Color online) Localization precision (standard devia- ... nia State University Program for Education and Research.

LDR a Package for Likelihood-Based Sufficient ...
more oriented to command-line operation, a graphical user interface is also provided for prototype .... entries. In the latter case, there are only two matrices G1 = Ip and G2 = eeT , where e ∈ Rp ..... Five arguments are mandatory when calling the

Improved Algorithms for Orienteering and Related Problems
approximation for k-stroll and obtain a solution of length. 3OPT that visits Ω(k/ log2 k) nodes. Our algorithm for k- stroll is based on an algorithm for k-TSP for ...

Improved Algorithms for Orienteering and Related Problems
Abstract. In this paper we consider the orienteering problem in undirected and directed graphs and obtain improved approximation algorithms. The point to ...

Improved Competitive Performance Bounds for ... - Semantic Scholar
Email: [email protected]. 3 Communication Systems ... Email: [email protected]. Abstract. .... the packet to be sent on the output link. Since Internet traffic is ...

Improved Approximation Algorithms for (Budgeted) Node-weighted ...
2 Computer Science Department, Univ of Maryland, A.V.W. Bldg., College Park, MD ..... The following facts about a disk of radius R centered at a terminal t can be ..... within any finite factor when restricted to the case of bounded degree graphs.

Inference Protocols for Coreference Resolution - GitHub
R. 23 other. 0.05 per. 0.85 loc. 0.10 other. 0.05 per. 0.50 loc. 0.45 other. 0.10 per .... search 3 --search_alpha 1e-4 --search_rollout oracle --passes 2 --holdout_off.

LEARNING AND INFERENCE ALGORITHMS FOR ...
Department of Electrical & Computer Engineering and Center for Language and Speech Processing. The Johns ..... is 2 minutes, and the video and kinematic data are recorded at 30 frames per ... Training and Decoding Using SS-VAR(p) Models. For each ...

Best Practices: Optimizing for the Improved YouTube - Services
redesigning the site to make it easier for our users to find and subscribe to the channels they love by putting the Guide on all pages. So, what does this mean for you as a ... interesting information first since this is most likely to show up with y

An improved method for the determination of saturation ...
2) use of cost effective excitation source with improved voltage regulation and ..... power electronic applications, renewable energy systems, energy efficiency.

Exploring Games for Improved Touchscreen Authentication ... - Usenix
... device owners with more us- able authentication, we propose the study and development .... smart-phone-thefts-rose-to-3-1-million-last-year/ index.htm, 2014.

Improved memory for information learnt before ...
License, which permits use, sharing, adaptation, distribution and reproduction ... To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.