IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 7, JULY 2006

To conclude, we have to address two questions unanswered in this correspondence. The first one is concerned with those practical cases where a state-space model is available. In these situations, the strategy of applying the standard estimators (EKF, UKF, etc.) directly on the known state-space model is obviously preferable to the proposed technique since the corresponding computational complexity is lower. Second, it has not been an objective of this correspondence to propose a definitive combination of an approximate KL expansion and a standard nonlinear estimation method that outperforms all other possible combinations. This combination probably does not exist. Hence, for each application, one has to pick the appropriate combination. This problem is similar to that of the standard estimation techniques (EKF, UKF, DD2, etc.) where there is not a “best” estimator either. In fact, one has to choose the estimator that is found to best trade off various properties, such as estimation accuracy, ease of implementation, numerical robustness, and computational burden [13] (see also [11]). REFERENCES [1] A. Jazwinski, Stochastic Processes and Filtering Theory. San Diego, CA: Academic, 1970. [2] M. Sugisaka, “The design of on-line least-squares estimators given covariance specifications via an imbedding method,” Appl. Math. Comput., no. 13, pp. 55–85, 1983. [3] S. Cambanis, “A general approach to linear mean-square estimation problems,” IEEE, Trans. Inf. Theory, vol. IT-19, no. 1, pp. 110–114, 1973. [4] W. Gardner, “A simple solution to smoothing, filtering, and prediction problems using series representations,” IEEE, Trans. Inf. Theory, vol. IT-20, no. 2, pp. 271–274, Mar. 1974. [5] J. Ruiz-Molina, J. Navarro-Moreno, and M. Estudillo, “On the problem of estimating a signal correlated with the observation noise,” IEEE Signal Process. Lett., vol. 11, no. 3, pp. 330–330, Mar. 2004. [6] K. Fukunaga and L. Koontz, “Representation of random processes using the finite Karhunen–Loève expansion,” Inf. Control, vol. 16, pp. 85–101, 1970. [7] J. Burl, “Estimating the basic functions of the Karhunen-Loève transform,” IEEE Trans. Acoust. Speech, Signal Process., vol. 37, no. 1, pp. 99–105, Jan. 1989. [8] J. Navarro-Moreno, J. Ruiz-Molina, and R. Fernández, “Approximate series representations of second-order stochastic processes: Applications to signal detection and estimation,” IEEE Trans. Inf. Theory, vol. 49, no. 6, pp. 1574–1579, Jun. 2003. [9] W. Gardner and L. Franks, “An alternative approach to linear least squares estimation of continuous random processes,” in Proc. 5th Annu. Princeton Conf. Information Sciences Systems, Princeton, NJ, Mar. 1971, pp. 267–275. [10] T. Kailath, “A view of three decades of linear filtering theory,” IEEE Trans. Inf. Theory, no. 2, pp. 146–181, Mar. 1974. [11] S. Haykin, Kalman Filtering and Neural Networks. New York: Wiley, 2001. [12] B. Anderson and J. Moore, Optimal Filtering. Englewood Cliffs, NJ: Prentince-Hall, 1979. [13] M. Nørgaard, N. Poulsen, and O. Ravn, “New developments in state estimation for nonlinear systems,” Automatica, vol. 36, pp. 1627–1638, 2000. [14] T. Kailath and H. Poor, “Detection of stochastic processes,” IEEE Trans. Inf. Theory, vol. 44, no. 6, pp. 2230–2259, Oct. 1998. [15] S. Nakamori, “Design of predictor using covariance information in continuous-time stochastic systems with nonlinear observation mechanism,” Signal Process., no. 68, pp. 183–193, 1998. [16] G. Eyink. (2001) “A variational formulation of optimal nonlinear estimation,”. Los Alamos National Laboratory. [Online]. Available: http://arxiv.org/abs/physics/0011049 [17] M. Chen, Z. Chen, and G. Chen, Approximate Solutions of Operator Equations, Singapore: World Scientific, 1997. [18] E. Kelly and W. Root, “A representation of vector-valued random processes,” J. Math. Phys., vol. 39, pp. 211–216, 1960. [19] K. Phoon, S. Huang, and S. Quek, “Implementation of Karhunen–Loève expansion for simulation using a Wavelet–Galerkin scheme,” Probab. Eng. Mech., no. 17, pp. 293–303, 2002.

2827

Second-Order Heavy–Tailed Distributions and Tail Analysis Tuncer C. Aysal, Student Member, IEEE, and Kenneth E. Barner, Senior Member, IEEE Abstract—This correspondence studies the second-order distributions of heavy-tail distributed random variables (RVs). Two models for the heavytailed distributions are considered: power law and -contaminated distributions. Special cases of the models considered include 1) RVs formed by the product of two independent, but not necessarily identically distributed, and , such that , and 2) RVs formed heavy-tailed RVs . Tail through squaring a heavy-tail distributed RV , such that analysis of the RVs, their cross terms, and square values shows the ordering of their tail heaviness. The following results hold strictly for power law distributions and, under mild conditions, for -contaminated distributions: ( ) is heavier than that of ( ), and the tail of ( ) The tail of is heavier than that of ( ), where ( ) denotes the probability density distribution of the corresponding random variable. The heaviness of the tails indicates that robust methods of sample combination and output determination should be utilized to avoid undue influence of outliers and degradation in performance. As examples, the denoising and frequency-selective filtering problems under the derived cross and square statistics for hyperbolic–tailed and -contaminated models are considered. Simulation results indicate that the weighted myriad (WMY) filter outperforms the weighted median (WM) filter, and the WM filter outperforms the weighted sum [finite-impulse-response (FIR)] filter. The results may be exploited in higher order applications of heavy-tailed distributions in networking, such as data traffic modeling, and in nonlinear signal and image processing, such as polynomial and Volterra filtering.

=

=

Index Terms—Cross distribution, heavy-tailed distributions, square distribution, tail analysis.

I. INTRODUCTION Heavy-tailed distributions are of particular interest in many networking and signal processing applications. The power law distributions are widely used for approximating packet interarrival times [1], as well as in other networking applications and the modeling of CPU time consumed by an arbitrary process [2]. Heavy-tailed distributions characterized by -contaminated (Gaussian-mixture) distributions are also of particular interest in many robust signal, image, and speech processing applications [3], [4]. Higher order statistics are exploited in numerous nonlinear filtering methods, including polynomial and Volterra filtering [2], [3], which utilize the cross and squared terms, as well as higher order statistics, to form the filter output. Similarly, networking applications, including the incremental process of a self-similar process [2], [5], often exploit higher order statistics. This paper studies the distribution of higher order statistics formed from random variables (RVs) with heavy-tail distributions following two commonly utilized models: power law and -contaminated distributions. The effects of the product and square operators on heavytailed distributed RVs are determined. The results show that, under given models, the cross- and square-term RVs are also heavy-tailed distributed. Moreover, their tail orders are such that the tail of fZ (x) is heavier than that of fX (x), and the tail of fW (x) is heavier than

Manuscript received April 4, 2005; revised September 6, 2005. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Chong-Yung Chi. The authors are with the Department of Electrical and Computer Engineering, University of Delaware, Newark, DE 19716 USA (e-mail: [email protected]; [email protected]). Digital Object Identifier 10.1109/TSP.2006.874776

1053-587X/$20.00 © 2006 IEEE

2828

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 7, JULY 2006

that of fZ (x), where fZ (x), fW (x), and fX (x) denote the distribution of a RV formed by the product of two independent, but not necessarily identically distributed, heavy-tailed RVs X and Y , such that Z = XY , the distribution of a RV formed through the squaring a heavy-tail distributed RV X , such that W = X 2 , and the heavy-tail distributed RV X , respectively. The denoising and frequency-selective filtering problems under the derived cross and square statistics for power law and Gaussian-mixture models are considered. Simulation results demonstrate that the weighted myriad (WMY) filter [4] outperforms the weighted median (WM) filter [6], and the WM filter outperforms the weighted sum [finite-impulse-response (FIR)] filter. The remainder of this correspondence is organized as follows. Section II discusses the distribution of the heavy-tailed random variables under two models: their cross- and square-term distributions, and tail analysis. The simulations and discussion evaluating the results are given in Section III, and the conclusions are drawn in Section IV.

where X = [a; b] denotes the range of the RV X and Y . Replacing FY (z=x) and fX (x) with their expressions, expanding the product, using the linearity of the integration, and performing some manipulations yields

FZ (z ) = 1 0 LK [ln (b) 0 ln (a)]z 0 where Z 2 [a2 ; b2 ]. The TF is simply pdf of Z is obtained as

0 FZ (z) and the (5)

where L = L . Taking the limit in (2) to test whether the distribution is hyperbolic–tailed or not yields lim

Two important heavy-tailed distribution models that are of particular interest in networking and signal processing applications are analyzed in this section. The cumulative distribution, density and tail functions of second-order terms including cross and square terms are derived. The derivation of tail functions are followed by a tail-heaviness study.

1

fZ (z ) = L K [ln (b) 0 ln (a)]z 0 01

z!1

II. SECOND–ORDER HEAVY–TAILED DISTRIBUTIONS AND TAIL ANALYSIS

F Z (z ) =

(4)

F Z (z ) z 0

=

LK [ln (b) 0 ln (a)]

(6)

where LK [ln (b) 0 ln (a)] > 0 is a constant. Note that a heavytailed distribution in [a2 ; b2 ] is heavy-tailed in [a2 ; 1] [7]. Thus, the product of two independent power-law-distributed RVs is also hyperbolic-tailed. Consider now an RV W formed as the square (i.e., W = X 2 ) of a nonnegative power-law-distributed RV X , with pdf fX (x) = K x0 01 . The cdf of W is given by the evaluation of

A. Power Law Distributions The relation between the long–range dependence and self–similarity relation is shown and formulated in [2]. In a communication context, self–similarity is closely related to power law distributions [2]. Hence, the higher order distributions and tails of the higher order distributions of power-law-distributed RV are analyzed. A nonnegative RV X is said to be heavy-tailed if the tail function (TF), F (x), of X is given by

F X (x)  P r(X > x) = c x0 x



(1)

for some x0 such that x > x0 , tail index > 0, and constant c > 0 [7]. Thus, regardless of the distribution for small values, a RV is power law distributed if the asymptotic shape of its distribution is hyperbolic [5], as follows: lim

x!1

F X (x ) x0

=

n

(2)

n > 0 is a positive constant. Since F X (x) = 1 0 P r(X < x) = 1 0 FX (x), the cumulative distribution function (cdf) of a powerlaw-distributed RV X can be found as F (x) = 1 0 Kx0 , where K = cx 0 > 0 is a constant. The probability density function (pdf) fX (x) of the RV X is then given by fX (x) = K x0 01 , where K = K . where

1) Second-Order of Power Law Distributions: In this section, the cdf’s, TFs and pdf’s of RVs generated through product and square operations performed on independent power-law-distributed RVs are analyzed. Consider first a RV Z formed as the product of two nonnegative independent RVs X and Y , i.e., Z = XY . It is taken that X and Y are power law distributed, with cdf’s FX (x) = 1 0 Kx0 and FY (x) = 1 0 Lx0 , where K and L are constants. The cdf of Z , over the range of X , is given by

FZ (z ) =

FY z fX (x)dx x x2X

(3)

FW (w) =

fX (x)dx: jxj
(7)

Utilizing the fact that X is a nonnegative RV defined over X = [a; b], p where a > 0, yields FW (w) = FX ( w). Evaluating this for the power law pdf of X gives

FW (w) = 1 0 Kw0 =2

(8)

where K > 0 is a constant. The TF is simply F W (w) = 1 0 FW (w). Finally, the pdf of W is given by

fW (w) = K w0 =201 2

(9)

and W 2 [a2 ; b2 ]. Applying a process similar to the previous case for evaluating the limit in (2) yields lim

w!1

F W (w ) w0 =2

=

K:

(10)

Thus, a RV W = X 2 , formed as the square of a power-law-distributed RV, is also hyperbolic–tailed. 2) Tail Analysis of Second-Order Power Law Distributions: Tails of the distributions derived in Section II-A-1) are now analyzed, and the tail heaviness order is determined. Consider first the TFs of a power-law-distributed RV X and the RV Z = XY generated as the product of two independent heavytailed RVs. Consider the following to determine tail heaviness order: limx!1 F Z (x)=F X (x). It is clear that if this limit is greater then unity, then the pdf tail of Z = XY is heavier than the pdf tail of X , and vise versa if the limit is smaller than unity. Replacing F Z (x) and F X (x) in the preceding limit equation yields

F Z (x ) lim x!1 F X (x)

=

L [ln (b) 0 ln (a)]

(11)

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 7, JULY 2006

2829

Note that the extension to symmetric pdf’s of the derivations detailed here is straightforward. B. Gaussian-Mixture Distributions A second important model that is used for modeling heavy-tailed environments in signal processing applications [4] is the Gaussian-mixture (-contaminated) distributions. The Gaussian-mixture pdf takes the form

fX (x) = (1 0 )fn (x) + fc (x)

(14)

where fn (x) is the nominal Gaussian pdf with variance n2 ,  is a small positive constant determining the percentage of contamination, and fc (x) is the contaminating pdf with a large relative variance c2  n2 . The TF of Gaussian-mixture-distributed RV can be shown to have exponential order tails with asymptotic similarity

=

=

Fig. 1. Second-order distributions of Pareto distribution: TFs of (solid (dashed line), and, (dotted line). All location line), parameters are 1.

=

where L = L . For b=a > exp (1=L ), which is the case for the power-law-distributed RVs, limx!1 F Z (x)=F X (x) > 1. Similarly, it can be shown that limx!1 F Z (x)=F Y (x) > 1 for b=a > exp (1=K ) . The tail of the pdf of Z = XY is therefore heavier than those of the pdf’s of X and Y . Consider now the TF of the RV generated by squaring a power-law-distributed RV. As in the previous case, we examine 2 limx!1 F W (x)=F Z (x), where W = X . Replacing F W (x) and F Z (x) in the preceding yields

F W (x ) lim x!1 F Z (x) since x0 decays faster then tail-heaviness order is

=

1

(12)

x0 =2 . Thus, we can conclude that the

F X (x) > F XY (x) > F X (x)

(13)

where F X (x), F XY (x), and F X (x) denote the TF of the squared, cross, and parent power-law-distributed RVs, respectively. Example: Consider the Pareto distribution whose TF is defined as F X (x) = (x=k)0 . The variance 2 = 2 and k = 1 case, which corresponds to = 2:5370, is examined here. The cdf, TF, and pdf’s of the Pareto-distributed RV are given by FX (x) = 1 0 (1=x)2:5370 , F X (x) = (1=x)2:5370 , and fX (x) = 2:5370x03:5370 , respectively. The cdf, TF and pdf’s of the RV Z = XY , where FX (x) = FY (x), are given by FZ (x) = 1 0 3:9120(1=x)2:5370 , F Z (x) = 3:9120(1=x)2:5370 , and fZ (x) = 9:9247x03:5370 , respectively. Similarly, the cdf, TF and pdf’s the RV W = X 2 are FW (x) = 1 0 (1=x)1:2685 , F W (x) = (1=x)1:2685 , and fW (x) = 1:2685x02:2685 , respectively. The tail decay rate in this case is ordered as X > Z > W , where (1) denote the tail decay rate of f(1) (x). Thus, the tail-heaviness order is F W (x) > F Z (x) > F X (x). Fig. 1 illustrates the TFs of the parent, cross, and square hyperbolic–tailed Pareto distributions. Robust schemes should be developed to appropriately process the second-order distributions of heavy-tailed RVs with hyperbolic tails as indicated by the results. This indicates robust combination methods should be utilized in, for instance, polynomial filters, which is the approach adopted in [8].

F X (x)  (1 0 ) pn x01 exp 2

0 2x

2

n2

+

pc x0 2

1

0 2x : 2

exp

c2

(15)

1) Second-Order of Gaussian-Mixture Distributions: Consider the second-order distributions of Gaussian-mixture density functions. Let fX (x) = (1 0 1 )fn;1 (x)+ 1 fc;1 (x) and fY (x) = (1 0 2 )fn;2 (x)+ 2 fc;2 (x), where fX (x), fY (x), 1 , and 2 denote the pdf’s of the Gaussian-mixture-distributed RVs X , Y , and contamination parameters, respectively. The pdf of Z = XY (not restricted to any pdf’s) is given by

1

fn;1 z fn;2 (x)dx x 1 1 + (1 0 1 )2 fn;1 z fc;2 (x)dx x x x=01 1 1 + 1 (1 0 2 ) fc;1 z fn;1 (x)dx x x x=01 1 1 +  1 2 fc;1 z fc;2 (x)dx: (16) x x=01 x

fZ (z ) = (1 0 1 )(1 0 2 )

x=01

1

x

Substituting fn;1 (1), fc;1 (1), fn;2 (1), and fc;2 (1) with zero-mean Gaussian pdf’s with scale parameters n;1 , c;1 , n;2 , and c;2 , respectively, yields

jz j fZ (z ) = (1 0 1 )(1 0 2 ) K0 n;1 n;2 n;1 n;2 (1 0 1 )2 jz j + K n;1 c;2 0 n;1 c;2 1 (1 0 2 ) K jz j + c;1 n;1 0 c;1 n;1 1  2 K jz j + c;1 c;2 0 c;1 c;2

(17)

where Kn (1) denotes the modified Bessel function of the second kind of order n. The modified Bessel function of the second kind is not appropriate for the tail analysis of the RV Z . It has been shown in [9] that the p K0p(x) is asymptotically similar with 1= x exp (0x), i.e., K0 (x)  0 x 1= xe . Using this similarity in the computation of F Z (x), applying

2830

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 7, JULY 2006

integration by parts, and using asymptotic series expansion for erf(1) function yields the TF of Z F Z (z)

01=2

 z2p

0 1 )pn;1 n;2(1 0 2 ) exp 0 n;1zn;2 p z +(1 0 1 )2 n;1 c;2 exp 0 (1

n;1 c;2

p z +1 (1 0 2 ) c;1 n;2 exp 0 c;1 n;2 p z +1 2 c;1 c;2 exp 0 :

(18)

c;1 c;2

Consider now a random variable W formed as the square (i.e., W = ), of a Gaussian-mixture-distributed RV X with pdf defined in (14), i.e., fX (x) = (1 0 1 )fn;1 (x) + 1 fc;1 (x). The pdf of W = X 2 (not restricted to any pdf’s) is given by X

2

fW (w) = (1

01=2

0 1 ) w 2

[fn;1 (

+1

w

pw) + f

01=2 2

n;1 (

0pw)]

p p [fc;1 ( w) + fc;1 (0 w)]:

=

(19)

Replacing fn;1 (1) and fc;1 (1) with the zero-mean Gaussian pdf expressions yields fW (w) = (1

0 1 )  p2w exp 0 1

n;1

+1

c;1

Similar reduction can be applied to F W (w), yielding F W (w)

w 2 2

n;1

p 1

2w

:

2 

w

01=2

(1

0 1 )n;1 exp 0

c;1

w 2 2

+1 c;1 exp

n;1

!1

x

:

c;1

(21)

lim

F Z (x)

!1 F X (x)

=

1:

(22)

It can be similarly shown that limx!1 F Z (x)=F Y (x) = 1. Hence, the tail of cross Gaussian-mixture pdf is heavier than that of the pdf of the parent Gaussian mixture. Consider next limx!1 F W (x)=F Z (x). In F Z (x), note that c;1 c;2  n;1 n;2; n;1 c;2 ; c;1 n;2 . Thus, the component that determines the tail heaviness F Z (z) reduces to F Z (z)

01=2

 z2p 1 2 pc;1c;2 exp 0 c;1zc;2

:

(23)

F W (x) F Z (x)

=

=

Since 2

0 2w2

2) Tail Analysis of Second-Order Gaussian-Mixture Distributions: Tails of the distributions derived in Section II-B-1) are now analyzed and the order of tail heaviness determined. Consider first the TFs of a Gaussian-mixture distributed RV X and the RV Z = XY generated as the product of two independent Gaussian-mixture-distributed RVs. We evaluate 01 limx!1 F Z (x)=F X (x) to determine tail-heaviness order. Since, x 2 0 1=2 and exp (0x ) decay faster than x and exp (0x), it is clear that

x

c;1 exp

0 2w2

:

c;1

(20)

Similarly to the F Z (x) case, the integration by parts and asymptotic series expansion for the erf(1) applied to F W (w) gives



01=2

 1 w 2

(24)

The limit in consideration is now formulated as

0 2w2

exp

lim

F W (w)

=

Fig. 2. Second-order distributions of Gaussian-mixture distribution: TFs of (solid line), (dashed line), and (dotted line). The 2 yielding ( ) ( ) and 2 yielding ( ) ( ) cases are shown.



0

2 c; x(c;2 2c;1 ) 1 lim exp 2  2 2 22 x!1 2c; 1 c;2 0; if c;2 > 2c;1 ; if c;2 < 2c;1 :  1 ; if c;2 = 2c;1  2

0

1

(25)

1, in > 1.

most practical cases, it can be noted that The tail of the square pdf is thus heavier than that of the cross pdf if c;2  2c;1 . Also, the tail of the cross pdf is heavier than that of square pdf if c;2 > 2c;1 . We can thus, conclude the tail-heaviness order as (2 )

01

=2

j 2

F X (x) > F XY (x) 

> F X (x)

(26)

which is similar to the hyperbolic–tailed distributed RV case. Example: Fig. 2 gives the exact TFs of the parent, cross, and square Gaussian-mixture RVs for 1 = 2 = 0:1, n;1 = n;2 = 1, c;1 = 5, and c;2 = 5, 15 cases. It can be noted that in the case c;2 = 5 < 2c;1 , the tail of square pdf is heavier than that of cross pdf, as expected. In the c;2 = 15 > 2c;1 case, however, the tail of the cross pdf is heavier than that of the square pdf, again in agreement with the derivations. In addition, the cross and square pdf’s have heavier tails than that of the parent mixture pdf in both cases. Hence, the cross-terms and square values of a Gaussian-mixture-distributed RV have heavy tails with the ordering in (26). Robust schemes must be developed to appropriately process the second-order distributions of heavy-tailed RVs with Gaussian-mixture distribution. III. SIMULATION RESULTS AND DISCUSSIONS This section considers the filtering problem under the statistics that are derived and studied in this paper. Consider first the filtering problem

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 7, JULY 2006

Fig. 3. Filtering under cross and square –stable distribution case. (a) From top to bottom: Corrupted single–tone input signal, FIR, WM, and WMY filter outputs for cross statistics. (b) From top to bottom: Corrupted single–tone input signal, FIR, WM, and WMY filter outputs for square statistics.

under the higher orders of a hyperbolic–tailed distribution. The hyperbolic–tailed noise distribution considered here is –stable distribution [10], where the TF is given by F X (x)  0( )sin( =2)=x0 [4]. A single-tone signal is taken as the input signal with normalized frequency of 0.01 Hz. The case where the input signal is corrupted by additive cross –stable noise with X = Y = 1:4, where X and Y denote the tail index of the parent -stable distributions, is shown in Fig. 3(a). In addition, the corrupted signal is passed through FIR, WM [6], and WMY [4] filters with window sizes N = 11 for denoising purposes. The FIR filter output is based on the weighted sum combination of observation samples, while the WM output is founded on sample selection based on rank order. Rank-order-based selection of the output sample is more robust than output methodologies based on weighted sums. The WMY filter, with roots in the maximum-likelihood estimate of the -stable ( = 1) models minimizing the sum of logaritmic square deviations, are shown to be robust to very impulsive noise characteristics [4]. (More detail on the implementation and characteristics of the filters can be found in [2], [4], and [6]). The FIR filter weights are designed through MATLAB’s fir1 command. The same filter weights are also used for WM and WMY filter formulations. The FIR, WM, and WMY filter outputs are given in Fig. 3(a). The FIR filter output is severely affected by the noise components as the impulsive samples are clearly present in the FIR filter output. In the WM and WMY filter cases, the noise does not significantly deteriorate the responses, especially in the WMY case. Next, consider the filtering problem in the squared -stable noise case. The single–tone input signal corrupted with square -stable noise is given in Fig. 3(b), along with the FIR, WM, and WMY filter outputs. Once again, the FIR filter output is severely damaged, while the WM and WMY filter outputs are robust to the square noise characteristics. Note that the output signals, even for WM and WMY cases, are not as clean as in the cross case, which is due to the heavier nature of the square pdf tail. The filtering problem under the higher order distributions of contaminated Gaussian case is considered next. The parameters utilized in the simulations are 1 = 2 = 0:1, n;1 = n;2 = 1, and c;1 = c;2 = 10. The same single-tone input signal is considered. The input signal corrupted by cross-contaminated Gaussian distribution noise is shown in Fig. 4(a), along with the FIR, WM, and WMY filter outputs. The FIR filter output is again severely affected by the noise. As in the hyperbolic-tailed noise case, the WMY outperforms the WM, and the WM outperforms the FIR filter. The experiment is also performed under the square-contaminated Gaussian case, and the results are given in Fig. 4(b). As in the hyperbolic case, all the filter performances are slightly worse than the cross-contaminated Gaussian case, but as expected, the WMY filter is more robust to noise than the WM filter, and the WM filter is more robust than the FIR filter. The time-domain plot observations are consistent with the quantitative L1 norm error measurements, which are tabulated in Tables I and

2831

Fig. 4. Filtering under cross and square -contaminated distribution case. (a) From top to bottom: Corrupted single–tone input signal, FIR, WM, and WMY filter outputs for cross statistics. (b) From top to bottom: Corrupted single–tone input signal, FIR, WM, and WMY filter outputs for square statistics.

II for -stable (where is the scale parameter) and -contaminated Gaussian-mixture distribution cases. The tables also include results obtained under varying statistics. A large number of important problems in signal processing require the design of bandpass and high–pass filters. Channel equalization, beamformers, and predictors are example applications where flexible frequency–selective processing is needed. Fig. 5(a) depicts a linearly swept–cosine signal spanning instantaneous frequencies ranging from 0 to 400 Hz corrupted by additive cross -stable noise, along with the FIR, WM, and WMY filter outputs. Fig. 5(a) is truncated so that the same scale is used in all plots. The FIR filter is designed by MATLAB’s fir1 function with passband 0:075  w  0:125 (normalized frequency with Nyquist = 1). The same filter weights are also used for the WM and WMY cases. The FIR filter output is severely affected by the noise components. Ringing artifacts emerge with each impulse fed into the filter. In the WM and WMY filter cases, the noise does not significantly deteriorate the responses, especially in the WMY case. Next, consider the frequency-selective filtering problem in the squared -stable noise case. The input chirp signal corrupted with square -stable noise is given in Fig. 5(b), along with the FIR, WM, and WMY filter outputs. Once again, the FIR filter output is severely damaged, while the WM and WMY filter outputs are robust to the square noise characteristics. Note that the output signals, even for WM and WMY cases, are not as clean as in the cross case, which is due to the heavier nature of the square pdf tail. The frequency-selective filtering problem under the higher order distributions of the contaminated Gaussian case is considered next. The parameters utilized in the simulations are 1 = 2 = 0:1, n;1 = n;2 = 1, and c;1 = c;2 = 10. The same input chirp signal is considered. The input signal corrupted by cross-contaminated Gaussian distribution noise is shown in Fig. 6(a), along with the FIR, WM, and WMY filter outputs. The FIR filter output is again severely affected by the noise. As in the hyperbolic-tailed noise case, the WMY outperforms the WM, and the WM outperforms the FIR filter. The experiment is also performed under square contaminated Gaussian case, and the results are given in Fig. 6(b). All the filter performances are slightly worse than the cross-contaminated Gaussian case, but as expected, the WMY filter is more robust to noise than the WM filter, and the WM filter is more robust than the FIR filter. Note that linear FIR, WM, and WMY filters are statistically related to the maximum- likelihood estimate under Gaussian, Laplacian, and Cauchy statistics, respectively [3], [4], [6], [8]. In addition, the Cauchy pdf exhibits tails heavier than that of the Laplacian pdf, and the Laplacian pdf exhibits tails heavier than that of Gaussian pdf. The WMY filter is thus more robust to impulsive noise characteristics than the WM filter is, and the WM filter is more robust to impulsive noise characteristics than the FIR filter is. The analysis, results, and simulations

2832

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 7, JULY 2006

TABLE I ENSEMBLE AVERAGE OF 100 MAES OF THE OUTPUT OF THE FIR, WM, AND WMY FILTERS 0 2 AND VARYING) AND SQUARE -STABLE NOISE IN PRESENCE OF CROSS (

=

TABLE II ENSEMBLE AVERAGE OF 100 MAES OF THE OUTPUT OF THE FIR, WM, AND WMY FILTERS IN PRESENCE OF CROSS ( = = 1, = 10, AND VARYING) AND SQUARE -CONTAMINATED GAUSSIAN-MIXTURE NOISE

Fig. 5. Frequency-selective filtering under cross and square –stable distribution case. (a) From top to bottom: Corrupted chirp input signal, FIR, WM, and WMY filter outputs for cross statistics. (b) From top to bottom: Corrupted chirp input signal, FIR, WM, and WMY filter outputs for square statistics.

Fig. 6. Frequency-selective filtering under cross and square Gaussian-mixture distribution case. (a) From top to bottom: Corrupted chirp input signal, FIR, WM, and WMY filter outputs for cross statistics. (b) From top to bottom: Corrupted chirp input signal, FIR, WM, and WMY filter outputs for square statistics.

presented here demonstrate the need for robust combinations and filtering methods for the processing of second-order statistics. In addition to the tangible examples presented here, such processing is critical for the common additive noise case in polynomial and multiplicative environments. IV. CONCLUSION Power law and Gaussian-mixture heavy-tailed models that are of particular interest in networking and signal processing applications are considered in this paper. The second-order distribution and density functions of the focused models are derived along with the tail functions of the obtained statistics. The tail-heaviness order of the obtained statistics is determined for both models. The heaviness of the tails indicate that robust methods of sample combination and output determination should be utilized to avoid undue influence of outliers and degradation in performance. The denoising and frequency-selective filtering problems under the computed statistics are considered and evaluated with simulations for different filter types. It has been shown that the WMY filter outperforms the WM filter, and the WM filter outperforms

the FIR filter under corruption by the cross and square statistics of the considered models. REFERENCES [1] V. Paxson and S. Floyd, “Wide area traffic: The failure of Poisson modeling,” IEEE/ACM Trans. Netw., vol. 3, no. 3, pp. 226–244, Jun. 1995. [2] K. E. Barner and G. R. Arce, Nonlinear Signal and Image Processing: Theory, Methods and Applications. Boca Raton, FL: CRC Press, 2004. [3] I. Pitas and A. Venetsanopoulos, Nonlinear Digital Filters: Principles and Application. Norwell, MA: Kluwer, 1990. [4] G. R. Arce, Nonlinear Signal Processing: A Statistical Approach. New York: Wiley, 2005. [5] M. Crovella and A. Bestavros, “Self-similarity in world wide web traffic: Evidence and possible causes,” IEEE/ACM Trans. Netw., pp. 835–846, Dec. 1997. [6] G. Arce, “A general weighted median filter structure admitting negative weights,” IEEE Trans. Signal Process., vol. 46, no. 12, pp. 3195–3205, Dec. 1998. [7] J. Chen, “From the central limit theorem to heavy-tailed distributions,” J. Appl. Probab., vol. 40, pp. 803–806, Feb. 2003. [8] K. E. Barner and T. C. Aysal, “Polynomial weighted median filtering,” IEEE Trans. Signal Process., vol. 54, no. 2, pp. 636–650, Feb. 2006. [9] A. Erdelyi, W. Magnus, F. Oberhettinger, and F. Tricomi, Higher Transcendental Functions. New York: McGraw-Hill, 1953, vol. I, II and II, The Bateman Manuscript Project. [10] C. Nikias and M. Shao, Signal Processing with Alpha–Stable Distributions and Applications. New York: Wiley, 1995.

Second-Order Heavy–Tailed Distributions and Tail ...

Information Sciences Systems, Princeton, NJ, Mar. 1971, pp. 267–275. ... continuous-time stochastic systems with nonlinear observation mecha- nism,” Signal ...

292KB Sizes 0 Downloads 34 Views

Recommend Documents

A New Method for Measuring Tail Eponents of Firm Size Distributions
Dec 7, 2011 - Informatics and Business, 10 Sue-machi Kanazawa, Ishikawa, Japan; ... Economics: The Open-Access, Open-Assessment E-Journal,. Vol.

A New Method for Measuring Tail Eponents of Firm Size Distributions
Dec 7, 2011 - New Approaches in Quantitative Modeling of Financial Markets ...... Levy Distribution to Stock Market Index Data: Selecting a Suitable Cutoff.

Health Savings Account Balances, Contributions, Distributions, and ...
Nov 29, 2016 - Distributions, and Other Vital Statistics, 2015: Estimates ... The Employee Benefit Research Institute (EBRI) maintains data on ...... An Analysis of Health Savings Account Balances, Contributions, and Withdrawals in 2012.

Health Savings Account Balances, Contributions, Distributions, and ...
Nov 29, 2016 - and Affordable Care Act of 2010 (ACA) requires be covered in full.) Otherwise, all health care services must be subject to the HSA's deductible.

Health Savings Account Balances, Contributions, Distributions, and ...
Sep 19, 2017 - This Issue Brief is the fourth annual report drawing on cross-sectional data from the EBRI ... ebri.org Issue Brief • Sept. .... ERISA Compliance .

Health Savings Account Balances, Contributions, Distributions, and ...
3 days ago - Distributions, and Other Vital Statistics, 2016: Statistics ... Institute (EBRI) developed the EBRI HSA Database to analyze the state of and ... Health Education and Research Program at the Employee Benefit Research Institute.

(>
(An e-book reader could be a software application for use on the pc, for ... Reader application, or possibly a book-sized laptop that's employed entirely as a.

Tail-On
zincporphyrin-fullerene dyads exhibit seven one-electron reversible redox reactions within the accessible potential window of the solvent and the measured ...

Tail-On
efficiency changes to some extent in comparison with the results obtained for the “tail-on” form, suggesting the presence of some ... visible spectrum,11 and small reorganization energy in electron- .... ring of fulleropyrrolidine to the metal ce

Reference Distributions and Inequality Measurement
(1) then yn would automatically be set to sup (X) where X is the support of F∗. 2 .... However, two major drawbacks make this index useless in practice: 1.

disappearing tail lights.pdf
Page 1 of 1. disappearing tail lights.pdf. disappearing tail lights.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying disappearing tail lights.pdf.

dragon tail uranium.pdf
dragon tail uranium.pdf. dragon tail uranium.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying dragon tail uranium.pdf.

Cross-Sectional Distributions and Power Law with ...
Nov 24, 2014 - Therefore log wealth follows the Brownian motion with drift g− 1. 2 v2 and volatil- ..... Toda and Walsh (2014) documents that cross- sectional ...

pdf-1853\lognormal-distributions-theory-and-applications-statistics-a ...
... of the apps below to open or edit this item. pdf-1853\lognormal-distributions-theory-and-applications-statistics-a-series-of-textbooks-and-monographs.pdf.

Skewed Wealth Distributions: Theory and Empirics - Department of ...
Indeed, the top end of the wealth distribution obeys a power law ..... In equilibrium y(h) is homogeneous of degree m/(1−a) > 1 in h: small differences in skills ..... Returns on private equity have an even higher idiosyncratic dispersion ..... 188

Tail Probabilities of Gaussian Suprema and Laplace ...
where d and σ are some important numeric parameters of the field x and Φ is the distribution function of the standard normal law. We apply this relation to the investigation of the Gaussian measure of large balls in the space lp in order to general

Bose-Einstein and Fermi-Dirac distributions in ...
Jul 21, 2009 - ence, biology, and economics 7 . One of the .... ba a ua−1e−bu. 14. The upper lower sign in Eqs. 9 , 10 , and 13 denotes boson fermion case ...

Tail and Volatility Indices from Option Prices
the quadratic variation of a jump-free process (“integrated variance”).2 ... of correctly accounting for jumps when estimating stock return variability. ..... shows that the annualized D(T) is O(T) for small T for stochastic volatility models. ..

Tail risk premia and return predictability - Duke Economics
SCOR/IDEI conference on Extreme Events and Uncertainty in Insurance and Finance in Paris, France, the 2015 Midwest Finance Association meetings,. Chicago and ..... Empirically, however, with discretely sampled prices and options data, it is impossibl

Multiple Tail Median and Bootstrap Technique for ...
pareto distribution (GPD) arises as the limiting distribution. The concept of .... standard normal cumulative distribution function applied to the CDF (c) Logarithmic.

Derandomizing Algorithms on Product Distributions and ...
Our main theorem for d-part product distributions is as follows. Theorem 1.5. Fix any positive integers d, n and k and let C be the class of d-part product distributions on ({0,1}n)k. Let f : {0,1}n → {0,1} be any function. Let AR and AD be algorit

NEAR MDS POSET CODES AND DISTRIBUTIONS 1 ...
q is called a Maximum Distance Separable (MDS) code if the Hamming distance .... We write (t, n, r, q) OOA for an orthogonal array of strength t in Fr,n q .