Review Article Volume 7 Issue 3
Linear inference under alpha–stable errors
Donald R Jensen
Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.
Virginia Polytechnic Institute and State University, USA
Correspondence: Donald R Jensen, Professor Emeritus, Virginia Polytechnic Institute and State University, Blacksburg VA, 24061, USA, Tel 540 639 0865
Received: May 02, 2018 | Published: May 23, 2018
Citation: Jensen DR. Linear inference under alpha–stable errors. Biom Biostat Int J. 2018;7(3):205–210. DOI: 10.15406/bbij.2018.07.00210
Download PDF
Abstract
Linear inference remains pivotal in statistical practice, despite errors often having excessive tails and thus deficient of moments required in conventional usage. Such errors are modeled here via spherical –stable measures on with stability index arising in turn through multivariate central limit theory devoid of the second moments required for Gaussian limits. This study revisits linear inference under –stable errors, focusing on aspects to be salvaged from the classical theory even without moments. Critical entities include Ordinary Least Squares solutions, residuals, and conventional ratios in inference. Closure properties are seen in that OLS solutions and residual vectors under –stable errors also have –stable distributions, whereas F ratios remain exact in level and power as for Gaussian errors. Although correlations are undefined for want of second moments, corresponding scale parameters are seen to gauge degrees of association under –stable symmetry.
AMS subject classification: 62E15, 62H15, 62J20
Keywords:excessive errors, central limit theory, stable laws, linear inference
Introduction
Models here are with error vector Classical linear inference rests heavily on means, variances, correlations, skewness and kurtosis parameters, these requiring moments to fourth order. To the contrary, distributions having excessive tails, and devoid of moments even of first or second order, arise in a variety of circumstances. These encompass acoustics, image processing, radar tracking, biometrics, portfolio analysis and risk management in finance, and other venues in contemporary practice. Supporting references include1–6 monographs of note are,7–9 together with the recent work of Nolan.10 In these settings the classical foundations necessarily must be reworked.
To place this study in perspective, alternatives to Gaussian laws long have been sought in theory and practice, culminating in the class consisting of elliptically contoured distributions in centered at 0 with scale parameters
These typically are taken to be rich in moments, and to provide alternatives to the use of large-sample approximate Gaussian distributions under conditions for central limit theory. Comprehensive treatises on the theory and applications of these models are.11–13
In contrast, errors having excessive tails are modeled on occasion via spherically symmetric stable distributions in with index These comprise the limit distributions of standardized vector sums, specifically, Gaussian limits at Cauchy limits at and corresponding stable limits otherwise. These distributions are contained in the class thus sharing its essential geometric features, but instead are deficient in moments usually ascribed to Despite the venues cited, stable errors have seen limited usage for want of closed expressions for stable density functions, known only in selected cases but topics of continuing research. Nonetheless, findings reported here rest on well defined characteristic functions (), on critical representations for these, and on the inversion of the latter in order to represent the -stable densities themselves. Even here a divide emerges between independent, identically distributed -stable sequences, and dependent variables, as reported in Jensen14 and as summarized here for completeness in an Appendix. In addition, many findings of the present study are genuinely nonparametric, in applying for all or portions of distributions in the range and thus remaining distribution-free within that class. An outline follows.
Notation and technical foundations are provided in the next major section, Preliminaries, to include Notation and accounts of Special Distributions, Central Limit Theory and Essentials of Distributions as subsections. The principal sections following these address Linear Models under Errors, with a separate subsection on Models Having Cauchy Errors, and Conclusions. Collateral topics are contained for completeness in Appendix A.
Preliminaries
Notation
Spaces of note include as Euclidean space, with as the real symmetric matrices and as their positive definite varieties. Vectors and matrices are set in bold type; the transpose, inverse, trace, and determinant of are and the unit vector in is ; and is the identity.
Moreover, is a block-diagonal array, and is the spectral square root of
Special distributions
Given its distribution, expected value, and dispersion matrix are designated as and with variance on Specifically, is Gaussian on with parameters Distributions on of note include the and related distributions, together with the Snedecor -Fisher these having as degrees of freedom and a noncentrality parameter. The characteristic function for is the expectation with argument and a standard source is Lukacs & Laha.15 Attention is drawn subsequently to probability density (pdf) and cumulative distribution (cdf) functions. Moreover, the class consists of elliptically contoured distributions in centered at 0 and having chf’s of type We adopt the following.
Definition 1 A distribution P on is said to be monotone unimodal about if for every and every convex set C symmetric about is no increasing in See reference.16
Central limit theory
For vectors in let and consider limit distributions of type for suitably chosen C. On specializing from the elliptical class having location-scale parameters we consider stable limit distributions as follow on identifying with
Definition 2 Let designate an elliptical -Stable law on centered at with scale parameters and stable index having the Each marginal distribution of on namely has the chf Let designate the class of all such distributions.
Remark 1 is of full rank and has a density in if and only if is of full rank in otherwise is concentrated in a subspace of of dimension equal to the rank of
To continue, designate by the domain of attraction of each element in in having in That is, their chfs satisfy when scaled suitably. Specifically, the distributions attracted to Gaussian limits comprise all distributions in having second moments. More generally, domains of attraction to distributions in have been studied in references,17–20 to include Lindeberg conditions in Barbosa & Dorea,21 together with rates of convergence to stable limits in Paulauskas.22
Remark 2 That has elliptical contours derives from the spherical chf through the transformation a
Essentials for distributions
As noted, closed expressions for densities are known in selected cases only, to be complemented by results to follow. Here is the Gaussian density on having parameters and is the provisional density corresponding to The following properties are essential.
Theorem 1 Let have the chf and density function if defined. Then the following properties hold.
- For nonsingular, is absolutely continuous in having a density function
- The Gaussian mixture holds with as a mixing on
- The Gaussian mixture holds with as a mixing cdf as before;
- is monotone unimodal with mode at for each
- Let be scale-invariant; then for the distribution is identical to its normal-theory form under
Proof: Conclusion (i) is Theorem 6.5.4 of Press.23 Conclusion (ii) invokes a result of Hartman et al.24 namely, the process is spherically invariant if and only if, for each n andthe chf is a scale mixture of spherical Gaussian s on to give conclusion (ii) on transforming from spherical to elliptical symmetry. To continue, is the standard inversion formula from s to densities in with as Lebesgue measure, so that from conclusion (ii) we recover
(1)
Reversing the order of integration inverts the Gaussian chf to give conclusion (iii). Conclusion (iv) follows as in Wolfe25 conjunction with conclusion (iii). Finally observe from conclusion (iii), with that the change of variables behind the integral is independent of since is scale-invariant independently of s to give conclusion (v).
It remains to reconsider degrees of association in distributions, as distinct from the classical second-moment correlation parameters For with the elements of s serve instead as scale parameters, since and are dimensionless. As to whether again might quantify associations for a definitive answer is supplied in the following.
Lemma 1 Let For , the parameters serve to quantify degrees of association between the extent of their association increasing with
Proof: It suffices to consider centered at with On taking clearly is symmetric about 0 with scale parameter A result of Fefferman et al.26 shows for each that is decreasing in thus increasing in Equivalently, as identifying the sense in which become increasingly indistinguishable, thus associated, with increasing values of
Definition 3 For with the entities are called pseudo–correlation, specifically, -association parameters
Linear models under errors
The principal findings
Take with as centering and scale parameters, where solutions , as minimally dispersed unbiased linear estimates, are available here only for whereas alternative moment criteria necessarily are subject to moment constraints. Specifically, for scalars under loss the risk is undefined for as for Cauchy errors at Moreover, risk functions are defined but concave for and for are convex, at issue in attaining global optima. Versions of these apply also for vector parameters; however, minimal risk estimation would require not only knowledge regarding but also optimizing algorithms. Instead we seek what might be salvaged from classical linear models under the constraints of errors. In addition, portions of our findings extend beyond Gauss–Markov theory and to include the much larger class of equivariant estimators.
Definition 4 An estimator for is translation –equivariant if for then for every
On taking the elements of comprise the observed residuals and the residual mean square. Normal–theory tests for against utilize having the distribution with We proceed to examine essential properties of as ranges over where some expressions simplify on taking then reinstating as needed. The following properties are fundamental.
Theorem 2 Given consider with as the residual vector, and Then
(i) with a distribution on of rank s
(ii) The marginal’s are centered at with scale parameters and
(iii) on of rank centered at 0with scale parameters
(iv) has density with as the central chi–squared density on degrees of freedom, scaled by S, and with as a mixing distribution.
Proof. Let and to project onto the error space, so that operates on y to give
(2)
the latter of order and rank The chf with argument is = with argument replacing t, to give conclusion (i). Next partition with to obtain = = The marginal s of and e follow on setting then in succession, to give conclusions (ii) and (iii). Conclusion (iv) attributes to Hartman et al.24 through Theorem 1. Specifically, a change of variables behind the integral on the right of Theorem 1(iii) gives the conditional density for namely the scaled chi–squared density depending on s so that integrating with respect to gives conclusion (iv).
Remark 3 That is block–diagonal in conclusion (i), assures under errors that are –unassociated as in Definition 3, well known to be mutually uncorrelated under second moments.
It remains to reexamine topics in inference under errors. The following are germane.
Definition 5 An estimator for is said to be linearly median unbiased if and only if the median for each and to be modal unbiased provided that the mode
Definition 6 An estimator for is said to be more concentrated about than provided that for every convex set in symmetric under reflection about
Essential properties under errors include the following.
Theorem 3 For consider properties of the solutions and of the equivariant estimators of Definition 4.
- is unbiased for for each
- is linearly median unbiased for
- is most concentrated about among all median–unbiased linear estimators;
- is modal unbiased for
- is consistent for in a sequence of identical but dependent experiments
- The null distribution of has exactly its normal–theory form; the power increases with increasing and such tests are unbiased;
- is most concentrated about among all modal–unbiased linear estimators.
is most concentrated about among all equivariant estimators
Proof. Conclusions (i)–(vi) carry over from reference Jensen DR27 without benefit of moments, regardless of membership in the class. To consider concentration properties of modal–unbiased estimators, begin with and consider with so that
That should have mode at it is necessary that i.e. accordingly, with Clearly the matrix is positive semi definite, giving conclusion (vii) from Jensen.28 Conclusion (viii) follows from Theorem 2.7 of Burk et al.29 since distributions are unimodal from Theorem 1(iv).
Spherical cauchy errors
Spherical multivariate t errors on v degrees of freedom trace to Zellner30 to include Cauchy errors at equivalently, at in the class Specializing from Theorem 1(ii), the spherical Cauchy chf is Recast in terms of linear inference, we have the following specialization of Theorems 1 and 2.
Corollary 1 Under the conditions of Theorems and 2, the following properties hold under spherical Cauchy errors.
- The spherical Cauchy density on at is
where the mixing density.
- The elliptical Cauchy density for on is
(3)
Proof. The multivariate distribution on is that of from with as a sample standard deviation on degrees of freedom, known to be spherical Cauchy at This gives conclusion (i) on specializing the conventional multivariate density. Conclusion (ii) follows directly on specializing Theorem 2(ii) at
Case study
The viability for each of biological specimens was recorded after storage under additives and as listed in Table 1;31 Walpole RE & Myers RH.31 The model is where the errors are taken to be spherical Cauchy. The conventional solutions are as elements of The matrix its inverse and the transition of the latter into its -association form of Definition 3 are given respectively by
The following properties are evident.
- The elliptical Cauchy density for is given by equation (3) with and as listed for these data.
- The solution is both linear median–unbiased and modal–unbiased, and among all such estimators is most concentrated about
- The normal–theory confidence set holds exactly with confidence coefficient where is the residual mean square on degrees of freedom, and is the upper 0.95 percentile for
- As correlations are undefined, elements of the –association matrix nonetheless do serve to quantify the degrees of association among as in Definition 3, on taking in Lemma 1.
- In particular, is negatively associated with whereas are themselves positively associated (Table 1).
|
|
|
|
|
|
0.5 |
1.74 |
3.3 |
31.2 |
6.32 |
5.42 |
0.9 |
6.22 |
8.41 |
38.4 |
10.52 |
4.63 |
0.4 |
1.19 |
11.6 |
26.7 |
1.22 |
5.85 |
0.4 |
4.1 |
6.62 |
25.9 |
6.32 |
8.72 |
0 |
4.08 |
4.42 |
25.2 |
4.15 |
7.6 |
0.7 |
10.15 |
4.83 |
35.7 |
1.72 |
3.12 |
0.5 |
1.7 |
5.3 |
|
|
|
Table 1 The viability of biological specimens after storage under additives and
Summary and discussion
This study offers further insight into the class comprising the spherical –stable laws as limit distributions under conditions for central limit theory. In addition to their essential properties, expanded here to include representations for density functions, this study focuses on models of type when devoid of moments undergirding the classical theory. Recall that normal–theory procedures routinely are applied in practice as large–sample approximations in distributions attracted to Gaussian laws. Specifically, Berry–Esséen bounds on rates of convergence to Gaussian limits are given Jensen,32,33 with special reference to linear models in Jensen.34,35 Results here validate corresponding large–sample approximations for distributions attracted to laws as cited in references.17–21 Of similar importance are rates of convergence to stable limits as in Paulauskas.22 By showing that many standard properties carry over in essence under significantly weakened assumptions, this study gives further credence to the widely and correctly held view that Gauss–Markov estimation and normal theory inferences extend considerably beyond the confines of the classical theory.
A appendix
The preceding study has developed exclusively around spherically dependent errors, as alternative to stable errors. This choice is prompted by discrepancies encountered in the simplest case with common location parameter. Essential details from Jensen14 may be summarized as follows. To distinguish the disparate properties of vs spherical models, sequences are fundamental in order to take limits. Of significance is that averages of sequences with may be inconsistent for sequences but consistent under symmetry. Accordingly, let Essentials follow.
Lemma 2 Given consider the case that either are with chf or are on with chf Let and and consider the standardized variables
- Consistent and inconsistent properties of for sequences are as follow.
For for so that is inconsistent for
For so that is inconsistent for
For for so that is consistent for
- For sequences is consistent for for every
- For iid sequences with diverges to an improper distribution.
- For sequences the limit being identical to each component.
Acknowledgement
Conflict of interest
Authors declare that there is no conflict of interest.
References
- Cheng BN, Rachev ST. Multivariate stable futures prices. Math Finance. 1995;5:133–153.
- Kuruoglu E, Zerubia J. Modeling SAR Images with a generalization of the Rayleigh distribution. IEEE Trans Image Processing. 2009;13(4):527–533.
- Qiou Z, Ravishanker N, Dey DK. Multivariate survival analysis with positive stable frailties. Biometric. 1999;55(2):637–644.
- Bonato M. Modeling fat tails in stock returns: A multivariate stable–GARCH approach. Computational Statist. 2012;27(3):499–521.
- Kim YS, Glacometti R, Rachev ST, et al. Measuring financial risk and portfolio optimization with a non–Gaussian multivariate model. Ann Operations Research. 2012;201(1):325–343.
- Tsionas EG. Estimating multivariate heavy tails and principal directions easily, with an application to international exchange rates. Statist Probability Letters. 2012;82(11):1986–1989.
- Samorodnitsky G, Taqqu M. Stable Non–Gaussian Random Processes. USA: Chapman–Hall; 1994.
- Arce GR. Nonlinear Signal Processing. New York: Wiley; 2000.
- Chernobai A, Rachev ST, Fabozzi FJ. Operational Risk: A Guide to Basel II Capital Requirements, Models and Analysis. USA: Wiley; 2007.
- Nolan JP. Multivariate elliptically contoured stable distributions: Theory and estimation. Computational Statist. 2013;28(5):2067–2089.
- Fang KT, Anderson TW. Statistical Inference in Elliptically Contoured and Related Distributions. New York: Allerton Press; 1990.
- Fang KT, Zhang YT. Generalized Multivariate Analysis. Beijing: Science Press; 1990.
- Gupta AK, Varga T. Elliptically Contoured Models in Statistics. Dordrecht: Kluwer Academic Publishers; 1993.
- Jensen DR. Limits and inferences for alpha–stable variables. Biostat Biometrics Open Acc J. 2017;4(1):1–4.
- Lukacs E, Laha RG. Applications of Characteristic Functions. New York: Hafner; 1964.
- Dharmadhikari S, Joag Dev K. Unimodality, Convexity, and Applications. New York: Academic; 1998.
- Rvaeva EL. On domains of attraction of multidimensional dlstributions. Transl Math Statist Prob. 1962;2:183–205.
- Kuelbs J, Mandrekar V. Domains of attraction of stable measures on a Hilbert space. Stucia Math. 1974;50:149–162.
- De Haan L, Resnick SI. Derivatives of regularly varying functions in Rd and domains of attraction of stable distributions. Stochastic Processes Applic. 1979;8(3):349–356.
- Ruschendorf L, Rueschendorf L, Rachev ST, et al. A new ideal metric with applications to multivariate stable limit theorems. Probability Theory and Related Fields. 1992;94(2):163–187.
- Barbosa EG, Dorea CCY. A note on the Lindeberg condition for convergence to stable laws in Mallows distance. Bernoulli. 2009;15(3):922–924.
- Paulauskas V. On the rate of convergence to bivariate stable laws. Lithuanian Math J. 2009;49(426):426–445.
- Press SJ. Applied Multivariate Analysis. USA: Holt, Rinehart, Winston; 1972.
- Hartman P, Wintner A. On the spherical approach to the normal distribution law. Amer J Math.1940;62(1):759–779.
- Wolfe SJ. On the unimodality of spherically symmetric stable distribution functions. J Multvariate Anal. 1975;8(1):236–242.
- Fefferman C, Jodeit M, Perlman. A spherical surface measure inequality for convex sets. Proc Amer Math Soc. 1972;33(1):114–119.
- Jensen DR. Linear models without moments. Biometrika. 1979;66(3):611–617.
- Jensen DR. Ordering ellipsoidal measures: Scale and peakedness orderings. SIAM J Appl Math. 1984;44(6):1226–1231.
- Burk R, Hwang JT. Optimality of the least squares estimator. J Multivariate Analysis. 1989;30(2):245–254.
- Zellner A. Bayesian and non–Bayesian analysis of the regression model with multivariate Student–t error terms. J Amer Statist Assoc. 1976;71:400–405.
- Walpole RE, Myers RH. Probability and Statistics for Engineers and Scientists, 4th ed. USA: Macmillan; 1989.
- Jensen DR. Monotone bounds on the chi–squared approximation to the distribution of Pearson’s X statistics 12. Australian J Statist. 1973;15(2):65–70.
- Jensen DR. On approximating the distribution of Friedman’s 75–175–175–1 and related statistics. Metrika. 1977;24(1):75–85.
- Jensen DR, Mayer LS. Normal–theory approximations to tests for linear hypotheses. Ann Statist. 1975;3(2):429–444.
- Jensen DR, Mayer LS, Myers RH. Optimal designs and large–sample tests for linear hypotheses. Biometrika. 1975;62(1):71–78.
©2018 Jensen. This is an open access article distributed under the terms of the,
which
permits unrestricted use, distribution, and build upon your work non-commercially.