Linear inference under alpha–stable errors

doi:10.15406/bbij.2018.07.00210

Abstract

Linear inference remains pivotal in statistical practice, even though errors often have excessive tails and thus lack the moments required in conventional usage. Such errors are modeled here via spherical $α$-stable measures on $ℝ^{n}$ with stability index $α \in (0, 2],$ arising in turn through multivariate central limit theory devoid of the second moments required for Gaussian limits. This study revisits linear inference under $α$-stable errors, focusing on aspects to be salvaged from the classical theory even without moments. Critical entities include Ordinary Least Squares (OLS) solutions, residuals, and conventional $F$ ratios in inference. Closure properties are seen in that OLS solutions and residual vectors under $α$-stable errors also have $α$-stable distributions, whereas $F$ ratios remain exact in level and power as for Gaussian errors. Although correlations are undefined for want of second moments, corresponding scale parameters are seen to gauge degrees of association under $α$-stable symmetry.

AMS subject classification: 62E15, 62H15, 62J20

Keywords: excessive errors, central limit theory, stable laws, linear inference

Introduction

Models here are ${Y = X β + ε}$ with error vector $ε \in ℝ^{n}.$ Classical linear inference rests heavily on means, variances, correlations, skewness and kurtosis parameters, these requiring moments to fourth order. To the contrary, distributions having excessive tails, and devoid of moments even of first or second order, arise in a variety of circumstances. These encompass acoustics, image processing, radar tracking, biometrics, portfolio analysis and risk management in finance, and other venues in contemporary practice. Supporting references include,^1–6 and monographs of note are,^7–9 together with the recent work of Nolan.¹⁰ In these settings the classical foundations necessarily must be reworked.

To place this study in perspective, alternatives to Gaussian laws long have been sought in theory and practice, culminating in the class ${S_{n} (0, Σ)}$ consisting of elliptically contoured distributions in $ℝ^{n}$ centered at 0 with scale parameters $Σ .$ These typically are taken to be rich in moments, and to provide alternatives to the use of large-sample approximate Gaussian distributions under conditions for central limit theory. Comprehensive treatises on the theory and applications of these models are.^11–13

In contrast, errors having excessive tails are modeled on occasion via spherically symmetric $α$-stable $(S α S)$ distributions in $ℝ^{n}$ with index $α \in (0, 2].$ These comprise the limit distributions of standardized vector sums, specifically, Gaussian limits at $α = 2,$ Cauchy limits at $α = 1,$ and corresponding stable limits otherwise. These distributions are contained in the class ${S_{n} (0, I_{n})},$ thus sharing its essential geometric features, but instead are deficient in moments usually ascribed to ${S_{n} (0, I_{n})}.$ Despite the venues cited, $α$-stable errors have seen limited usage for want of closed expressions for stable density functions, known only in selected cases but topics of continuing research. Nonetheless, findings reported here rest on well defined characteristic functions (chfs), on critical representations for these, and on the inversion of the latter in order to represent the $α$-stable densities themselves. Even here a divide emerges between independent, identically distributed $α$-stable sequences, and dependent $S α S$ variables, as reported in Jensen¹⁴ and as summarized here for completeness in an Appendix. In addition, many findings of the present study are genuinely nonparametric, in applying for all or portions of distributions in the range $α \in (0, 2],$ and thus remaining distribution-free within that class. An outline follows.

Notation and technical foundations are provided in the next major section, Preliminaries, to include Notation and accounts of Special Distributions, Central Limit Theory and Essentials of $S α S$ Distributions as subsections. The principal sections following these address Linear Models under $S α S$ Errors, with a separate subsection on Models Having Cauchy Errors, and Conclusions. Collateral topics are contained for completeness in Appendix A.

Preliminaries

Notation

Spaces of note include $ℝ^{n}$ as Euclidean $n$-space, with $S_{n}$ as the real symmetric $(n \times n)$ matrices and $S_{n}^{+}$ as their positive definite varieties. Vectors and matrices are set in bold type; the transpose, inverse, trace, and determinant of $A$ are $A',$ $A^{- 1},$ $tr (A),$ and $| A |;$ the unit vector in $ℝ^{n}$ is $1_{n} = [1, \dots, 1]';$ and $I_{n}$ is the $(n \times n)$ identity.

Moreover, $Diag (A_{1}, \dots, A_{k})$ is a block-diagonal array, and $Σ^{\frac{1}{2}}$ is the spectral square root of $Σ \in S_{n}^{+}.$

Special distributions

Given $Y = [Y_{1}, \dots, Y_{n}]' \in ℝ^{n},$ its distribution, expected value, and dispersion matrix are designated as $L (Y),$ $E (Y) = μ,$ and $V (Y) = Σ,$ with variance $Var (Y) = σ^{2}$ on $ℝ^{1}.$ Specifically, $L (Y) = N_{n} (μ, Σ)$ is Gaussian on $ℝ^{n}$ with parameters $(μ, Σ).$ Distributions on $ℝ^{1}$ of note include the $χ^{2} (u; ν, λ)$ and related $χ (u; ν, λ)$ distributions, together with the Snedecor-Fisher $F (u; ν_{1}, ν_{2}, λ),$ these having $(ν, ν_{1}, ν_{2})$ as degrees of freedom and $λ$ a noncentrality parameter. The characteristic function (chf) for $Y \in ℝ^{n}$ is the expectation $ϕ_{Y} (t) = E [e^{ι t' Y}]$ with argument $t' = [t_{1}, \dots, t_{n}]$ and $ι = \sqrt{- 1};$ a standard source is Lukacs & Laha.¹⁵ Attention is drawn subsequently to probability density (pdf) and cumulative distribution (cdf) functions. Moreover, the class ${L (Z) \in S_{n} (0, Σ)}$ consists of elliptically contoured distributions in $ℝ^{n}$ centered at 0 and having chfs of type $ϕ_{Z} (t) = ψ (t' Σ t).$ We adopt the following.

Definition 1 A distribution $P$ on $ℝ^{n}$ is said to be monotone unimodal about $0 \in ℝ^{n}$ if for every $y \in ℝ^{n}$ and every convex set $C$ symmetric about $0 \in ℝ^{n},$ $P [C + k y]$ is nonincreasing in $k \in [0, \infty).$ See reference.¹⁶

Central limit theory

For iid vectors $\{Z_{1}, Z_{2}, Z_{3}, \dots\}$ in $ℝ^{n},$ let ${\bar{Z}}_{N} = N^{- 1} [Z_{1} + \dots + Z_{N}],$ and consider limit distributions of type $L_{\infty} (c {\bar{Z}}_{N}) = \lim_{N \to \infty} L (c {\bar{Z}}_{N})$ for suitably chosen $c.$ On specializing from the elliptical class $S_{n} (δ, Σ)$ having location-scale parameters $(δ, Σ),$ we consider $α$-stable limit distributions as follow on identifying $L_{\infty} (c {\bar{Z}}_{N})$ with $L (Z).$

Definition 2 Let $L (Z) \in S_{n}^{α} (δ, Σ)$ designate an elliptical $α$-stable law on $ℝ^{n}$ centered at $δ \in ℝ^{n}$ with scale parameters $Σ$ and stable index $α \in (0, 2],$ having the chf $ϕ_{Z} (t) = \exp [ι t' δ - \frac{1}{2} {(t' Σ t)}^{\frac{α}{2}}].$ Each marginal distribution of $S_{n}^{α} (δ 1_{n}, I_{n})$ on $ℝ^{1},$ namely $S_{1}^{α} (δ, 1),$ has the chf $ϕ_{Z_{i}} (t) = \exp [ι t δ - \frac{1}{2} | t |^{α}].$ Let $S α S = \{S_{n}^{α} (δ, Σ); (δ, Σ) \in (ℝ^{n} \otimes S_{n})\}$ designate the class of all such distributions.
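The marginal chf of Definition 2 is simple to evaluate numerically. The following is a minimal sketch, not part of the original study; the function names `sas_chf` and `gaussian_chf` are hypothetical, and only the Python standard library is assumed. It checks that at $α = 2$ the marginal chf coincides with that of $N_{1} (δ, 1).$

```python
import cmath


def sas_chf(t, delta=0.0, alpha=1.0):
    """Marginal chf exp[i*t*delta - 0.5*|t|^alpha] of S_1^alpha(delta, 1)."""
    if not 0.0 < alpha <= 2.0:
        raise ValueError("alpha must lie in (0, 2]")
    return cmath.exp(1j * t * delta - 0.5 * abs(t) ** alpha)


def gaussian_chf(t, mean=0.0, var=1.0):
    """chf of N(mean, var), used here as the alpha = 2 reference."""
    return cmath.exp(1j * t * mean - 0.5 * var * t * t)


if __name__ == "__main__":
    # At alpha = 2 the two columns printed below should agree.
    for t in (-2.0, 0.5, 3.0):
        print(t, sas_chf(t, delta=1.0, alpha=2.0), gaussian_chf(t, mean=1.0))
```

For $α < 2$ the modulus $\exp [- \frac{1}{2} | t |^{α}]$ has a cusp at $t = 0,$ reflecting the absence of second moments.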

Remark 1 $L (Z)$ is of full rank and has a density in $ℝ^{n}$ if and only if $Σ$ is of full rank in $S_{n}^{+};$ otherwise $L (Z)$ is concentrated in a subspace of $ℝ^{n}$ of dimension equal to the rank of $Σ.$

To continue, designate by $D_{α}$ the domain of attraction of each element $Z_{i}$ in $\{Z_{1}, Z_{2}, Z_{3}, \dots\}$ in $ℝ^{n}$ having $\lim_{N \to \infty} L (c {\bar{Z}}_{N})$ in $S α S.$ That is, their chfs satisfy $\lim_{N \to \infty} ϕ_{c {\bar{Z}}_{N}} (t) = \exp [ι t' δ - \frac{1}{2} {(t' Σ t)}^{\frac{α}{2}}]$ when scaled suitably. Specifically, the distributions $D_{2}$ attracted to Gaussian limits comprise all distributions $L (Z_{i})$ in $ℝ^{n}$ having second moments. More generally, domains of attraction to distributions in $S α S$ have been studied in references,^17–20 to include Lindeberg conditions in Barbosa & Dorea,²¹ together with rates of convergence to stable limits in Paulauskas.²²

Remark 2 That $ϕ_{Z} (t) = \exp [ι t' δ - \frac{1}{2} {(t' Σ t)}^{\frac{α}{2}}]$ has elliptical contours derives from the spherical chf $ϕ_{U} (t) = \exp [ι t' θ - \frac{1}{2} {(t' t)}^{\frac{α}{2}}]$ through the transformation $Z = Σ^{\frac{1}{2}} U$ with $δ = Σ^{\frac{1}{2}} θ.$

Essentials for $S α S$ distributions

As noted, closed expressions for $S α S$ densities are known in selected cases only, to be complemented by results to follow. Here $g_{n} (u; δ, Σ)$ is the Gaussian density on $ℝ^{n}$ having parameters $(δ, Σ),$ and $f_{n}^{α} (u; δ, Σ)$ is the provisional $S α S$ density corresponding to $ϕ_{Z} (t) = \exp [ι t' δ - \frac{1}{2} {(t' Σ t)}^{\frac{α}{2}}].$ The following properties are essential.

Theorem 1 Let $L (Z) \in S_{n}^{α} (δ, Σ)$ have the chf $ϕ_{Z} (t) = \exp [ι t' δ - \frac{1}{2} {(t' Σ t)}^{\frac{α}{2}}]$ and density function $f_{n}^{α} (z; δ, Σ)$ if defined. Then the following properties hold.

(i) For $Σ$ nonsingular, $L (Z) \in S_{n}^{α} (δ, Σ)$ is absolutely continuous in $ℝ^{n},$ having a density function $f_{n}^{α} (z; δ, Σ);$
(ii) The Gaussian mixture $ϕ_{Z} (t) = \int_{0}^{\infty} e^{ι t' δ - t' Σ t /(2 s)} d Ψ (s; α)$ holds with $Ψ (s; α)$ as a mixing cdf on $ℝ^{1};$
(iii) The Gaussian mixture $f_{n}^{α} (z; δ, Σ) = \int_{0}^{\infty} g_{n} (z; δ, s^{- 1} Σ) d Ψ (s; α)$ holds with $Ψ (s; α)$ as a mixing cdf as before;
(iv) $L (Z) \in S_{n}^{α} (δ, Σ)$ is monotone unimodal with mode at $δ,$ for each $α \in (0, 2);$
(v) Let $T (Z) = U \in ℝ^{k}$ be scale-invariant; then for $L (Z) \in S_{n}^{α} (δ, Σ),$ the distribution $L (U)$ is identical to its normal-theory form under $L (Z) = N_{n} (δ, Σ).$

Proof: Conclusion (i) is Theorem 6.5.4 of Press.²³ Conclusion (ii) invokes a result of Hartman et al.,²⁴ namely, the process $\{Z_{t}; t = 1, 2, \dots\}$ is spherically invariant if and only if, for each $n$ and $Z = [Z_{1}, \dots, Z_{n}]',$ the chf $ϕ_{Z} (t)$ is a scale mixture of spherical Gaussian chfs on $ℝ^{n},$ to give conclusion (ii) on transforming from spherical to elliptical symmetry. To continue, $f_{Z} (z) = {(2 π)}^{- n} \int_{ℝ^{n}} e^{- ι t' z} ϕ_{Z} (t) Λ (d t)$ is the standard inversion formula from chfs to densities in $ℝ^{n}$ with $Λ (\cdot)$ as Lebesgue measure, so that from conclusion (ii) we recover

$f_{n}^{α} (z; δ, I_{n}) = \frac{1}{{(2 π)}^{n}} \int_{ℝ^{n}} e^{- ι t' z} \int_{0}^{\infty} e^{ι t' δ - t' t /(2 s)} d Ψ (s; α) Λ (d t).$ (1)

Reversing the order of integration inverts the Gaussian chf to give conclusion (iii). Conclusion (iv) follows as in Wolfe²⁵ in conjunction with conclusion (iii). Finally observe from conclusion (iii), with density $\int_{0}^{\infty} g_{n} (z; δ, Σ / s) d Ψ (s; α),$ that the change of variables $Z \to U = T (Z)$ behind the integral is independent of $Ψ (s; α),$ since $T (Z)$ is scale-invariant independently of $s,$ to give conclusion (v).

It remains to reconsider degrees of association in $S α S$ distributions, as distinct from the classical second-moment correlation parameters ${ρ_{i j} = σ_{i j} /(σ_{i i} σ_{j j})^{\frac{1}{2}}}.$ For $L (Z) \in S_{k}^{α} (δ, Σ)$ with $α < 2,$ the elements of $Σ$ serve instead as scale parameters, since $U = Σ^{- \frac{1}{2}} Z$ and $U' U = Z' Σ^{- 1} Z$ are dimensionless. As to whether ${ρ_{i j}}$ again might quantify associations for $α < 2,$ a definitive answer is supplied in the following.

Lemma 1 Let $L (Z) \in S_{n}^{α} (δ, Σ).$ For $α < 2,$ the parameters ${ρ_{i j} = σ_{i j} /(σ_{i i} σ_{j j})^{\frac{1}{2}}}$ serve to quantify degrees of association between $(Z_{i}, Z_{j}),$ the extent of their association increasing with $ρ_{i j}.$

Proof: It suffices to consider $(Z_{1}, Z_{2})$ centered at $(0, 0)$ with $Σ = [\begin{matrix} 1 & ρ \\ ρ & 1 \end{matrix}].$ On taking $U = (Z_{1} - Z_{2}),$ $L (U)$ clearly is symmetric about 0 with scale parameter $σ_{U} = 2 (1 - ρ).$ A result of Fefferman et al.²⁶ shows for each $c > 0$ that $P (U \in (- c, c))$ is decreasing in $σ_{U},$ thus increasing in $ρ.$ Equivalently, $P (| Z_{1} - Z_{2} | \leq c) ↑ 1$ as $ρ ↑ 1,$ identifying the sense in which $(Z_{1}, Z_{2})$ become increasingly indistinguishable, thus associated, with increasing values of $ρ.$
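The monotonicity in the proof can be made concrete at $α = 1.$ The sketch below is illustrative only and assumes the chf of Definition 2, under which $U = Z_{1} - Z_{2}$ has chf $\exp [- \frac{1}{2} σ_{U}^{\frac{1}{2}} | t |],$ namely a Cauchy law with scale $\frac{1}{2} σ_{U}^{\frac{1}{2}};$ the function names are hypothetical.

```python
import math


def difference_scale(rho):
    """Scale parameter a' Sigma a of U = Z1 - Z2, with a = (1, -1)."""
    sigma = [[1.0, rho], [rho, 1.0]]
    a = [1.0, -1.0]
    return sum(a[i] * sigma[i][j] * a[j] for i in range(2) for j in range(2))


def cauchy_interval_prob(c, rho):
    """P(|Z1 - Z2| <= c) at alpha = 1 under the chf of Definition 2.

    Assumption: U is Cauchy, centered at 0, with scale
    gamma = 0.5 * sqrt(sigma_U), so the probability is (2/pi)*atan(c/gamma).
    """
    gamma = 0.5 * math.sqrt(difference_scale(rho))
    if gamma == 0.0:
        return 1.0  # rho = 1: Z1 and Z2 coincide
    return (2.0 / math.pi) * math.atan(c / gamma)


if __name__ == "__main__":
    for rho in (-0.9, -0.5, 0.0, 0.5, 0.9, 1.0):
        print(rho, difference_scale(rho), cauchy_interval_prob(1.0, rho))
```

The printed probabilities increase with $ρ$ and reach 1 at $ρ = 1,$ in keeping with the sense of association described in Lemma 1.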

Definition 3 For $L (Z) \in S_{n}^{α} (δ, Σ)$ with $α < 2,$ the entities ${ρ_{i j} = σ_{i j} /(σ_{i i} σ_{j j})^{\frac{1}{2}}}$ are called pseudo-correlations, specifically, $α$-association parameters.

Linear models under $S α S$ errors

The principal findings

Take $L (Y) \in S_{n}^{α} (X β, σ^{2} I_{n})$ with $(X β, σ^{2} I_{n})$ as centering and scale parameters, where ${Y \in ℝ^{n}, X \in F_{ n \times k}, β \in ℝ^{k}}.$ $O L S$ solutions $\overset{⌢}{β} = {(X' X)}^{ - 1} X' Y$ , as minimally dispersed unbiased linear estimates, are available here only for $α = 2,$ whereas alternative moment criteria necessarily are subject to moment constraints. Specifically, for scalars $(\overset{⌢}{θ}, θ) \in ℝ^{1}$ under loss $L (\overset{⌢}{θ}, θ) = | \overset{⌢}{θ} - θ |,$ the risk $R (\overset{⌢}{θ}) = E [L (\overset{⌢}{θ}, θ)]$ is undefined for $α < 1$ as for Cauchy errors at $α = 1.$ Moreover, risk functions ${R (\overset{⌢}{θ}) = E (| \overset{⌢}{θ} - θ |^{κ})}$ are defined but concave for ${κ < α < 1},$ and for ${1 < κ < α \leq 2}$ are convex, at issue in attaining global optima. Versions of these apply also for vector parameters; however, minimal risk estimation would require not only knowledge regarding $α,$ but also optimizing algorithms. Instead we seek what might be salvaged from classical linear models under the constraints of $S α S$ errors. In addition, portions of our findings extend beyond Gauss–Markov theory and $O L S$ to include the much larger class of equivariant estimators.

Definition 4 An estimator $δ (Y)$ for $β \in ℝ^{k}$ is translation-equivariant if, under ${Y \to Y + X b},$ it satisfies ${δ (Y + X b) = δ (Y) + b}$ for every $b \in ℝ^{k}.$

On taking $P = [I_{n} - X {(X' X)}^{ - 1} X'],$ the elements of $e = P Y$ comprise the observed residuals and $S^{2} = e' e /(n - k)$ the residual mean square. Normal-theory tests for $H_{0} : β = β_{0}$ against $H_{1} : β \neq β_{0}$ utilize $F = (\overset{⌢}{β} - β_{0})' X' X (\overset{⌢}{β} - β_{0})/(k S^{2})$ having the distribution $F (u; k, n - k, λ)$ with $λ = (β - β_{0})' X' X (β - β_{0})/ σ^{2}.$ We proceed to examine essential properties of $S_{n}^{α} (X β, σ^{2} I_{n})$ as $α$ ranges over $(0, 2),$ where some expressions simplify on taking $σ^{2} = 1,$ then reinstating $σ^{2}$ as needed. The following properties are fundamental.

Theorem 2 Given $L (Y) = S_{n}^{α} (X β, σ^{2} I_{n}),$ consider $[\overset{⌢}{β}, e]$ with $e = P Y$ as the residual vector, and $U = (n - k) S^{2} / σ^{2} .$ Then

(i) $L (\overset{⌢}{β}, e) = S_{n + k}^{α} ([β, 0], Σ),$ with $Σ = σ^{2} Diag ((X' X)^{ - 1}, P),$ a distribution on $ℝ^{n + k}$ of rank $n;$

(ii) The marginals are $L (\overset{⌢}{β}) = S_{k}^{α} (β, σ^{2} {(X' X)}^{ - 1})$ centered at $β$ with scale parameters $σ^{2} {(X' X)}^{ - 1},$ and

(iii) $L (e) = S_{n}^{α} (0, σ^{2} P)$ on $ℝ^{n}$ of rank $n - k$ centered at 0 with scale parameters $σ^{2} P;$

(iv) $U = (n - k) S^{2} / σ^{2}$ has density $f (u; ν, α) = \int_{0}^{\infty} h (u; ν, s) d Ψ (s; α)$ with $h (u; ν, s)$ as the central chi-squared density on $ν = (n - k)$ degrees of freedom, scaled by $s,$ and with $Ψ (s; α)$ as a mixing distribution.

Proof. Let $L' = {(X' X)}^{ - 1} X'$ and $P = [I_{n} - X {(X' X)}^{ - 1} X']$ to project onto the error space, so that $G = [L, P]$ operates on $Y$ to give

$Z = G' Y = [\begin{matrix} \overset{⌢}{β} \\ e \end{matrix}] = [\begin{matrix} L' \\ P' \end{matrix}] Y \in ℝ^{n + k} \quad \text{and} \quad G' G = [\begin{matrix} {(X' X)}^{ - 1} & 0 \\ 0 & P \end{matrix}],$ (2)

the latter of order $[(n + k) \times (n + k)]$ and rank $n.$ The chf with argument $s' = [s_{1}, \dots, s_{n + k}]$ is $E [\exp (ι s' Z)] = E [\exp (ι s' G' Y)] = E [\exp (ι v' Y)] = ϕ_{Y} (v)$ with argument $v = G s$ replacing $t,$ to give conclusion (i). Next partition $s' = [s_{1}', s_{2}']$ with $s_{1}' = [s_{1}, \dots, s_{k}],$ to obtain $ϕ_{Z} (s) = \exp [ι s' G' X β - \frac{1}{2} {(s' G' G s)}^{\frac{α}{2}}] = \exp [ι s_{1}' β - \frac{1}{2} {(s_{1}' (X' X)^{ - 1} s_{1} + s_{2}' P s_{2})}^{\frac{α}{2}}]$ on taking $σ^{2} = 1.$ The marginals of $\overset{⌢}{β}$ and $e$ follow on setting $s_{2} = 0,$ then $s_{1} = 0$ in succession, to give conclusions (ii) and (iii). Conclusion (iv) is attributed to Hartman et al.²⁴ through Theorem 1. Specifically, a change of variables $u \to e \to e' e = (n - k) S^{2}$ behind the integral on the right of Theorem 1(iii) gives the conditional density for $L ((n - k) S^{2} | s),$ namely the scaled chi-squared density $h (u; ν, s)$ depending on $s,$ so that integrating with respect to $d Ψ (s; α)$ gives conclusion (iv).
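The block structure of $G' G$ in equation (2) can be checked exactly on a small example. The sketch below is illustrative only: the design matrix (an intercept and one regressor, $n = 4,$ $k = 2$) is hypothetical, the helper names are not from the study, and exact rational arithmetic from the Python standard library is used to avoid rounding.

```python
from fractions import Fraction


def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def inverse_2x2(m):
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


# Hypothetical design: intercept plus one regressor, n = 4, k = 2.
X = [[Fraction(1), Fraction(x)] for x in (0, 1, 2, 3)]
n, k = len(X), len(X[0])

Xt = transpose(X)
XtX_inv = inverse_2x2(matmul(Xt, X))
Lt = matmul(XtX_inv, Xt)                 # L' = (X'X)^{-1} X', of order k x n
XLt = matmul(X, Lt)                      # X (X'X)^{-1} X'
P = [[Fraction(int(i == j)) - XLt[i][j] for j in range(n)] for i in range(n)]

Gt = Lt + P                              # G' stacks L' over P, (n + k) x n
GtG = matmul(Gt, transpose(Gt))          # G'G, (n + k) x (n + k)

top_left = [row[:k] for row in GtG[:k]]      # should equal (X'X)^{-1}
off_diag = [row[k:] for row in GtG[:k]]      # should vanish
bottom_right = [row[k:] for row in GtG[k:]]  # should equal P
trace_P = sum(P[i][i] for i in range(n))     # trace equals rank for a projection
```

Since $P$ is a projection of rank $n - k$ and ${(X' X)}^{ - 1}$ has rank $k,$ the block-diagonal array has rank $n,$ as stated after equation (2).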

Remark 3 That $Σ = σ^{2} Diag ((X' X)^{ - 1}, P)$ is block-diagonal in conclusion (i) assures under $S α S$ errors that $(\overset{⌢}{β}, e)$ are $α$-unassociated as in Definition 3, the counterpart of their being mutually uncorrelated, as is well known, under second moments.

It remains to reexamine topics in inference under $S α S$ errors. The following are germane.

Definition 5 An estimator $\overset{⌢}{θ}$ for $θ \in ℝ^{k}$ is said to be linearly median unbiased if and only if the median $med (a' \overset{⌢}{θ}) = a' θ$ for each $a \in ℝ^{k};$ and to be modal unbiased provided that the mode of $L (\overset{⌢}{θ})$ is at $θ.$

Definition 6 An estimator $\overset{⌢}{θ}$ for $θ$ is said to be more concentrated about $θ$ than $\tilde{θ}$ provided that $P ((\overset{⌢}{θ} - θ) \in C_{0}) \geq P ((\tilde{θ} - θ) \in C_{0})$ for every convex set $C_{0}$ in $ℝ^{k}$ symmetric under reflection about $0 \in ℝ^{k}.$

Essential properties under $S α S$ errors include the following.

Theorem 3 For $L (Y) = S_{n}^{α} (X β, σ^{2} I_{n}),$ consider properties of the $O L S$ solutions $\overset{⌢}{β} = {(X' X)}^{ - 1} X' Y,$ and of the equivariant estimators $\overset{⌢}{β} = δ (Y)$ of Definition 4.

(i) $\overset{⌢}{β}$ is unbiased for $β$ for each ${1 < α \leq 2};$
(ii) $\overset{⌢}{β}$ is linearly median unbiased for $β;$
(iii) $\overset{⌢}{β}$ is most concentrated about $β$ among all median-unbiased linear estimators;
(iv) $\overset{⌢}{β}$ is modal unbiased for $β;$
(v) ${\overset{⌢}{β}}_{N}$ is consistent for $β$ in a sequence of identical but dependent experiments $\{Y_{i} = X β + e_{i}; i = 1, 2, \dots, N\};$
(vi) The null distribution of $F = (\overset{⌢}{β} - β_{0})' X' X (\overset{⌢}{β} - β_{0})/(k S^{2})$ has exactly its normal-theory form; the power increases with increasing $λ = (β - β_{0})' X' X (β - β_{0})/ σ^{2};$ and such tests are unbiased;
(vii) $\overset{⌢}{β}$ is most concentrated about $β$ among all modal-unbiased linear estimators;
(viii) $\overset{⌢}{β}$ is most concentrated about $β$ among all equivariant estimators $\overset{⌢}{β} = δ (Y).$

Proof. Conclusions (i)–(vi) carry over from Jensen²⁷ without benefit of moments, regardless of membership in the $S α S$ class. To consider concentration properties of modal-unbiased estimators, begin with $ϕ_{Y} (t) = \exp [ι t' X β - \frac{1}{2} {(t' t)}^{\frac{α}{2}}],$ and consider $\tilde{β} = L' Y$ with $L' = [(X' X)^{ - 1} X' + G'],$ so that

$ϕ_{\tilde{β}} (s) = \exp [ι s' L' X β - \frac{1}{2} {(s' L' L s)}^{\frac{α}{2}}];$

$s' L' X β = s' [(X' X)^{ - 1} X' + G'] X β.$

That $\tilde{β}$ should have mode at $β,$ it is necessary that $s' L' X β = s' β,$ i.e. $G' X = 0.$ Accordingly, $ϕ_{\tilde{β}} (s) = \exp [ι s' β - \frac{1}{2} {[s' Ω s]}^{\frac{α}{2}}],$ with $Ω = L' L = [(X' X)^{ - 1} + G' G].$ Clearly the matrix $[L' L - {(X' X)}^{ - 1}] = G' G$ is positive semidefinite, giving conclusion (vii) from Jensen.²⁸ Conclusion (viii) follows from Theorem 2.7 of Burk et al.,²⁹ since distributions are unimodal from Theorem 1(iv).
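The exactness of the $F$ ratio in conclusion (vi) rests on scale invariance, which may be seen in a small simulation. The sketch below is illustrative only and takes the simplest design $X = 1_{n}$ with $k = 1;$ it generates spherical Cauchy errors as a Gaussian vector divided by the absolute value of a single independent Gaussian variable (the multivariate $t$ construction on one degree of freedom). The function names and the seed are hypothetical.

```python
import random


def f_ratio(y, beta0):
    """Normal-theory F for H0: beta = beta0 in the model Y = beta * 1_n + e (k = 1)."""
    n = len(y)
    beta_hat = sum(y) / n                                 # OLS solution for X = 1_n
    s2 = sum((v - beta_hat) ** 2 for v in y) / (n - 1)    # residual mean square
    return n * (beta_hat - beta0) ** 2 / s2


def spherical_cauchy_pair(n, rng):
    """Gaussian errors g, and spherical Cauchy errors g / |w| sharing one scale."""
    g = [rng.gauss(0.0, 1.0) for _ in range(n)]
    w = abs(rng.gauss(0.0, 1.0))                          # common mixing variable
    return g, [v / w for v in g]


rng = random.Random(2018)
beta0 = 5.0
g, z = spherical_cauchy_pair(8, rng)
f_gauss = f_ratio([beta0 + v for v in g], beta0)
f_cauchy = f_ratio([beta0 + v for v in z], beta0)
print(f_gauss, f_cauchy)   # equal up to rounding: the common scale cancels
```

Because the common scale cancels between numerator and denominator, each Cauchy draw yields the same $F$ value as its underlying Gaussian draw, so that under $H_{0}$ the two null distributions coincide.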

Spherical Cauchy errors

Spherical multivariate $t$ errors on $ν$ degrees of freedom trace to Zellner,³⁰ to include Cauchy errors at $ν = 1,$ equivalently, at $α = 1$ in the class $S α S.$ Specializing from Theorem 1(ii), the spherical Cauchy chf is $ϕ_{Z} (t) = \exp [ι t' δ - \frac{1}{2} {(t' t)}^{\frac{1}{2}}].$ Recast in terms of linear inference, we have the following specialization of Theorems 1 and 2.

Corollary 1 Under the conditions of Theorems 1 and 2, the following properties hold under spherical Cauchy errors.

(i) The spherical Cauchy density on $ℝ^{n}$ at $α = 1$ is

$f_{n}^{1} (z; δ, I_{n}) = \int_{0}^{\infty} g_{n} (z; δ, s^{- 2} I_{n}) d Ψ (s; 1)$

$= c (n) {[1 + (z - δ)' (z - δ)]}^{- \frac{n + 1}{2}},$

$c (n) = Γ (\frac{n + 1}{2})/ π^{\frac{n + 1}{2}},$

where $d Ψ (s; 1) = 2 e^{- \frac{s^{2}}{2}} /(2 π)^{\frac{1}{2}} d s$ on $s > 0,$ the mixing $χ (s; 1)$ density.

(ii) The elliptical Cauchy density for $\overset{⌢}{β}$ on $ℝ^{k}$ is

$f_{k}^{1} (\overset{⌢}{β}; β, {(X' X)}^{ - 1}) = c (k) | X' X |^{\frac{1}{2}} {[1 + (\overset{⌢}{β} - β)' X' X (\overset{⌢}{β} - β)]}^{- \frac{k + 1}{2}}$ (3)

Proof. The multivariate $t$ distribution on $ℝ^{n}$ is that of $\{T_{i} = Y_{i} / S; 1 \leq i \leq n\}$ from $L (Y) = N_{n} (δ, σ^{2} I_{n}),$ with $S$ as a sample standard deviation on $ν$ degrees of freedom, known to be spherical Cauchy at $ν = 1.$ This gives conclusion (i) on specializing the conventional multivariate density. Conclusion (ii) follows directly on specializing Theorem 2(ii) at $α = 1.$
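The mixture representation in conclusion (i) can be checked numerically in the scalar case $n = 1,$ where $c (1) = 1 / π.$ The sketch below is illustrative only: it assumes the mixing law to be the $χ$ density on one degree of freedom, $(2 / π)^{\frac{1}{2}} e^{- s^{2} /2}$ on $s > 0,$ and uses a plain midpoint rule; the function names, the truncation point, and the step count are hypothetical choices.

```python
import math


def gaussian_density(z, var):
    """Density of N(0, var) at z."""
    return math.exp(-0.5 * z * z / var) / math.sqrt(2.0 * math.pi * var)


def chi1_density(s):
    """chi density on one degree of freedom (half-normal), for s > 0."""
    return math.sqrt(2.0 / math.pi) * math.exp(-0.5 * s * s)


def mixture_density(z, upper=12.0, steps=20000):
    """Midpoint-rule value of the integral of g_1(z; 0, s^-2) against chi(s; 1)."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        s = (i + 0.5) * h
        total += gaussian_density(z, 1.0 / (s * s)) * chi1_density(s)
    return total * h


def cauchy_density(z):
    """c(1) * [1 + z^2]^{-1} with c(1) = 1/pi."""
    return 1.0 / (math.pi * (1.0 + z * z))


if __name__ == "__main__":
    for z in (0.0, 0.7, 3.0):
        print(z, mixture_density(z), cauchy_density(z))
```

The two printed columns should agree to within the quadrature error, which is small here because the integrand decays like $e^{- s^{2} (1 + z^{2})/2}.$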

Case study

The viability $(Y_{i})$ for each of $n = 13$ biological specimens was recorded after storage under additives $X_{i 1}$ and $X_{i 2}$ as listed in Table 1, from Walpole & Myers.³¹ The model is ${Y_{i} = β_{0} + β_{1} X_{i 1} + β_{2} X_{i 2} + ε_{i}},$ where the errors are taken to be spherical Cauchy. The conventional OLS solutions are ${\overset{⌢}{β}}_{0} = 36.094,$ ${\overset{⌢}{β}}_{1} = 1.031,$ ${\overset{⌢}{β}}_{2} = - 1.870,$ as elements of $\overset{⌢}{β} = [{\overset{⌢}{β}}_{0}, {\overset{⌢}{β}}_{1}, {\overset{⌢}{β}}_{2}]'.$ The matrix $X' X,$ its inverse ${(X' X)}^{ - 1},$ and the transition of the latter into its $α$-association form of Definition 3 are given respectively by

${[\begin{matrix} 13 & 59.43 & 81.82 \\ 59.43 & 394.7255 & 360.6621 \\ 81.82 & 360.6621 & 576.7264 \end{matrix}]}^{- 1} = [\begin{matrix} 1.0114 & - 0.0494 & - 0.1126 \\ - 0.0494 & 0.0083 & 0.0018 \\ - 0.1126 & 0.0018 & 0.0166 \end{matrix}] \to [\begin{matrix} 1 & - 0.5392 & - 0.8690 \\ - 0.5392 & 1 & 0.1533 \\ - 0.8690 & 0.1533 & 1 \end{matrix}] .$

The following properties are evident.

The elliptical Cauchy density for $\overset{⌢}{β}$ is given by equation (3) with $k = 3$ and $X' X$ as listed for these data.
The solution $\overset{⌢}{β}$ is both linear median–unbiased and modal–unbiased, and among all such estimators is most concentrated about $β .$
The normal-theory confidence set $\{β : (\overset{⌢}{β} - β)' X' X (\overset{⌢}{β} - β) \leq k S^{2} c_{γ}\}$ with $k = 3$ holds exactly with confidence coefficient $1 - γ = 0.95,$ where $S^{2} = 4.001$ is the residual mean square on $ν = 10$ degrees of freedom, and $c_{γ} = 3.71$ is the 0.95 quantile of $F (3, 10, 0).$
As correlations are undefined, elements of the $α$ –association matrix nonetheless do serve to quantify the degrees of association among $[{\overset{⌢}{β}}_{0}, {\overset{⌢}{β}}_{1}, {\overset{⌢}{β}}_{2}]$ as in Definition 3, on taking $α = 1$ in Lemma 1.
In particular, ${\overset{⌢}{β}}_{0}$ is negatively associated with $({\overset{⌢}{β}}_{1}, {\overset{⌢}{β}}_{2}),$ whereas $({\overset{⌢}{β}}_{1}, {\overset{⌢}{β}}_{2})$ are themselves positively associated (Table 1).
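The $α$-association matrix of the case study can be recomputed from the printed inverse ${(X' X)}^{ - 1}$ by the rescaling of Definition 3. The sketch below is illustrative; it uses the four-decimal entries as printed, so agreement with the displayed matrix is to rounding only, and the names `XTX_INV` and `association_matrix` are hypothetical.

```python
import math

# (X'X)^{-1} as printed in the case study, to four decimals.
XTX_INV = [
    [1.0114, -0.0494, -0.1126],
    [-0.0494, 0.0083, 0.0018],
    [-0.1126, 0.0018, 0.0166],
]


def association_matrix(sigma):
    """Rescale sigma to rho_ij = sigma_ij / sqrt(sigma_ii * sigma_jj)."""
    m = len(sigma)
    d = [math.sqrt(sigma[i][i]) for i in range(m)]
    return [[sigma[i][j] / (d[i] * d[j]) for j in range(m)] for i in range(m)]


RHO = association_matrix(XTX_INV)
for row in RHO:
    print(["%.4f" % v for v in row])
```

The signs of the off-diagonal entries reproduce the pattern noted above: the intercept estimate is negatively associated with both slope estimates, which in turn are positively associated with each other.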

$Y_{ i}$	$X_{i 1}$	$X_{i 2}$	$Y_{ i}$	$X_{i 1}$	$X_{i 2}$
0.5	1.74	3.3	31.2	6.32	5.42
0.9	6.22	8.41	38.4	10.52	4.63
0.4	1.19	11.6	26.7	1.22	5.85
0.4	4.1	6.62	25.9	6.32	8.72
0	4.08	4.42	25.2	4.15	7.6
0.7	10.15	4.83	35.7	1.72	3.12
0.5	1.7	5.3

Table 1 The viability $(Y_{ i})$ of $n = 13$ biological specimens after storage under additives $X_{i 1}$ and $X_{i 2}$

Summary and discussion

This study offers further insight into the class $S α S$ comprising the spherical $α$-stable laws as limit distributions under conditions for central limit theory. In addition to their essential properties, expanded here to include representations for density functions, this study focuses on models of type ${Y = X β + e}$ when devoid of moments undergirding the classical theory. Recall that normal-theory procedures routinely are applied in practice as large-sample approximations in distributions attracted to Gaussian laws. Specifically, Berry–Esséen bounds on rates of convergence to Gaussian limits are given in Jensen,^32,33 with special reference to linear models in Jensen.^34,35 Results here validate corresponding large-sample approximations for distributions attracted to $S α S$ laws as cited in references.^17–21 Of similar importance are rates of convergence to stable limits as in Paulauskas.²² By showing that many standard properties carry over in essence under significantly weakened assumptions, this study gives further credence to the widely and correctly held view that Gauss–Markov estimation and normal-theory inferences extend considerably beyond the confines of the classical theory.

Appendix A

The preceding study has developed exclusively around spherically dependent $S α S$ errors, as alternative to iid stable errors. This choice is prompted by discrepancies encountered in the simplest case $\{Z_{i} \to Z_{i} + δ; i = 1, 2, \dots, N\}$ with common location parameter. Essential details from Jensen¹⁴ may be summarized as follows. To distinguish the disparate properties of iid vs spherical $S α S$ models, sequences $ℤ = \{Z_{1}, Z_{2}, Z_{3}, \dots\}$ are fundamental in order to take limits. Of significance is that averages of $α$-stable sequences with $α < 2$ may be inconsistent for iid sequences but consistent under $S α S$ symmetry. Accordingly, let $L_{\infty} (Z_{N}) = \lim_{N \to \infty} L (Z_{N}).$ Essentials follow.

Lemma 2 Given $ℤ = \{Z_{1}, Z_{2}, Z_{3}, \dots\},$ consider the case that $Z' = [Z_{1}, \dots, Z_{N}]$ either are iid $S_{1}^{α} (δ, 1),$ with chf $ϕ_{Z_{i}} (t) = \exp [ι t δ - | t |^{α}],$ or are $S α S$ on $ℝ^{N}$ with chf $ϕ_{Z} (t) = \exp [ι δ t' 1_{N} - {(t' t)}^{\frac{α}{2}}].$ Let $S_{N} = (Z_{1} + \dots + Z_{N})$ and ${\bar{Z}}_{N} = N^{- 1} S_{N},$ and consider the standardized variables $U_{N} = N^{\frac{1}{2}} ({\bar{Z}}_{N} - δ).$

Consistency properties of the sequences ${\bar{Z}}_{N}$ are as follows.

(i) For iid sequences with $0 < α < 1,$ $ϕ_{{\bar{Z}}_{N}} (t) = e^{ι t δ - N^{ε} | t |^{α}}$ with $ε = 1 - α > 0,$ so that ${\bar{Z}}_{N}$ is inconsistent for $δ.$

(ii) For iid sequences with $α = 1,$ $ϕ_{{\bar{Z}}_{N}} (t) = e^{ι t δ - | t |^{α}} \equiv ϕ_{Z_{i}} (t),$ so that ${\bar{Z}}_{N}$ is inconsistent for $δ.$

(iii) For iid sequences with $1 < α \leq 2,$ $ϕ_{{\bar{Z}}_{N}} (t) = e^{ι t δ - N^{- ε} | t |^{α}}$ with $ε = α - 1 > 0,$ so that ${\bar{Z}}_{N}$ is consistent for $δ.$

(iv) For $S α S$ sequences, ${\bar{Z}}_{N}$ is consistent for $δ$ for every $0 < α \leq 2.$

(v) For iid sequences with $0 < α < 2,$ $L_{\infty} (U_{N})$ diverges to an improper distribution.

(vi) For $S α S$ sequences, $\lim_{N \to \infty} L (U_{N}) \equiv L (Z_{i}),$ the limit being identical to each component.
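The contrasts in Lemma 2 reduce to the coefficient multiplying $| t |^{α}$ in each chf. The sketch below is illustrative; it assumes the chfs stated in Lemma 2, from which the coefficient for ${\bar{Z}}_{N}$ is $N^{1 - α}$ under iid sampling and $N^{- α /2}$ under $S α S$ symmetry, while that for $U_{N}$ under iid sampling is $N^{1 - α /2}.$ The function names are hypothetical.

```python
def iid_mean_scale(n, alpha):
    """Coefficient of |t|^alpha in the chf of the mean of n iid S_1^alpha(delta, 1) terms."""
    return n ** (1.0 - alpha)


def sas_mean_scale(n, alpha):
    """Coefficient of |t|^alpha in the chf of the mean under spherical SaS dependence."""
    return n ** (-alpha / 2.0)


def iid_standardized_scale(n, alpha):
    """Coefficient of |t|^alpha for U_N = sqrt(N) * (mean - delta) under iid sampling."""
    return n ** (1.0 - alpha / 2.0)


if __name__ == "__main__":
    for alpha in (0.5, 1.0, 1.5, 2.0):
        for n in (10, 1000, 100000):
            print(alpha, n, iid_mean_scale(n, alpha), sas_mean_scale(n, alpha),
                  iid_standardized_scale(n, alpha))
```

A coefficient tending to zero corresponds to consistency of ${\bar{Z}}_{N}$ for $δ;$ a coefficient that grows without bound for $U_{N}$ corresponds to the improper limit under iid sampling with $α < 2.$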

Acknowledgement

None.

Conflict of interest

Authors declare that there is no conflict of interest.

eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Donald R Jensen
