Research Article Volume 7 Issue 3
On local polynomial estimation of hazard rates and their derivatives under left truncation and right censoring models
Jiancheng Jiang,1
Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.
Lingju Chen,2 Yuze Yuan2
1Department of Mathematics and Statistics, University of North Carolina at Charlotte, USA
2Department of Mathematics, Minjiang University, China
Correspondence: Yuze Yuan, Department of Mathematics, Minjiang University, China
Received: April 28, 2018 | Published: May 15, 2018
Citation: Yuan Y, Jiang J, Chen L. On local polynomial estimation of hazard rates and their derivatives under left truncation and right censoring models. Biom Biostat Int J. 2018;7(3):199–204. DOI: 10.15406/bbij.2018.07.00209
Download PDF
Abstract
Estimating hazard rate function is an important problem in survival analysis. There are some estimation approaches based on kernel smoothing. However, they suffer from the boundary effects or need high order kernels, which increases the mean squared error. We introduce local polynomial estimators of hazard rates and their derivatives for the left truncation and right censoring models. The estimators have favorable properties similar to those of local polynomial regression estimators. Asymptotic expressions for the mean squared errors (AMSE’s) are obtained. Consistency and joint asymptotic normality of the local polynomial estimators are established. A data-based local bandwidth selection rule is proposed.
Keywords: censoring, data-driven local bandwidth selection, hazard rate, local polynomial, truncation
Introduction
Consider a subject in survival studies. Only if its onset time, i.e. the time origin of its lifetime, passes the beginning of the study, the subject can enter into the study. For the entered individuals, each of them is then followed for a fixed time point. Such subjects are so-called left truncated and right censored. To be specific, let be the lifetime with distribution function (df) , the random left truncation time with arbitrary df , and the random censoring time with arbitrary df , where is independent of . Then the cumulative hazard function of is . Under the left truncation and right censoring model, one observes if , where and is an indicator of the censoring status of which takes value one if and zero otherwise. When , nothing is observed (see for example, Gurler & Wang1). Let the distribution of be and assume that . Then , where and throughout the paper for any df. , is the corresponding survival function.
The left truncation and right censoring model has been investigated by many authors. Interesting work along the field can be found in Gross & Lai2 and Gurler & Wang1 among others. Several authors have considered the estimation of hazard functions under the left truncation and right censoring model. For examples, Uzunogullari & Wang,3 and Wu & Wells.4
In the present investigation, we study local polynomial (LP for short) estimators of hazard functions and their derivatives based on the i.i.d sample from . Under the left truncation and right censoring model, one observes only those i.i.d pairs for which . It is worth pointing out that our estimators inherit some favorable properties from local polynomial regression estimators, in particular, our estimators can reduce the bias according to the degree of the polynomial without increasing the variance and automatically correct the left boundary effect. The point wise asymptotic normality of our estimators enables one to find the asymptotically optimal variable bandwidth choice, and thereafter allows one to develop a data-driven optimal local bandwidth selector by using the ideas of Fan & Gurler.5 We here present a simpler data-driven method for choosing the local bandwidth.
The outline of this paper is as follows. In Section 2, we introduce the LP estimators. In Section 3 we concentrate on the asymptotic properties of the proposed estimators, including point wise strong consistency and joint asymptotic normality. In Section 4, we propose the data-driven local bandwidth selection rule. Technical proofs are given in the Appendix.
Estimation
In order to introduce the LP estimators, we use the following notation from Gurler & Wang1
- , the modified empirical estimator of .
- , the product-limit estimator of from Tsai.
Let and denote the left and right endpoints of the support for any d.f. , respectively. Then is identifiable if and (Woodroofe6 and Gurler & Wang1).
Therefore, we assume this condition holds. As in Gurler & Wang,1 for estimating the density function of , we also assume .
Following Gurler & Wang,1 we define and Then and
By Gurler & Wang,1 the Nelson-Aalen type estimator of is given by
(1)
Gijbel & Wang1 considered the following kernel estimator for the density function of , which is a convolution of the product-limit estimator with an appropriate kernel function :
(2)
where is a higher order kernel. The method can be also adapted to the case of estimating the hazard function and its derivatives if one uses instead of :
(3)
The estimator is an extension to that of Müller & Wang7,8 where right censoring model is considered. However, for the estimation of derivatives or reduction of bias, the estimator needs higher order kernels, which can lead to a negative hazard rate estimator. The practical advantages of using higher order kernels can be quite small for moderate sample sizes as demonstrated in Marron & Wand.9 When estimating at point near or , the effective support of the kernel is not contained in , most kernel estimators in density estimation and regression settings will encounter boundary effects. The estimator (3) suffers from boundary effects near the endpoints of the support of the hazard rates. In the presence of censoring for estimating hazard rate function, Müller & Wang9 solved the problem by employing boundary kernels and a data-adaptive varying bandwidth selection procedure. Hall & Wehrly10 used a geometrical method for removing edge effects from kernel-type nonparametric regression estimators. These boundary correction methods may also be adapted to the estimation approach in (3). Here we introduce a simple and intuitive approach to the problem. Our approach does not need higher order kernels or boundary kernels while automatically correcting the boundary effects. Our idea is similar to that of Jiang & Doksum,11 but their procedure cannot be directly applied to the current setting.
Following Jiang and Doksum,11 we consider the following optimization problem:
(4)
By Taylor expansion, the solution of the optimization problem, , will estimate . Since in (1) is the empirical estimator of , we define the following generalized empirical hazard rate as the generalized derivative of :
(5)
where is the Dirac function with the following property:
For any integral function. Then , which is why we call the generalized empirical hazard rate. Replacing in (4) by , we obtain that
(6)
Then is the LP estimator of . Jones12 considered a locally linear estimator and established its link with the generalized jackknife boundary correction for by using the ideas of Lejeune & Sarda13 in local linear fitting to the empirical distribution . Here we study the local polynomial estimation of hazard functions and their derivatives, , under the left truncation and right censoring model. Obviously, our method can be used for the complete data case, which corresponds to “no truncation and no censoring”.
Taking the derivative with respect to ’s of the integral in (6), we obtain the LP estimator as the solution to the linear equations: for ,
(7)
It follows that the LP estimator at any point has the following closed form:
(8)
where , and
(9)
When and , , which is the same as the kernel estimator of in (3). However, this equivalence does not hold for boundary points.
We will show in next section that the LP estimator shares nice properties with the local polynomial regression estimator, in particular, the estimator will keep its convergence rate up to the left boundary point, i.e. the estimator automatically corrects the left boundary effect, which contrasts with the results for other hazard rate estimators.
Asymptotic properties
In this section, we will establish the consistency and joint asymptotic normality of the local polynomial estimators. To this end, we introduce some regularity conditions. For a given point , the following notations and assumptions are needed.
The hazard rate function has a continuous derivative at the point .
The sequence of bandwidths tends to zero such that as . Let .
is continuous at the point .
The kernel function is a continuous function of bounded variation and with bounded support , say. Let , , for , , and be matrices.
Theorem 3.1 Under conditions (A1) – (A4),
Theorem 3.2 Under conditions (A1) – (A4),
(10)
Remark 3.1
When estimating a hazard rate which is a polynomial of order on an interval, the finite sample bias of the LP estimators on the interval is zero (see the proof of Theorem 3.2). This contrasts with the methods of Müller and Wang7,8 based on higher order kernels, for which the respective zero bias only holds true asymptotically.
Remark 3.2
When there is no truncation, we take and over the support of , then , and the asymptotic normality for interior point is the same as in Müller & Wang.7,8 When there is no censoring, we take the censoring variable and over the support of , then , and the asymptotic normality for interior point is the same as in Gürler & Wang.1
(11)
where has one in the th component and zeros in the others. Therefore, the optimal local bandwidth for estimating the th derivative of at , in the sense of minimizing , is
(12)
Theorem 3.3
Consider the left edge effect on the estimator. Assume that we estimate at in the left boundary region for some positive constant . Then similar to (8), in (6) has the following closed form, for ,
(13)
where defined as but with replaced by . Let . Then the joint asymptotic normality (10) continues to hold with , and replaced by , and , respectively, where and is matrices.
This property of our estimator in Theorem 3.3 is similar to that of local polynomial regression estimation, which is not shared by other kernel estimators of hazard rates (Hess et al.14) The LP estimators are automatically boundary adaptive in the sense of Fan & Gurler.15 Note that the above property holds even for , which contrasts with the cases of local polynomial regression.
Remark 3.3 For a finite sample, one may encounter right boundary effects when estimating near T. A good method for dealing with the problem is to use the following estimator similar to the estimator in (13):
(14)
where for , and defined as but with replaced by .
Data-driven local bandwidth choice
The proposed estimators depend on the band width . It is important to develop a local bandwidth choice for estimating hazard functions and their derivatives, especially when one would like to have a data-driven approach to bandwidth choice in practice. For hazard rate estimation, Patil16 considered least squares cross-validation bandwidth selection, and Gonz lez-Manteiga et al.17 studied smoothed bootstrap selection of the global bandwidth. Müller & Wang,7 Hess et al.14 and Jiang & Doksum11 studied the local bandwidth choice for estimating hazard rates under right censoring. Here we extend the data-driven local bandwidth choice of Jiang & Doksum11 to the left truncation and right censoring model.
From the proof of Theorem 3.2, we see that the exact bias of the estimator is
(15)
where and The asymptotic variance of is
(16)
where and . Then we propose to estimate the AMSE of via
(17)
where and are defined similarly to and but with replaced by a pilot estimator and replaced by its empirical estimator . We define
Then, for estimating when , the following algorithm similar to Jiang and Doksum11 can be used for estimating .
Algorithm for estimating:
Step 1: Pilot estimators of : Choose a kernel, such as the Epanechnikov kernel, and an initial global bandwidth . The choice of the initial bandwidth depends on the specific case. Assume the data are available on , then a possible value for is as recommended by Muller & Wang,8 where is the number of uncensored observations. The pilot estimators of are obtained by using and our estimators (8), (13) and (14).
Step 2: Minimizing of : Choose an equispaced grid of points , between 0 and T. For each of the grid points compute in (17) with and obtain its minimizes on the interval , say. The minimization of may be computed via Discretisation.
Step 3: Smoothing bandwidths: Choose another equispaced grid of points , , over the interval on which the final hazard estimator is desired. Running local linear smoother by employing global bandwidth or :
where and , for .
Step 4: Final hazard function estimators: Using (8), (13) and (14), obtain the estimators by employing the bandwidth , for .
The above algorithm can be repeated by using the estimators in Step 4 as pilot estimators in Step 1 and running Step 2 – Step 4 again. The pilot estimators of in Step 1 can also be obtained via maximum likelihood if one has a plausible parametric model in mind.18
Appendix proofs of theorems
Proofs of theorems 3.1 and 3.2
Using (1) and (9), we obtain that
(18)
Let and . Then By (8), we have
(19)
We will show that contributes to the bias term, and the variance term of our estimator.
- By (18), a change of variable for integration, and the Taylor expansion, we get
Then
(20)
In particular, if is a polynomial up to order p in a neighborhood of, then the exact bias of the LP estimator is zero.
- For , by Gurler & Wang,1 we have
(21)
where for , any and or 0, and
which satisfies that and where Let . Using the definition of , (21) and integration by parts, we obtain the following almost surely representation of :
(22)
where
is the stochastic component of and contributes to the variance of our estimator, and is the negligible error of the approximation which satisfies
(23)
for . Note that and
(24)
Let and . Then by (22) and (24)
(25)
and By the central limit theorem, we get
(26)
By (23), we know
(27)
Then by (25), (26) and (27)
(28)
Combination of (19), (20) and (28) completes the proof of Theorem 3.2.
Note that, for , are i.i.d.’s sums. By the SLN, we know Then . This combined with (22) and (23) yields . Therefore, by (19) and (20),
Proofs of Theorem 3.3 The result follows by the same argument as in part (i) of the proof of Theorem 3.2.
Acknowledgement
This work is partially supported by Natural Science Foundation of Fujian Providence of China (2016J01024) and by Education and Scientific Research Projects for young and middle-aged teachers of Fujian Providence (JAT160383).
Conflict of interest
Author declares that there are no conflicts of interest.
References
- Gürler Ü, Wang JL. Nonparametric estimation of hazard functions and their derivatives under truncation model. Ann Inst Statist Math. 1993;(45):249–264.
- Gross ST, Lai TL. Nonparametric estimation and regression analysis with left–truncated and right censored data. Jour Amer Statist Assoc. 1996;(91):1166–1180.
- Uzunogullari U, Wang JL. A comparison of hazard rate estimators for left truncated and right censored data. Biometrika. 1992;79(2):297–310.
- Wu SS, Wells MT. Nonparametric estimation of hazard functions by wavelet methods. Journal of Nonparametric Statistics. 2000;15(2):187–203.
- Fan J, Gurler I. Data–driven bandwidth selection in local polynomial fitting: variable bandwidth and spatial adaption. JR Statist Soc B. 1995;(57):371–394.
- Woodroofe M. Estimating a distribution function with truncated data. Ann Statist. 1985;(13):163–177.
- Muller, HG, Wang JL. Locally adaptive hazard smoothing. Prob Th Rel Fields. 1989;85(4):523–538.
- Muller HG, Wang JL. Hazard rate estimation under random censoring with varying kernels and bandwidths. Biometrics. 1994;50(1):61–76.
- Marron JS, Wand MP. Exact mean integrated squared error. Ann Statist. 1992;20(2):712–736.
- Hall P, Wehrly TE. A geometrical method for removing edge effects from kernel–type nonparametric regression estimators. Jour Amer Statist Soc. 1991;86(415):665–672.
- Jiang J, K Doksum. On Local Polynomial Estimation of Hazard Functions and Their Derivatives under Random Censoring. Lecture Notes–Monograph Series. 2003;(42):463–481.
- Jones MC. Simple boundary correction for kernel density estimation. Statistics and Computing. 1993;3(3):135–146.
- Lejeune M, Sarda P. Smooth estimators of distribution and density functions. Computational Statistics and Data Analysis. 1992;(14):457–471.
- Hess KR, Serachitopol DM, Brown BW. Hazard function estimators: a simulation study. Statistics in Medicine. 1999;18(22):3075–3088.
- Fan J, Gurler I. Local Polynomial Modelling and Its Applications. London: Chapman and Hall; 1996.
- Patil PN. On the least squares cross–validation bandwidth in hazard rate estimation. Ann Statist. 1993;21(4):1792–1810.
- Gonz lez–Manteiga W, Cao R, Marron JS. Bootstrap selection of the smoothing parameter in nonparametric hazard rate estimation. Jour Amer Statist Assoc. 1996;91(435):1130–1140.
- Tsai WY, Jewell NP, Wang MC. A note on the product–limit estimator under right censoring and left truncation. Biometrika. 1987;74(4):883–886.
©2018 Yuan, et al. This is an open access article distributed under the terms of the,
which
permits unrestricted use, distribution, and build upon your work non-commercially.