In this study, we model aggregate claims using a subordinator, specifically a non-decreasing Levy process. Large positive jumps, exceeding a predetermined threshold, represent significant claims, while frequent but sm...
详细信息
In this study, we model aggregate claims using a subordinator, specifically a non-decreasing Levy process. Large positive jumps, exceeding a predetermined threshold, represent significant claims, while frequent but smaller fluctuations capture other sources of non-insurance uncertainty, such as miscellaneous expenses. The primary challenge lies in extracting the necessary mathematical insights to estimate the jump measure from a sample path of truncated aggregate claims. Through a discrete time-point sampling scheme, we conduct an initial comparison between conventional parametricestimators of the Levy measure associated with the subordinator, based on simulated significant claims, and our proposed non-parametric estimator, derived by adapting classical differential processes originally introduced by Rubin and Tucker. The results of this comparison suggest the potential utility of our estimator in the context of real data from the insurance sector. While the primary focus of this work is to uncover the mathematical foundations, a preliminary simulation study, although lacking rigorous numerical analysis, hints at the favorable estimation of the Poisson rate for the number of jumps exceeding the threshold, achieved using our proposed non-parametric estimator of the Levy measure.
We first consider a stochastic system described by an absorbing semi-Markov chain (SMC) with finite state space, and we introduce the absorption probability to a class of recurrent states. Afterwards, we study the fir...
详细信息
We first consider a stochastic system described by an absorbing semi-Markov chain (SMC) with finite state space, and we introduce the absorption probability to a class of recurrent states. Afterwards, we study the first hitting probability to a subset of states for an irreducible SMC. In the latter case, a non-parametric estimator for the first hitting probability is proposed and the asymptotic properties of strong consistency and asymptotic normality are proven. Finally, a numerical application on a five-state system is presented to illustrate the performance of this estimator.
Suppose that , where are random inputs, Y is the output, and is an unknown link function. The Sobol indices gauge the sensitivity of each X against Y by estimating the regression curve's variability between them. ...
详细信息
Suppose that , where are random inputs, Y is the output, and is an unknown link function. The Sobol indices gauge the sensitivity of each X against Y by estimating the regression curve's variability between them. In this paper, we estimate these curves with a kernel-based method. The method allows to estimate the first order indices when the link between the independent and dependent variables is unknown. The kernel-based methods need a bandwidth to average the observations. For finite samples, the cross-validation method is famous to decide this bandwidth. However, it produces a structural bias. To remedy this, we propose a bootstrap procedure which reconstruct the model residuals and re-estimate the non-parametric regression curve. With the new set of curves, the procedure corrects the bias in the Sobol index. To test the developed method, we implemented simulated numerical examples with complex functions.
We investigate the estimation of the l-fold convolution of the density of an unobserved variable X from n i.i.d. observations of the convolution model Y = X + epsilon. We first assume that the density of the noise e i...
详细信息
We investigate the estimation of the l-fold convolution of the density of an unobserved variable X from n i.i.d. observations of the convolution model Y = X + epsilon. We first assume that the density of the noise e is known and define non-adaptive estimators, for which we provide bounds for the mean integrated squared error. In particular, under some smoothness assumptions on the densities of X and e, we prove that the parametric rate of convergence 1/n can be attained. Then, we construct an adaptive estimator using a penalisation approach having similar performances to the non-adaptive one. The price for its adaptivity is a logarithmic term. The results are extended to the case of unknown noise density, under the condition that an independent noise sample is available. Lastly, we report a simulation study to support our theoretical findings.
This paper investigates the rate of occurrence of failures (ROCOF) of finite state semi-Markov systems. Firstly, a formula for evaluating the ROCOF for semi-Markov systems is derived. As a consequence of this result, ...
详细信息
This paper investigates the rate of occurrence of failures (ROCOF) of finite state semi-Markov systems. Firstly, a formula for evaluating the ROCOF for semi-Markov systems is derived. As a consequence of this result, we derive the ROCOF of Markov systems as well as the ROCOF of the alternating renewal processes and we give their asymptotic values. From the semi-Markov ROCOF formula, we propose a statistical estimator of this function. Finally, the uniform strong consistency and the asymptotic normality of this estimator are obtained. (C) 2002 Elsevier Science B.V. All rights reserved.
estimators for quantiles based on linear combinations of order statistics have been proposed by Harrell and Davis(1982) and kaigh and Lachenbruch (1982). Both estimators have been demonstrated to be at least as effici...
详细信息
estimators for quantiles based on linear combinations of order statistics have been proposed by Harrell and Davis(1982) and kaigh and Lachenbruch (1982). Both estimators have been demonstrated to be at least as efficient for small sample point estimation as an ordinary sample quantile estimator based on one or two order statistics: Distribution-free confidence intervals for quantiles can be constructed using either of the two approaches. By means of a simulation study, these confidence intervals have been compared with several other methods of constructing confidence intervals for quantiles in small samples. For the median, the Kaigh and Lachenbruch method performed fairly well. For other quantiles, no method performed better than the method which uses pairs of order statistics.
To increase the predictive abilities of several plasma biomarkers on the coronary artery disease (CAD)-related vital statuses over time, our research interest mainly focuses on seeking combinations of these biomarkers...
详细信息
To increase the predictive abilities of several plasma biomarkers on the coronary artery disease (CAD)-related vital statuses over time, our research interest mainly focuses on seeking combinations of these biomarkers with the highest time-dependent receiver operating characteristic curves. An extended generalized linear model (EGLM) with time-varying coefficients and an unknown bivariate link function is used to characterize the conditional distribution of time to CAD-related death. Based on censored survival data, two non-parametric procedures are proposed to estimate the optimal composite markers, linear predictors in the EGLM model. Estimation methods for the classification accuracies of the optimal composite markers are also proposed. In the article we establish theoretical results of the estimators and examine the corresponding finite-sample properties through a series of simulations with different sample sizes, censoring rates and censoring mechanisms. Our optimization procedures and estimators are further shown to be useful through an application to a prospective cohort study of patients undergoing angiography.
Given a random vector ( X,Y ) with distribution function H ( x,y ) such that the marginal random variable X has support S , different statistical parameters θ( x ) associated with Y conditioned by X = x can be define...
详细信息
Given a random vector ( X,Y ) with distribution function H ( x,y ) such that the marginal random variable X has support S , different statistical parameters θ( x ) associated with Y conditioned by X = x can be defined by means of functionals T:R×Θ× F →R (Θ a parameter space, F a space of distribution functions) as those for which ʃ T(y,θ(x),F(·|x))dF(y|x)=0 , where F (·| x ) represents the conditional distribution of the random variable Y given X = x . This paper shows the asymptotic normality of the general class of estimators θ n ( x ) ( x∈S ) defined as the solutions of ʃT(y, θ ̂ n (x), F ̂ n (·|x))d F ̂ n (y|x)=0 where F̂ n ( y | x ) is a non-parametric estimator of F ( y | x ). This result is applied to several particular cases.
The area under the receiver operating characteristic (ROC) curve (AUC) is one of the commonly used measure to evaluate or compare the predictive ability of markers to the disease status. Motivated by an angiographic c...
详细信息
The area under the receiver operating characteristic (ROC) curve (AUC) is one of the commonly used measure to evaluate or compare the predictive ability of markers to the disease status. Motivated by an angiographic coronary artery disease (CAD) study, our objective is mainly to evaluate and compare the performance of several baseline plasma levels in the prediction of CAD-related vital status over time. Based on censored survival data, the non-parametric estimators are proposed for the time-dependent AUC. The limiting Gaussian processes of the estimators and the estimated asymptotic variance-covariance functions enable us to further construct confidence bands and develop testing procedures. Applications and finite sample properties of the proposed estimation methods and inference procedures are demonstrated through the CAD-related death data from the British Columbia Vital Statistics Agency and Monte Carlo simulations. (C) 2009 Elsevier B.V. All rights reserved.
This paper aims at proposing efficient vegetation sampling strategies. It describes how the estimation of species richness and diversity of moist evergreen forest is affected by (1) sampling design (simple random samp...
详细信息
This paper aims at proposing efficient vegetation sampling strategies. It describes how the estimation of species richness and diversity of moist evergreen forest is affected by (1) sampling design (simple random sampling, random cluster sampling, systematic cluster sampling, stratified cluster sampling);(2) choice of species richness estimators (number of observed species vs, non-parametric estimators) and (3) choice of diversity index (Simpson vs. Shannon). Two sites are studied: a 28-ha area situated in the Western Ghats of India and a 25-ha area located at Pasoh in Peninsular Malaysia. The results show that: (1) whatever the sampling strategy, estimates of species richness depend on sample size in these very diverse forest ecosystems which contain many rare species;(2) Simpson's diversity index reaches a stable value at low sample sizes while Shannon's index is affected more by the addition of rare species with increasing sample size;(3) cluster sampling strategies provide a good compromise between cost and statistical efficiency;(4) 300-400 sample trees grouped in small clusters (10-50 individuals) are enough to obtain unbiased and precise estimates of Simpson's index;(5) the local topography of the Western Ghats has a major influence on forest composition, the steep slopes being richer and more diverse than the ridges and gentle slopes;(6) stratified duster sampling is thus an interesting alternative to systematic cluster sampling.
暂无评论