Search results - Inner Mongolia University Library

Estimation and testing for semiparametric mixtures of partially linear models

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, vol. 46, no. 17, pp. 8690-8705

Authors: Wu, Xing; Liu, Tian. Affiliation: Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China

In this paper, we study the estimation and inference for a class of semiparametric mixtures of partially linear models. We prove that the proposed models are identifiable under mild conditions, and then give a PL-EM algorithm estimation procedure based on profile likelihood. The asymptotic properties of the resulting estimators and the ascent property of the PL-EM algorithm are investigated. Furthermore, we develop a test statistic for testing whether the nonparametric component has a linear structure. Monte Carlo simulations and a real data application highlight the interest of the proposed procedures.

Keywords: EM algorithm; hypothesis testing; mixture of regression models; partially linear models; profile likelihood

Multivariate Spatial Data Fusion for Very Large Remote Sensing Datasets

REMOTE SENSING, 2017, vol. 9, no. 2

Authors: Hai Nguyen; Cressie, Noel; Braverman, Amy. Affiliations: CALTECH, Jet Prop Lab, 4800 Oak Grove Dr, Pasadena, CA 91125, USA; Univ Wollongong, Natl Inst Appl Stat Res Australia, Wollongong, NSW 2500, Australia

Global maps of total-column carbon dioxide (CO2) mole fraction (in units of parts per million) are important tools for climate research since they provide insights into the spatial distribution of carbon intake and emissions as well as their seasonal and annual evolutions. Currently, two main remote sensing instruments for total-column CO2 are the Orbiting Carbon Observatory-2 (OCO-2) and the Greenhouse gases Observing SATellite (GOSAT), both of which produce estimates of CO2 concentration, called profiles, at 20 different pressure levels. Operationally, each profile estimate is then convolved into a single estimate of column-averaged CO2 using a linear pressure weighting function. This total-column CO2 is then used for subsequent analyses such as Level 3 map generation and colocation for validation. In principle, total-column CO2 in these applications may be more efficiently estimated by making optimal estimates of the vector-valued CO2 profiles and applying the pressure weighting function afterwards. These estimates will be more efficient if there is multivariate dependence between CO2 values in the profile. In this article, we describe a methodology that uses a modified Spatial Random Effects model to account for the multivariate nature of the data fusion of OCO-2 and GOSAT. We show that multivariate fusion of the profiles has improved mean squared error relative to scalar fusion of the column-averaged CO2 values from OCO-2 and GOSAT. The computations scale linearly with the number of data points, making it suitable for the typically massive remote sensing datasets. Furthermore, the methodology properly accounts for differences in instrument footprint, measurement-error characteristics, and data coverages.

Keywords: EM algorithm; Fixed Rank Kriging; multivariate geostatistics; Spatial Random Effects model
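
Illustrative note (not part of the catalog record): the abstract above describes collapsing a 20-level CO2 profile into a single column-averaged value with a linear pressure weighting function. A minimal sketch of that step follows, assuming a plain weighted average. The function name `column_average`, the example profile, and the uniform weights are hypothetical and are not taken from the article or from any OCO-2 or GOSAT product.

```python
import numpy as np


def column_average(profile, weights):
    """Collapse a CO2 profile (ppm at each pressure level) into one value.

    The weights play the role of a linear pressure weighting function;
    they are normalised here so that they sum to 1.
    """
    profile = np.asarray(profile, dtype=float)
    weights = np.asarray(weights, dtype=float)
    if profile.shape != weights.shape:
        raise ValueError("profile and weights must have the same shape")
    weights = weights / weights.sum()
    return float(weights @ profile)


# Hypothetical 20-level profile (ppm) and uniform weights, for illustration only.
profile = np.linspace(395.0, 405.0, 20)
weights = np.full(20, 1.0 / 20)
xco2 = column_average(profile, weights)
print(f"column-averaged CO2: {xco2:.2f} ppm")
```

Because the weighting is linear, it can be applied either before or after a fusion step; the abstract argues that fusing the full profiles first and weighting afterwards can be more efficient when the levels are correlated.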

The Pareto IV power series cure rate model with applications

SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2017, vol. 41, no. 2, pp. 297-318

Authors: Gallardo, Diego I.; Gomez, Yolanda M.; Arnold, Barry C.; Gomez, Hector W. Affiliations: Univ Atacama, Dept Matemat, Fac Ingn, Copiapo, Chile; Univ Calif Riverside, Dept Stat, Riverside, CA 92521, USA; Univ Antofagasta, Fac Ciencias Basicas, Dept Matemat, Antofagasta, Chile

Cutaneous melanoma is thought to be triggered by intense, occasional exposure to ultraviolet radiation, either from the sun or tanning beds, especially in people who are genetically predisposed to the disease. When skin cells are damaged by ultraviolet light in this way, often showing up as a sunburn, they are more prone to genetic defects that cause them to rapidly multiply and form potentially fatal (malignant) tumors. Melanoma originates in a type of skin cell called a melanocyte; such cells help produce the pigments of our skin, hair, and eyes. We propose a new cure rate survival regression model for predicting cutaneous melanoma. We assume that the unknown number of competing causes that can influence the survival time is governed by a power series distribution and that the time until the tumor cells are activated follows the Pareto IV distribution. The parameter estimation is based on the EM algorithm, which for this model can be implemented in a computationally simple way. Simulation studies are presented, showing the good performance of the proposed estimation procedure. Finally, two real applications related to a cutaneous melanoma and melanoma data sets are presented.

Keywords: Competing risks; cure rate models; EM algorithm; Pareto IV distribution; power series distribution

Statistical and Computational Guarantees for the Baum-Welch algorithm

JOURNAL OF MACHINE LEARNING RESEARCH, 2017, vol. 18

Authors: Yang, Fanny; Balakrishnan, Sivaraman; Wainwright, Martin J. Affiliations: Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720, USA; Carnegie Mellon Univ, Dept Stat, Pittsburgh, PA 15213, USA; Univ Calif Berkeley, Dept Stat, Dept Elect Engn & Comp Sci, Berkeley, CA 94720, USA

The Hidden Markov Model (HMM) is one of the mainstays of statistical modeling of discrete time series, with applications including speech recognition, computational biology, computer vision and econometrics. Estimating an HMM from its observation process is often addressed via the Baum-Welch algorithm, which is known to be susceptible to local optima. In this paper, we first give a general characterization of the basin of attraction associated with any global optimum of the population likelihood. By exploiting this characterization, we provide non-asymptotic finite sample guarantees on the Baum-Welch updates and show geometric convergence to a small ball of radius on the order of the minimax rate around a global optimum. As a concrete example, we prove a linear rate of convergence for a hidden Markov mixture of two isotropic Gaussians given a suitable mean separation and an initialization within a ball of large radius around (one of) the true parameters. To our knowledge, these are the first rigorous local convergence guarantees to global optima for the Baum-Welch algorithm in a setting where the likelihood function is nonconvex. We complement our theoretical results with thorough numerical simulations studying the convergence of the Baum-Welch algorithm and illustrating the accuracy of our predictions.

Keywords: Hidden Markov Models; Baum-Welch algorithm; EM algorithm; non-convex optimization; graphical models

Analysis of Gamma and Weibull lifetime data under a general censoring scheme and in the presence of covariates

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, vol. 46, no. 5, pp. 2277-2289

Authors: Bennett, Nathan; Iyer, Srikanth K.; Jammalamadaka, S. Rao. Affiliations: Univ Calif Santa Barbara, Dept Stat & Appl Probabil, Santa Barbara, CA 93106, USA; Indian Inst Sci, Dept Math, Bangalore, Karnataka, India

We consider the problem of estimating the lifetime distributions of survival times subject to a general censoring scheme called "middle censoring". The lifetimes are assumed to follow a parametric family of distributions, such as the Gamma or Weibull distributions, and the approach is applied to cases in which the lifetimes come with covariates affecting them. For any individual in the sample, there is an independent, random censoring interval. We observe the actual lifetime if it falls outside this censoring interval; otherwise we observe only the interval of censoring. This censoring mechanism, which includes both right and left censoring, has been called "middle censoring" (see Jammalamadaka and Mangalam, 2003). Maximum-likelihood estimation of the parameters as well as their large-sample properties are studied under this censoring scheme, including the case when covariates are available. We conclude with an application to a dataset from Environmental Economics dealing with Contingent Valuation of natural resources.

Keywords: Accelerated failure time model; EM algorithm; Gamma distribution; maximum-likelihood estimators; middle censoring; Weibull distribution
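
Illustrative note (not part of the catalog record): the observation rule described in the abstract above can be sketched in a few lines. This is a minimal simulation under stated assumptions, not the authors' code. The names `middle_censor` and `simulate`, the Weibull parameters, and the exponential model for the censoring interval are all hypothetical choices made for illustration.

```python
import random


def middle_censor(lifetime, left, right):
    """Apply the middle-censoring rule to one individual.

    Returns ("exact", lifetime) when the lifetime falls outside the
    censoring interval (left, right), and ("interval", (left, right))
    when it falls inside, in which case only the interval is observed.
    """
    if left < lifetime < right:
        return ("interval", (left, right))
    return ("exact", lifetime)


def simulate(n, seed=0):
    """Draw n lifetimes and independent random censoring intervals."""
    rng = random.Random(seed)
    observations = []
    for _ in range(n):
        # Assumed lifetime model: Weibull with scale 1.0 and shape 1.5.
        lifetime = rng.weibullvariate(1.0, 1.5)
        # Assumed censoring interval, drawn independently of the lifetime.
        left = rng.expovariate(1.0)
        right = left + rng.expovariate(1.0)
        observations.append(middle_censor(lifetime, left, right))
    return observations


sample = simulate(1000)
n_censored = sum(1 for kind, _ in sample if kind == "interval")
print(f"{n_censored} of {len(sample)} lifetimes were middle-censored")
```

An interval starting at zero reproduces left censoring and an interval that extends to infinity reproduces right censoring, which is the sense in which the scheme includes both.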

Inference based on progressive Type I interval censored data from log-normal distribution

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, vol. 46, no. 8, pp. 6495-6512

Authors: Roy, Soumya; Gijo, E. V.; Pradhan, Biswabrata. Affiliations: Indian Inst Management Kozhikode, Kunnamangalam 673570, Kerala, India; Indian Stat Inst, SQC & OR Unit, Bangalore, Karnataka, India; Indian Stat Inst, SQC & OR Unit, Kolkata, W Bengal, India

This article considers inference for the log-normal distribution based on progressive Type I interval censored data by both frequentist and Bayesian methods. First, the maximum likelihood estimates (MLEs) of the unknown model parameters are computed by the expectation-maximization (EM) algorithm. The asymptotic standard errors (ASEs) of the MLEs are obtained by applying the missing information principle. Next, the Bayes estimates of the model parameters are obtained by the Gibbs sampling method under both symmetric and asymmetric loss functions. The Gibbs sampling scheme is facilitated by adopting a data augmentation scheme similar to that of the EM algorithm. The performance of the MLEs and various Bayesian point estimates is judged via a simulation study. A real dataset is analyzed for the purpose of illustration.

Keywords: Data augmentation; EM algorithm; Gibbs sampling; LINEX loss function; Missing information principle

AlloMap6: an R package for genetic linkage analysis in allohexaploids

Briefings in Bioinformatics, 2017, vol. 18, no. 6, pp. 919-927

Authors: Xuli Zhu; Huan Li; Meixia Ye; Libo Jiang; Mengmeng Sang; Rongling Wu. Affiliation: Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China

Allopolyploids are a group of polyploids with more than two sets of chromosomes derived from different species. Previous linkage analysis of allopolyploids is based on the assumption that different chromosomes pair randomly during meiosis. A more sophisticated model to relax this assumption has been developed for allotetraploids by incorporating the preferential pairing behavior of homologous over homoeologous chromosomes. Here, we show that the basic principle of this model can be extended to perform linkage analysis of higher-ploidy allohexaploids, where multiple preferential pairing factors are used to characterize chromosomal-pairing meiotic features between different constituent species. We implemented the extended model in an R package, called AlloMap6, allowing the recombination fractions and preferential pairing factors to be estimated simultaneously. AlloMap6 has two major functionalities: computer simulation and real-data analysis. By analyzing real data from a full-sib family of allohexaploid persimmon, we tested and validated the usefulness and utility of this package. AlloMap6 lays a foundation for allohexaploid genetic mapping and provides a new horizon to explore the chromosomal kinship of allohexaploids.

Keywords: EM algorithm; allohexaploid persimmon; preferential pairing factor; recombination fraction

A new method for estimation of orthogonal and phase deviations of constant-envelope signals

MEASUREMENT SCIENCE AND TECHNOLOGY, 2017, vol. 28, no. 12

Authors: Li, Youyang; Lu, Xiaochun; Wang, Xue. Affiliation: Chinese Acad Sci, Natl Time Serv Ctr, Lintong 710600, Peoples R China

In this paper it is shown that orthogonal deviation increases the ranging error of navigation signals, and a method for the estimation of orthogonal and phase deviation of the I and Q components of a constant-envelope signal is proposed. A measurement process is introduced and the measurement accuracy for different signal-to-noise ratios and data lengths is provided. Moreover, in the process of evaluation, a very important conclusion is reached: when a digital signal is filtered, the filter bandwidth does not affect measurement accuracy. The corresponding proof is given. The accuracy of the proposed method and measurement is verified by simulations. In the simulations, an expectation-maximization (EM) algorithm was used to estimate the constellation coordinates, the shortcomings of the EM algorithm in high-precision parameter estimation were determined, and the corresponding corrections were made. Finally, the proposed method was used to estimate both orthogonal deviation and phase deviation of a real satellite signal.

Keywords: constellation diagram; Gauss mixture model; EM algorithm; orthogonal deviation; phase deviation
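
Illustrative note (not part of the catalog record): the abstract above mentions using an EM algorithm with a Gaussian mixture model to estimate constellation coordinates. The sketch below shows the generic EM iteration for a one-dimensional Gaussian mixture. It is a textbook version under simplifying assumptions, not the authors' corrected procedure: real constellation points are two-dimensional (I and Q), and the function name `em_gmm_1d`, the synthetic data, and the starting values are hypothetical.

```python
import numpy as np


def em_gmm_1d(x, mu_init, n_iter=100):
    """Fit a 1-D Gaussian mixture by EM, starting from the given means."""
    x = np.asarray(x, dtype=float)
    mu = np.asarray(mu_init, dtype=float).copy()
    k = mu.size
    sigma2 = np.full(k, x.var())   # start every component at the sample variance
    pi = np.full(k, 1.0 / k)       # start with equal mixing proportions
    for _ in range(n_iter):
        # E-step: posterior probability that each point belongs to each
        # component, computed in log space for numerical stability.
        diff = x[:, None] - mu[None, :]
        log_p = np.log(pi) - 0.5 * np.log(2.0 * np.pi * sigma2) - 0.5 * diff**2 / sigma2
        log_p -= log_p.max(axis=1, keepdims=True)
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: responsibility-weighted proportions, means and variances.
        nk = resp.sum(axis=0)
        pi = nk / x.size
        mu = (resp * x[:, None]).sum(axis=0) / nk
        sigma2 = (resp * (x[:, None] - mu[None, :]) ** 2).sum(axis=0) / nk
    return pi, mu, sigma2


# Synthetic two-cluster data standing in for one noisy signal component.
rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(-1.0, 0.2, 500), rng.normal(1.0, 0.2, 500)])
pi, mu, sigma2 = em_gmm_1d(x, mu_init=[-0.5, 0.5])
print("estimated cluster centres:", mu)
```

The estimated means play the role of the constellation coordinates in this simplified setting.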

Network estimation in State Space Models with L1-regularization constraint

Afrika Statistika, 2017, vol. 12, no. 2, pp. 1253-1273

Authors: Lotsi Anani; Ernst Wit

Microarray technologies and related methods, coupled with appropriate mathematical and statistical models, have made it possible to identify dynamic regulatory networks by measuring time course expression levels of many genes simultaneously. However, one of the challenges is the high-dimensional nature of such data, coupled with the fact that these gene expression data are known not to include various biological processes. As genomic interactions are highly structured, the aim was to derive a method for inferring a sparse dynamic network in a high-dimensional data setting. The paper assumes that the observations are noisy measurements of gene expression in the form of mRNAs, whose dynamics can be described by some partially observed process.

Keywords: 62-09; 62H12; 62J07; EM algorithm; gene expression; genomic microarray; sparse state space model

An optimization method of Voiceprint Recognition based on user portrait

IEEE Conference on Energy Internet and Energy System Integration

Authors: Furong Yan; Bao Yuan; Yan Chen; Jiakui Zhao; Yuxi Liu; Hong Ouyang; Zuoping Wu; Qiang Liu; Lei Li; Yunyao Xue. Affiliations: State Grid Information and Telecommunication Group Co. Ltd., Beijing, China; State Grid Zhejiang Electric Power Co. Ltd., Hangzhou, China

Voiceprint is an important component of creating a user portrait. Voiceprint Recognition can determine a user's identity. However, speech signals in the customer service system are encoded with compression for effective transmission and storage. The low-bit-rate codec dramatically reduces the performance of the Voiceprint Recognition system. What is more, the amount of speech available for each customer is inadequate. To solve these problems, this paper proposes a model compensation method. The method uses a test utterance with the expectation-maximization (EM) algorithm to estimate the distortion model, and the UBM is adjusted to match the codec type of the test utterance. Voiceprint Recognition experiments are conducted. The results show that the proposed method is able to dramatically improve the performance of the system.

Keywords: User portrait; Voiceprint recognition; Model compensation method; EM algorithm; Distortion model
