检索结果-内蒙古大学图书馆

An evaluation of the reconstructed coefficient of determination and potential adjustments

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION 2017年第9期46卷 6705-6718页

作者： Miljkovic, Tatjana Orr, Megan Miami Univ Dept Stat Oxford OH 45056 USA North Dakota State Univ Dept Stat Fargo ND USA

Previously, a method was proposed for calculating a reconstructed coefficient of determination in the case of right-censored regression using the expectation-maximization (em) algorithm. This measure is assessed via simulation study for the purpose of evaluating the utility of model fit. Further, several reconstructed adjusted coefficients of determination are proposed and compared via simulation study for the purpose of model selection. The application of these proposed measures is illustrated on a real dataset.

关键词： Coefficient of determination em algorithm Linear models Regression Right censoring

来源：评论

学校读者我要写书评

暂无评论

A non-iterative posterior sampling algorithm for linear quantile regression model

引用

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION 2017年第8期46卷 5861-5878页

作者： Yang, Fengkai Yuan, Haijing Shandong Univ Sch Math Jinan Shandong Peoples R China Shandong Univ Sch Math & Stat Weihai Peoples R China

In this article, a non-iterative posterior sampling algorithm for linear quantile regression model based on the asymmetric Laplace distribution is proposed. The algorithm combines the inverse Bayes formulae, sampling/importance resampling, and the expectation maximization algorithm to obtain independently and identically distributed samples approximately from the observed posterior distribution, which eliminates the convergence problems in the iterative Gibbs sampling and overcomes the difficulty in evaluating the standard deviance in the em algorithm. The numeric results in simulations and application to the classical Engel data show that the non-iterative sampling algorithm is more effective than the Gibbs sampling and em algorithm.

关键词： Asymmetric Laplace distribution em algorithm Gibbs sampling Inverse Bayes formulae Quantile regression

来源：评论

学校读者我要写书评

暂无评论

Switching LDS detection for GNSS-based train integrity monitoring system

引用

IET INTELLIGENT TRANSPORT SYSTemS 2017年第5期11卷 299-307页

作者： Li, Sihui Cai, Baigen Wei Shangguan Schnieder, Eckehard Toro, Federico Grasso Beijing Jiaotong Univ Sch Elect & Informat Engn Beijing Peoples R China Tech Univ Carolo Wilhelmina Braunschweig Inst Traff Safety & Automat Engn Braunschweig Germany

Train integrity whilst in service establishes the foundation for railway safety. This study investigates train integrity detection which reliably deduces whether the train consists remain intact. A switching linear dynamic system (SLDS) based train integrity detection method is proposed for Global Navigation Satellite System (GNSS) based train integrity Monitoring System (TIMS) using the relative distance, velocity and acceleration of the locomotive and the last van. There, Expectation Maximisation (em) algorithm estimates the parameters of SLDS model while the Gaussian Sum Filter infers train integrity state. After that, to cope with false detection and misdetection, a verification procedure and train parting time estimation are designed. The approach is evaluated with both field trials and simulated data. Results show that the false alarm rate and misdetection rate of SLDS-based integrity detection approach are 0 and 0.09% respectively, which proves better than the estimated train length based detection model and Hidden Markov Model (HMM).

关键词： satellite navigation railway safety expectation-maximisation algorithm hidden Markov models GNSS-based train integrity monitoring system railway safety switching linear dynamic system train integrity detection method global navigation satellite system TIMS relative distance expectation maximisation algorithm em algorithm SLDS model Gaussian sum filter verification procedure train parting time estimation false alarm rate misdetection rate train length based detection model hidden Markov model HMM

来源：评论

学校读者我要写书评

暂无评论

Product functional information based automatic patent classification: Method and experimental studies

引用

INFORMATION SYSTemS 2017年第Jul.期67卷 71-82页

作者： Li, Wen-qiang Li, Yan Chen, Jian Hou, Chao-yi Sichuan Univ Sch Mfg Sci & Engn Chengdu 610065 Peoples R China Univ Southampton Fac Engn & Environm Southampton SO17 1BJ Hants England

In order to effectively extract the hidden information from the patent texts and to further provide this information to support the product innovation design process, this paper proposed an automatic patent classification method based on the functional basis and Naive Bayes theory. The functions of products are regarded as the innovation attributes, and the function co-reference relations of the patents in different areas are established. Patent classification methods are proposed based on the functions of products and the general steps of the patent classification process are proposed. In addition, three training methods are studied in the experiments, including multi-classification fully supervised training, multiple dichotomous supervised training and semi-supervised training. Through comparing and analyzing the experimental results, a patent text classifier is developed. In summary, this paper provides a general idea and the relevant technologies on how to build a patent knowledge space by automatically extracting and expanding the patent texts. (C) 2017 Published by Elsevier Ltd.

关键词： Innovation design Functional basis Patent text classification Naive Bayes em algorithm

来源：评论

学校读者我要写书评

暂无评论

Subtype classification and heterogeneous prognosis model construction in precision medicine

引用

BIOMETRICS 2018年第3期74卷 814-822页

作者： You, Na He, Shun Wang, Xueqin Zhu, Junxian Zhang, Heping Sun Yat Sen Univ Sch Math Guangzhou 510275 Guangdong Peoples R China Sun Yat Sen Univ Southern China Ctr Stat Sci Guangzhou 510275 Guangdong Peoples R China Peking Univ Sch Math Sci LMAM Beijing 100871 Peoples R China Sun Yat Sen Univ Zhongshan Sch Med Guangzhou 510080 Guangdong Peoples R China SYSU CMU Shunde Int Joint Res Inst Shunde 528300 Guangdong Peoples R China Yale Univ Sch Publ Hlth Dept Biostat New Haven CT 06511 USA

Common diseases including cancer are heterogeneous. It is important to discover disease subtypes and identify both shared and unique risk factors for different disease subtypes. The advent of high-throughput technologies enriches the data to achieve this goal, if necessary statistical methods are developed. Existing methods can accommodate both heterogeneity identification and variable selection under parametric models, but for survival analysis, the commonly used Cox model is semiparametric. Although finite-mixture Cox model has been proposed to address heterogeneity in survival analysis, variable selection has not been incorporated into such semiparametric models. Using regularization regression, we propose a variable selection method for the finite-mixture Cox model and select important, subtype-specific risk factors from high-dimensional predictors. Our estimators have oracle properties with proper choices of penalty parameters under the regularization regression. An expectation-maximization algorithm is developed for numerical calculation. Simulations demonstrate that our proposed method performs well in revealing the heterogeneity and selecting important risk factors for each subtype, and its performance is compared to alternatives with other regularizers. Finally, we apply our method to analyze a gene expression dataset for ovarian cancer DNA repair pathways. Based on our selected risk factors, the prognosis model accounting for heterogeneity consistently improves the prediction for the survival probability in both training and test datasets.

关键词： em algorithm Finite-mixture Cox proportional hazards model Heterogeneity High-dimensional data Subtype Variable selection

来源：评论

学校读者我要写书评

暂无评论

Fast and robust image segmentation with active contours and Student's-t mixture model

引用

PATTERN RECOGNITION 2017年 63卷 71-86页

作者： Gao, Guowei Wen, Chenglin Wang, Huibin Hohai Univ Coll Comp & Informat Nanjing 210098 Jiangsu Peoples R China Anyang Normal Univ Sch Software Engn Anyang 455000 Henan Peoples R China Hangzhou Dianzi Univ Coll Automat Hangzhou 310018 Zhejiang Peoples R China

In this paper, a novel active contours method, which combines with the Student's-t mixture model via Expectaton-Maximizaton (em) algorithm, is proposed to segment complicated two-phase images. Firstly, we rewrite the cost function and derive a novel updating of level set function based on probabilistic principles. Secondly, we put forward two novel geometric priors from the level-set-based curve evolution;and both of them have advantages, the suitable one is selected by personalized need to obtain level set function in em framework with the aim of reducing the computational cost. Therefore, the level set function is derived from latent variables and served as a feedback to the estimation of the latent variables in next iteration. Finally, in order to enhance the robustness to the outliers, Student's-t mixture model with heavy tail has been applied in our algorithm. Experimental results obtained by employing the proposed method on many synthetic, medical and real-world images to demonstrate its robustness, accuracy and effectiveness. (C) 2016 Elsevier Ltd. All rights reserved.

关键词： Segmentation Active contours Level set Student's-t mixture model em algorithm

来源：评论

学校读者我要写书评

暂无评论

Mutual Kernel Matrix Completion

引用

IEICE TRANSACTIONS ON INFORMATION AND SYSTemS 2017年第8期E100D卷 1844-1851页

作者： Rivero, Rachelle Lemence, Richard Kato, Tsuyoshi Gunma Univ Dept Comp Sci Kiryu Gunma 3768515 Japan Univ Philippines Inst Math Coll Sci Diliman 1101 Quezon City Philippines

With the huge influx of various data nowadays, extracting knowledge from them has become an interesting but tedious task among data scientists, particularly when the data come in heterogeneous form and have missing information. Many data completion techniques had been introduced, especially in the advent of kernel methods-a way in which one can represent heterogeneous data sets into a single form: as kernel matrices. However, among the many data completion techniques available in the literature, studies about mutually completing several incomplete kernel matrices have not been given much attention yet. In this paper, we present a new method, called Mutual Kernel Matrix Completion (MKMC) algorithm, that tackles this problem of mutually inferring the missing entries of multiple kernel matrices by combining the notions of data fusion and kernel matrix completion, applied on biological data sets to be used for classification task. We first introduced an objective function that will be minimized by exploiting the em algorithm, which in turn results to an estimate of the missing entries of the kernel matrices involved. The completed kernel matrices are then combined to produce a model matrix that can be used to further improve the obtained estimates. An interesting result of our study is that the E-step and the M-step are given in closed form, which makes our algorithm efficient in terms of time and memory. After completion, the ( completed) kernel matrices are then used to train an SVM classifier to test how well the relationships among the entries are preserved. Our empirical results show that the proposed algorithm bested the traditional completion techniques in preserving the relationships among the data points, and in accurately recovering the missing kernel matrix entries. By far, MKMC offers a promising solution to the problem of mutual estimation of a number of relevant incomplete kernel matrices.

关键词： kernel matrix completion em algorithm Kullback-Leibler divergence support vector machine (SVM) data fusion

来源：评论

学校读者我要写书评

暂无评论

Relabel mixture models via modal clustering

引用

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION 2017年第5期46卷 3406-3418页

作者： Wu, Qiang Yao, Weixin East Carolina Univ Dept Biostat Greenville NC 27834 USA Univ Calif Riverside Dept Stat Riverside CA 92521 USA

Effectively solving the label switching problem is critical for both Bayesian and Frequentist mixture model analyses. In this article, a new relabeling method is proposed by extending a recently developed modal clustering algorithm. First, the posterior distribution is estimated by a kernel density from permuted MCMC or bootstrap samples of parameters. Second, a modal em algorithm is used to find the m! symmetric modes of the KDE. Finally, samples that ascend to the same mode are assigned the same label. Simulations and real data applications demonstrate that the new method provides more accurate estimates than many existing relabeling methods.

关键词： Bayesian analysis em algorithm Finite mixture models Kernel density estimation Label switching Modal clustering 62F15 62G05 62H30

来源：评论

学校读者我要写书评

暂无评论

Profile maximal likelihood estimation for non linear mixed models with longitudinal data

引用

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS 2017年第9期46卷 4449-4463页

作者： Li, Zaixing China Univ Min & Technol Dept Math Beijing Peoples R China China Univ Min & Technol State Key Lab Coal Resource & Safe Min Beijing Peoples R China

In this article, the profile maximal likelihood estimate (PMLE) is proposed for non linear mixed models (NLMMs) with longitudinal data where the variance components are estimated by the expectation-maximization (em) algorithm. Strong consistency and the asymptotic normality of the estimators are derived. A simulation study is conducted where the performance of the PLME and the Fishing scoring estimate (FSE) in literatures are compared. Moreover, a real data is also analyzed to investigate the empirical performance of the procedure.

关键词： Asymptotic properties em algorithm NLMM PMLE

来源：评论

学校读者我要写书评

暂无评论

Delay Network Tomography Using a Partially Observable Bivariate Markov Chain

引用

IEEE-ACM TRANSACTIONS ON NETWORKING 2017年第1期25卷 126-138页

作者： Rad, Neshat Etemadi Ephraim, Yariv Mark, Brian L. GEICO Chevy Chase MD 20815 USA George Mason Univ Dept Elect & Comp Engn Fairfax VA 22030 USA

Estimation of link delay densities in a computer network, from source-destination delay measurements, is of great importance in analyzing and improving the operation of the network. In this paper, we develop a general approach for estimating the density of the delay in any link of the network, based on continuous-time bivariate Markov chain modeling. The proposed approach also provides the estimates of the packet routing probability at each node, and the probability of each source-destination path in the network. In this approach, the states of one process of the bivariate Markov chain are associated with nodes of the network, while the other process serves as an underlying process that affects statistical properties of the node process. The node process is not Markov, and the sojourn time in each of its states is phase-type. Phase-type densities are dense in the set of densities with non-negative support. Hence, they can be used to approximate arbitrarily well any sojourn time distribution. Furthermore, the class of phase-type densities is closed under convolution and mixture operations. We adopt the expectation-maximization (em) algorithm of Asmussen, Nerman, and Olsson for estimating the parameter of the bivariate Markov chain. We demonstrate the performance of the approach in a numerical study.

关键词： Delay network tomography bivariate Markov chain em algorithm

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：