检索结果-内蒙古大学图书馆

Estimation and variable selection for mixture of joint mean and variance models

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS 2021年第24期50卷 6081-6098页

作者： Wu, Liucang Li, Shuangshuang Tao, Ye Kunming Univ Sci & Technol Fac Sci Kunming 650093 Yunnan Peoples R China Fujian Normal Univ Coll Math & Informat Fuzhou Peoples R China Guangzhou Univ Sch Econ & Stat Guangzhou Peoples R China

Mixture of regression models are one of the most important statistical data analysis tools in a heterogeneous population. Similar to modeling variance parameter in a homogeneous population, we apply the idea of joint mean and variance models to the mixture of regression models and propose a new class of models: mixture of joint mean and variance models to analyze the heteroscedastic normal data coming from a heterogeneous population in this paper. The problem of variable selection for the proposed models is considered. In particular, a modified Expectation-Maximization (em) algorithm for estimating the model parameters is developed. The consistency and the oracle property of the penalized estimators are established. Properties of the estimators of the regression coefficients are evaluated through Monte Carlo simulations. Finally, a real data analysis is illustrated by the proposed methodologies.

关键词： Mixture of regression models mixture of joint mean and variance models em algorithm variable selection heterogeneous population

来源：评论

学校读者我要写书评

暂无评论

Regression Analysis of Doubly Censored Data with a Cured Subgroup under a Class of Promotion Time Cure Models

引用

Acta Mathematica Sinica,English Series 2021年第6期37卷 835-853页

作者： Min CAI Li Qun XIAO Shu Wei LI School of Economics and Statistics Guangzhou UniversityGuangzhou 510006P.R.China

In some situations,the failure time of interest is defined as the gap time between two related events and the observations on both event times can suffer either right or interval *** data are usually referred to as doubly censored data and frequently encountered in many clinical and observational ***,there may also exist a cured subgroup in the whole population,which means that not every individual under study will experience the failure time of interest *** this paper,we consider regression analysis of doubly censored data with a cured subgroup under a wide class of flexible transformation cure ***,we consider marginal likelihood estimation and develop a two-step approach by combining the multiple imputation and a new expectation-maximization(em)algorithm for its *** resulting estimators are shown to be consistent and asymptotically *** finite sample performance of the proposed method is investigated through simulation *** proposed method is also applied to a real dataset arising from an AIDS cohort study for illustration.

关键词： Doubly censored data marginal likelihood em algorithm multiple imputation transformation cure models

来源：评论

学校读者我要写书评

暂无评论

Identification of switched linear systems based on expectation-maximization and Bayesian algorithms

引用

TRANSACTIONS OF THE INSTITUTE OF MEASURemENT AND CONTROL 2021年第2期43卷 412-420页

作者： Chai, Xiujun Wang, Hongwei Ji, Xinru Wang, Lin Xinjiang Univ Sch Elect Engn Urumqi 830047 Peoples R China Dalian Univ Technol Sch Control Sci & Engn Dalian Peoples R China

This study aims to determine how to deal with the identification from input and output data of switched linear systems (SLSs) with Box and Jenkins models. The identification difficulties of this system are that there exist unknown switched signal, unknown middle variables, and colored noise terms in the identification process. To address these issues, the proposed identification method proceeds in two stages, including the estimation of the switched signal of SLSs and the identification of the parameters of all subsystems. First, the Gaussian mixture model is established to represent the distribution of the input and output data of SLSs. Then, the posterior probability is calculated by the expectation-maximization (em) algorithm and the naive Bayes classifier, and the switched signal is estimated according to the maximum probability criterion. Next, the auxiliary model based multi-innovation generalized extended least square (AM-MI-GELS) algorithm is used to estimate the parameters of all subsystems. Finally, the effectiveness of the proposed method is verified through the simulation example.

关键词： Switched linear systems mode detection em algorithm naive Bayes classifier parameter identification auxiliary model

来源：评论

学校读者我要写书评

暂无评论

A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation

引用

BIOMETRICS 2021年第2期77卷 424-438页

作者： Zhao, Kaiqiong Oualkacha, Karim Lakhal-Chaieb, Lajmi Labbe, Aurelie Klein, Kathleen Ciampi, Antonio Hudson, Marie Colmegna, Ines Pastinen, Tomi Zhang, Tieyuan Daley, Denise Greenwood, Celia M. T. McGill Univ Dept Epidemiol Biostat & Occupat Hlth Montreal PQ Canada Univ Quebec Montreal Dept Math Montreal PQ Canada Univ Laval Dept Math & Stat Quebec City PQ Canada HEC Montreal Dept Sci Decis Montreal PQ Canada Lady Davis Inst Med Res Montreal PQ Canada McGill Univ Dept Med Montreal PQ Canada McGill Univ Res Inst Hlth Ctr Montreal PQ Canada Childrens Mercy Kansas City Ctr Pediat Genom Med Kansas City MO USA McGill Univ Douglas Mental Hlth Univ Inst Dept Psychiat Montreal PQ Canada Univ British Columbia Ctr Heart Lung Innovat Vancouver BC Canada Univ British Columbia Dept Med Vancouver BC Canada McGill Univ Dept Human Genet Montreal PQ Canada McGill Univ Gerald Bronfman Dept Oncol Montreal PQ Canada

Identifying disease-associated changes in DNA methylation can help us gain a better understanding of disease etiology. Bisulfite sequencing allows the generation of high-throughput methylation profiles at single-base resolution of DNA. However, optimally modeling and analyzing these sparse and discrete sequencing data is still very challenging due to variable read depth, missing data patterns, long-range correlations, data errors, and confounding from cell type mixtures. We propose a regression-based hierarchical model that allows covariate effects to vary smoothly along genomic positions and we have built a specialized em algorithm, which explicitly allows for experimental errors and cell type mixtures, to make inference about smooth covariate effects in the model. Simulations show that the proposed method provides accurate estimates of covariate effects and captures the major underlying methylation patterns with excellent power. We also apply our method to analyze data from rheumatoid arthritis patients and controls. The method has been implemented in R package SOMNiBUS.

关键词： differentially methylated region em algorithm generalized additive model next-generation sequencing penalized regression splines

来源：评论

学校读者我要写书评

暂无评论

Computational characterization of double reduction in autotetraploid natural populations

引用

PLANT JOURNAL 2021年第6期105卷 1703-1709页

作者： Jiang, Libo Ren, Xiangyu Wu, Rongling Beijing Forestry Univ Beijing Adv Innovat Ctr Tree Breeding Mol Design Beijing 100083 Peoples R China Beijing Forestry Univ Coll Biol Sci & Technol Ctr Computat Biol Beijing 100083 Peoples R China Penn State Univ Ctr Stat Genet Dept Publ Hlth Sci Hershey PA 17033 USA Penn State Univ Ctr Stat Genet Dept Stat Hershey PA 17033 USA

Population genetic theory has been well developed for diploid species, but its extension to study genetic diversity, variation and evolution in autopolyploids, a class of polyploids derived from the genome doubling of a single ancestral species, requires the incorporation of multisomic inheritance. Double reduction, which is characteristic of autopolyploidy, has long been believed to shape the evolutionary consequence of organisms in changing environments. Here, we develop a computational model for testing and estimating double reduction and its genomic distribution in autotetraploids. The model is implemented with the expectation-maximization (em) algorithm to dissect unobservable allelic recombinations among multiple chromosomes, enabling the simultaneous estimation of allele frequencies and double reduction in natural populations. The framework fills an important gap in the population genetic theory of autopolyploids.

关键词： double reduction autopolyploid SNP em algorithm natural population technical advance

来源：评论

学校读者我要写书评

暂无评论

The single-index panel data models with heterogeneous link function: mixture approach

引用

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION 2021年第8期50卷 2418-2431页

作者： Nademi, Arash Islamic Azad Univ Dept Stat Ilam Branch Ilam Iran

This paper investigates the data generating structure which can be represented as a mixture of single-index panel data model with heterogeneous link function. The switching between the states is governed by a hidden variable. We also offer an Expectation Maximization (em) algorithm for estimating parameters numerically. The ability of the proposed mixture model will be illustrated with both the simulated performance and the empirical applications.

关键词： Mixture models Single-index panel data Hidden variables Heterogeneous link function em algorithm

来源：评论

学校读者我要写书评

暂无评论

GMM Based Adaptive Thresholding for Uneven Lighting Image Binarization

引用

JOURNAL OF SIGNAL PROCESSING SYSTemS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY 2021年第11期93卷 1253-1270页

作者： Pattnaik, Tapaswini Kanungo, Priyadarshi CV Raman Global Univ Bhubaneswar India

Image binarization of uneven lighted images, using thresholding techniques, is still a challenging task. Adaptive thresholding methods are the widely adopted approaches for binarization of uneven lighting images. However, the efficacy of these adaptive thresholding methods is highly sensitive to the criteria function used for measuring the bimodal property of the gray level distribution of a local region. In this paper, we propose Gaussian Mixture Model (GMM) which is based on adaptive thresholding for binarizing uneven lighting images. The proposed GMM based criteria function efficiently partitioning the uneven light images into bimodal and unimodal subimages with low uneven light effect. At first, the bimodal subimages are binarized using Otsu's thresholding approach, followed by unimodal subimages being thresholded using the bilinear interpolation of neighbouring thresholds of bimodal subimages. Next a fast Expectation Maximization(em) algorithm is developed to reduce the computational complexity of the GMM. Experimental results on different uneven light images demonstrate that the proposed adaptive thresholding outperforms the other considered methods with an avg. misclassification error of 1.68 % and an average computation time of 1.50 seconds. The computational time can be further reduced by a specially purposed hardware and parallel processing of each subimages for real time applications.

关键词： Segmentation Thresholding Uneven lighting image Gaussian mixture model em algorithm

来源：评论

学校读者我要写书评

暂无评论

Influence of Entrepreneur Social Network on New Product Development Performance of Enterprises under Intelligent Big Data

引用

International Journal of High Speed Electronics and Systems 2025年第3期34卷

作者： Su, Yuantao College of Economics and Management Quanzhou University of Information Engineering Fujian Quanzhou362000 China SEGi University Selangor 47810 Malaysia

In the complex market environment, it is difficult for enterprises to innovate only by their own internal product resources. In order to improve the performance level of new product development, this paper puts forward the influence of entrepreneur social network on the performance of new product development under smart big data. The process of new product development is modeled, and the influence of entrepreneur social network on enterprise new product development performance is analyzed from two aspects: vertical network quality and horizontal network quality. According to the analysis results, build an intelligent big data platform, and use this platform to realize the mining and analysis of enterprise-related data;On this basis, em algorithm is used to mine the performance data of enterprise new product development, DEA/AHP model is used to evaluate the performance level of enterprise new product development, and the evaluation results are combined with the influence model of entrepreneur social network on enterprise new product development performance to realize the influence analysis of entrepreneur social network on enterprise new product development performance. The experimental results show that the method is highly recognized by experts, which shows that the analysis results are reliable, and the correlation between entrepreneur social network and enterprise new product development performance can be obtained, which provides reference for enterprise new product development. © 2025 World Scientific Publishing Company.

关键词： Intelligent big data development performance social networks of entrepreneurs em algorithm DEA/AHP model

来源：评论

学校读者我要写书评

暂无评论

A penalized approach to mixed model selection via cross-validation

引用

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS 2021年第11期50卷 2481-2507页

作者： Xiong, Jingwei Shang, Junfeng First Solar Co Perrysburg OH USA Bowling Green State Univ Dept Math & Stat Bowling Green OH 43403 USA

Mixed models play an important role for describing data in various fields, and accordingly selecting the most appropriate mixed model is an appealing topic in model selection literature. To achieve the goal of selecting the most appropriate mixed model, we propose a procedure to jointly select the fixed and random effects by implementing the adaptive Lasso (Zou 2006) penalized methodology via cross-validation. In the procedure, the application of cross-validation can effectively lower the risk of selecting overfitting models. The data are divided into training and test sets, where the training set is utilized for constructing candidate models and the test set is utilized for choosing the most appropriate mixed model. To boost the computational efficiency in the estimation and in the selection of mixed models, we adopt the em algorithm to optimize the penalized likelihood. Theoretical properties are founded to prove that the proposed approach possesses the consistency and oracle properties. The simulations and a real data example are provided to justify the validity of the procedure.

关键词： Linear mixed models penalized variable selection complete log-likelihood train and test data sets em algorithm

来源：评论

学校读者我要写书评

暂无评论

GemBag: Group Estimation of Multiple Bayesian Graphical Models

引用

JOURNAL OF MACHINE LEARNING RESEARCH 2021年第1期22卷 1-48页

作者： Yang, Xinming Gan, Lingrui Narisetty, Naveen N. Liang, Feng Univ Illinois Dept Stat Champaign IL 61820 USA Facebook Menlo Pk CA USA

In this paper, we propose a novel hierarchical Bayesian model and an efficient estimation method for the problem of joint estimation of multiple graphical models, which have similar but different sparsity structures and signal strength. Our proposed hierarchical Bayesian model is well suited for sharing of sparsity structures, and our procedure, called as GemBag, is shown to enjoy optimal theoretical properties in terms of l(infinity) norm estimation accuracy and correct recovery of the graphical structure even when some of the signals are weak. Although optimization of the posterior distribution required for obtaining our proposed estimator is a non-convex optimization problem, we show that it turns out to be convex in a large constrained space facilitating the use of computationally efficient algorithms. Through extensive simulation studies and an application to a bike sharing data set, we demonstrate that the proposed GemBag procedure has strong empirical performance in comparison with alternative methods.

关键词： graphical models Bayesian regularization spike-and-slab priors selection consistency non-convex optimization em algorithm

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：