检索结果-内蒙古大学图书馆

Penalized estimation in finite mixture of ultra-high dimensional regression models

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS 2022年第17期51卷 5971-5992页

作者： Tang, Shiyi Zheng, Jiali Shanghai Univ Finance & Econ Sch Stat & Management Shanghai Peoples R China

In this paper, we propose a penalized estimation method for finite mixture of ultra-high dimensional regression models. A two-step procedure is explored. Firstly, we conduct order selection with the number of components unknown. Then variable selection is applied to ultra-high dimensional regression models. A specific em algorithm is designed to maximize penalized log-likelihood function. We demonstrate our method by numerical simulations which performs well. Further, an empirical study of return on equity (ROE) prediction is shown to consolidate our methodology.

关键词： Finite mixture of regression models ultra-high dimensional regression em algorithm variable selection order selection

来源：评论

学校读者我要写书评

暂无评论

Estimation of stress-strength reliability using discrete phase type distribution

引用

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS 2022年第2期51卷 368-386页

作者： Jose, Joby K. Drisya, M. Manoharan, M. Kannur Univ Dept Stat Sci Kannur 670567 Kerala India Univ Calicut Dept Stat Calicut Kerala India

In this paper, the stress-strength reliability of single and multi-component systems are estimated assuming discrete phase type distribution for stress and strength components. The systems with strength following mixture of discrete phase type distributions is also considered. Matrix based expressions are obtained for stress-strength reliability and its maximum likelihood estimate is obtained using em algorithm. The numerical illustration using various special cases of discrete phase type distribution like geometric, negative binomial, generalized negative binomial and different mixtures of discrete distributions are also carried out.

关键词： Discrete phase type distribution mixture distribution stress-strength reliability em algorithm

来源：评论

学校读者我要写书评

暂无评论

A Statistical Modeling Framework for DCT Coefficients of Tampered JPEG images and Forgery Localization

引用

IEEE ACCESS 2022年 10卷 71143-71164页

作者： Nhan Le Retraint, Florent Univ Technol Troyes Comp Sci & Digital Soc Lab LIST3N F-10004 Troyes France

Various manipulations on JPEG images introduce single and multiple compression artifacts for forged and unmodified areas respectively. Based on the statistical analysis of JPEG compression cycle and on the finite mixture paradigm, we propose in this paper a modeling framework for AC DCT coefficients of such tampered JPEG images. Its accuracy is numerically assessed using the Kullback-Leibler divergence on the basis of a tampered JPEG image dataset built from six well-known uncompressed color image databases. To illustrate the framework utility, an application in image forgery localization is proposed. By formulating the localization as a clustering problem, we use the plug-in Bayes rule combined with a simple em algorithm to distinguish between forged and unmodified areas. Numerous experiments show that, when the quality factor of final JPEG compression is high enough, the proposed modeling framework yields higher localization performances in terms of F-1-score than prior art regardless of divers local manipulations.

关键词： DCT coefficients analysis em algorithm forgery localization multiple JPEG compression statistical image models tampered JPEG images

来源：评论

学校读者我要写书评

暂无评论

Parameter learning of stochastic Boolean networks

引用

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL 2022年第5期32卷 2472-2484页

作者： Chen, Hongwei Shen, Bo Donghua Univ Coll Informat Sci & Technol Shanghai 201620 Peoples R China Res Ctr Digitalized Text & Fash Technol Minist Educ Shanghai Peoples R China

In this article, the parameter learning problem is studied for stochastic Boolean networks (SBNs). Both the measure noise and the system noise are assumed to be white and modeled by sequences of Bernoulli distributed stochastic variables which are mutually independent. An algebraic representation of the SBNs is obtained by taking advantage of vector expression of logic variable and applying the semi-tensor product technique. Consequently, the parameter learning problem is reformulated as an optimization problem that makes it possible to identify the system matrices of SBNs in an efficient computation way. Subsequently, properties of forward and backward probabilities are investigated, and the em algorithm is utilized to learn the model parameters from time series data. Finally, a numerical experiment is presented to show the usefulness of the designed parameter learning algorithm.

关键词： em algorithm parameter learning semi-tensor product stochastic Boolean networks

来源：评论

学校读者我要写书评

暂无评论

Multi-Source Domain Adaptation via Latent Domain Reconstruction

Multi-Source Domain Adaptation via Latent Domain Reconstruct...

引用

32nd World Wide Web Conference (WWW)

作者： Zhou, Jun Fu, Chilin Zhang, Xiaolu Zhejiang Univ Coll Comp Sci & Technol Ant Grp Hangzhou Peoples R China Ant Grp Hangzhou Peoples R China

ISBN: (纸本)9781450394161

Multi-Source Domain Adaptation (MSDA) is widely used in various machine learning scenarios for domain shifts between labeled source domains and unlabeled target domains. Conventional MSDA methods are built on a strong hypothesis that data samples from the same source belong to the same domain with the same latent distribution. However, in practice sources and their latent domains are not necessarily one-to-one correspondence. To tackle this problem, a novel Multi-source Reconstructed Domain Adaptation (MRDA) framework for MSDA is proposed. We use an Expectation-Maximization (em) mechanism that iteratively reconstructs the source domains to recover the latent domains and performs domain adaptation on the reconstructed domains. Specifcally, in the E-step, we cluster the samples from multiple sources into diferent latent domains, and a soft assignment strategy is proposed to avoid cluster imbalance. In the M-step, we freeze the latent domains clustered in the E-step and optimize the objective function for domain adaptation, and a global-specifc feature extractor is used to capture both domain-invariant and domain-specifc features. Extensive experiments demonstrate that our approach can reconstruct source domains and perform domain adaptation on the reconstructed domains efectively, thus signifcantly outperforming state-of-the-art (SOTA) baselines (e.g., 1% to 3.1% absolute improvement in AUC).

关键词： Domain Adaptation Domain Reconstruction em algorithm

来源：评论

学校读者我要写书评

暂无评论

Inverse Gaussian processes with correlated random effects for multivariate degradation modeling

引用

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH 2022年第3期300卷 1177-1193页

作者： Fang, Guanqi Pan, Rong Wang, Yukun Zhejiang Gongshang Univ Sch Stat & Math Hangzhou 310018 Peoples R China Zhejiang Gongshang Univ Collaborat Innovat Ctr Stat Data Engn Technol & A Hangzhou 310018 Peoples R China Arizona State Univ Sch Comp & Augmented Intelligence Tempe AZ 85281 USA Tianjin Chengjian Univ Sch Econ & Management Tianjin 300384 Peoples R China

Many engineering products have more than one failure mode and the evolution of each mode can be monitored by measuring a performance characteristic (PC). It is found that the underlying multi-dimensional degradation often occurs with inherent process stochasticity and heterogeneity across units, as well as dependency among PCs. To accommodate these features, in this paper, we propose a novel multivariate degradation model based on the inverse Gaussian process. The model incorporates random effects that are subject to a multivariate normal distribution to capture both the unit-wise variability and the PC-wise dependence. Built upon this structure, we obtain some mathematically tractable properties such as the joint and conditional distribution functions, which subsequently facilitate the future degrada-tion prediction and lifetime estimation. An expectation-maximization algorithm is developed to infer the model parameters along with the validation tools for model checking. In addition, two simulation studies are performed to assess the performance of the inference method and to evaluate the effect of model misspecification. Finally, the application of the proposed methodology is demonstrated by two illustrative examples. (c) 2021 Elsevier B.V. All rights reserved.

关键词： Reliability Degradation process Dependence modeling em algorithm Lifetime distribution Multivariate model Random effects

来源：评论

学校读者我要写书评

暂无评论

Causal inference with missingness in confounder

引用

JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION 2022年第18期92卷 3917-3930页

作者： Bagmar, Md Shaddam Hossain Shen, Hua Univ Calgary Dept Math & Stat Calgary AB Canada Univ Dhaka Inst Stat Res & Training ISRT Dhaka Bangladesh

Causal inference is a process of uncovering causal relationship between effect variable and disease outcome in epidemiologic research. When estimating causal effect in observational studies, confounders that influence both the effect variable and the outcome need to be adjusted for in the estimation process. In addition, missing data often arise in data collection procedure;working with complete cases often results in biased parameter estimates. We consider the causal effect estimation in the presence of missingness in the confounders under the missing at random assumption. We investigate how the double robust estimators perform when applying complete-case analysis or multiple imputations. Given the uncertainty of appropriate imputation model and computational challenge for many imputations, we propose an expectation-maximization (em) algorithm to estimate the expected values of the missing confounder and utilize a weighting approach in the estimation of the average treatment effect. Simulation studies are conducted to see whether there is any gain in estimation efficiency using the proposed method, instead of the complete case analysis and multiple imputations. The results identified em as the most efficient and accurate method for dealing with missingness in confounder. Our study result is applied in a B-aware trial, which is a multi-centre clinical trial, to estimate the effect of total intravenous anaesthetic on post-operative anxiety.

关键词： Causal inference confounders missing at random double robustness em algorithm estimation efficiency

来源：评论

学校读者我要写书评

暂无评论

Valid properties of truncated Student-t regression model with applications in analysis of censored data

引用

BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS 2022年第1期36卷 157-184页

作者： Zhang, Chi Tian, Guo-Liang Zhai, Yibo Fei, Yu Shenzhen Univ Coll Econ Shenzhen 518055 Guangdong Peoples R China Southern Univ Sci & Technol Dept Stat & Data Sci Shenzhen 518055 Guangdong Peoples R China Yunnan Univ Finance & Econ Sch Stat & Math Kunming 650221 Yunnan Peoples R China

Kim (J. Korean Stat. Soc. 37 (2008) 81-87) introduced an incor-rect stochastic representation (SR) for the truncated Student-t (Tt) random variable. By pointing out that the gamma mixture based on a truncated nor-mal distribution actually cannot result in a true Tt distribution, in this paper, we first propose three correct SRs and then recalculate the corresponding moments of the Tt distribution. Different from those derived by following the invalid SR of Kim (J. Korean Stat. Soc. 37 (2008) 81-87), the correct moments of the Tt distribution play a crucial role in parameter estimations. Based on the third SR proposed and the correct expressions of truncated mo-ments, expectation-maximization (em) algorithms are developed for calcu-lating the maximum likelihood estimates of parameters in the Tt distribu-tion. Extensions to a Tt regression model and a t interval-censored regression model are provided as well. Simulated experiments are conducted to evalu-ate the performance of the proposed methods. Finally, two real data analyses corroborate the theoretical results.

关键词： em algorithm interval-censored regression model stochastic representation trun-cated Student-t distribution truncated Student-t regression model

来源：评论

学校读者我要写书评

暂无评论

Semi-Blind Channel Estimation in MIMO Systems With Discrete Priors on Data Symbols

引用

IEEE SIGNAL PROCESSING LETTERS 2022年 29卷 51-54页

作者： Al-Shoukairi, Maher Rao, Bhaskar D. Univ Calif San Diego Dept Elect & Comp Engn San Diego CA 92122 USA

In this work, we addressthe MIMO semi-blindchannel estimation problem. We propose an eigenvalue decomposition based technique to significantly reduce the dimensionality of the em based algorithm when the imposed prior on the data is Gaussian, greatly lowering the computational complexity. In addition to that, we apply the Minimum Power Distortionless Response (MPDR) decoupling principle to derive a tractable em algorithm that uses the actual discrete prior of the data symbols. Our results show that the proposed MPDR based algorithm has superior performance over other em based algorithms in both low and high SNR regions. The results also show that a faster version of the algorithm can be obtained by initializing it using the eigenvalue decomposition based Gaussian algorithm.

关键词： Channel estimation Covariance matrices Complexity theory Signal processing algorithms MIMO communication Signal to noise ratio Maximum likelihood estimation Channel estimation semi-blind em algorithm massive MIMO MPDR decoupling discrete priors

来源：评论

学校读者我要写书评

暂无评论

A Statistical Approach for Improving Image Quality and Cell Quantification from Imaging Mass Cytometry 9

A Statistical Approach for Improving Image Quality and Cell ...

引用

9th International Conference on Communication, Image and Signal Processing, CCISP 2024

作者： Xiao, Xu Lin, Yating Zhang, Lei Yang, Wenxian Yu, Rongshan School of Informatics Xiamen University Xiamen China Aginome Scientific Xiamen China

ISBN: (纸本)9798350356656

Imaging Mass Cytometry (IMC), a multiplexed imaging technology, has become a valuable tool in biomedical research due to its capability to measure over 100 markers theoretically. However, the presence of noise in IMC images can affect cell phenotyping and the precision of clinical analysis. To address this challenge, we propose IMCell, a highly effective and generalizable method that enhances image quality and cell quantification by leveraging biological priors. IMCell begins by generating decoy cells using Monte Carlo sampling and incorporates cell shape information for each protein channel. Subsequently, it estimates background noise from the expression of decoy cells by fitting Gaussian mixture models with the em algorithm. Additionally, it identifies positive cell expressions by comparing distribution patterns between real cells and decoy cells. Based on the analysis results from IMCell, we further introduce an image quality index for comprehensive image quality assessment by combining noise and positive cell expressions. Experiment results demonstrate the efficacy of our proposed approach in enhancing image quality and facilitating downstream analysis. By improving the quality of IMC images, IMCell holds the potential to enhance our understanding of complex biological systems and disease mechanisms. © 2024 IEEE.

关键词： Biomedical imaging em algorithm Gaussian mixture model Image quality enhancement IMC

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：