检索结果-内蒙古大学图书馆

Multi-grade principal component analysis for fault detection with multiple production grades

CHemOMETRICS AND INTELLIGENT LABORATORY SYSTemS 2018年 175卷 20-29页

作者： Zhou, Le Chen, Junghui Hou, Beiping Song, Zhihuan Zhejiang Univ Sci & Technol Sch Automat & Elect Engn Hangzhou 310023 Zhejiang Peoples R China Chung Yuan Christian Univ Dept Chem Engn Taoyuan 32023 Taiwan Zhejiang Univ State Key Lab Ind Control Technol Hangzhou 310027 Zhejiang Peoples R China

In many chemical industries, a production line usually produces various products with different grades to meet the demands of the worldwide market. A process with multiple grades is not suitable to be described using a traditional single model. In this paper, a multi-grade principal component analysis (MGPCA) model is proposed for multi-grade process modeling and fault detection purposes. The proposed MGPCA can use the measurements from different grades with unequal sizes and to extract the essential information from the multi-grade process. The model is derived in a probabilistic framework and the corresponding parameters are estimated by the expectation-maximization algorithm. Finally, a simulated case and a real industrial polyethylene process with multiple grades are tested to evaluate the property of the proposed method.

关键词： em algorithm Fault detection Multi-grade principal component analysis Multi-grade process

来源：评论

学校读者我要写书评

暂无评论

Spatial Variability in Slash Linear Modeling with Finite Second Moment

引用

JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS 2018年第2期23卷 276-296页

作者： Fagundes, R. S. Uribe-Opazo, M. A. Galea, M. Guedes, L. P. C. Fed Technol Univ Parana Toledo PR Brazil Western Parana State Univ Cascavel PR Brazil Pontificia Univ Catolica Chile Dept Estadist Santiago Chile

This article studies the dependence of spatial linear models using a slash distribution with a finite second moment. The parameters of the model are estimated with maximum likelihood by using the em algorithm. To avoid identifiability problems, the cross-validation, the Trace and the maximum log-likelihood value are used to choose the parameter for adjusting the kurtosis of the slash distribution and the selection of the model to explain the spatial dependence. We present diagnostic techniques of global and local influences for exploring the sensibility of estimators and the presence of possible influential observations. A simulation study is developed to determine the performance of the methodology. The results showed the effectiveness of the choice criteria of the parameter for adjusting the kurtosis and for the selection of the spatial dependence model. It has also showed that the slash distribution provides an increased robustness to the presence of influential observations. As an illustration, the proposed model and its diagnostics are used to analyze an aquifer data. The spatial prediction with and without the influential observations were compared. The results show that the contours of the interpolation maps and prediction standard error maps showed low changes when we removed the influential observations. Thus, this model is a robust alternative in the spatial linear modeling for dependent random variables. Supplementary materials accompanying this paper appear online.

关键词： em algorithm Analysis of spatial data Geostatistics Global and local influence Robust modeling

来源：评论

学校读者我要写书评

暂无评论

Detecting Spammer Groups From Product Reviews: A Partially Supervised Learning Model

引用

IEEE ACCESS 2018年 6卷 2559-2568页

作者： Zhang, Lu Wu, Zhiang Cao, Jie Nanjing Univ Finance & Econ Jiangsu Prov Key Lab E Business Nanjing 210023 Jiangsu Peoples R China

Nowadays, online product reviews play a crucial role in the purchase decision of consumers. A high proportion of positive reviews will bring substantial sales growth, while negative reviews will cause sales loss. Driven by the immense financial profits, many spammers try to promote their products or demote their competitors' products by posting fake and biased online reviews. By registering a number of accounts or releasing tasks in crowdsourcing platforms, many individual spammers could be organized as spammer groups to manipulate the product reviews together and can be more damaging. Existing works on spammer group detection extract spammer group candidates from review data and identify the real spammer groups using unsupervised spamicity ranking methods. Actually, according to the previous research, labeling a small number of spammer groups is easier than one assumes, however, few methods try to make good use of these important labeled data. In this paper, we propose a partially supervised learning model (PSGD) to detect spammer groups. By labeling some spammer groups as positive instances, PSGD applies positive unlabeled learning (PU-Learning) to study a classifier as spammer group detector from positive instances (labeled spammer groups) and unlabeled instances (unlabeled groups). Specifically, we extract reliable negative set in terms of the positive instances and the distinctive features. By combining the positive instances, extracted negative instances and unlabeled instances, we convert the PU-Learning problem into the well-known semi supervised learning problem, and then use a Naive Bayesian model and an em algorithm to train a classifier for spammer group detection. Experiments on real-life *** data set show that the proposed PSGD is effective and outperforms the state-of-the-art spammer group detection methods.

关键词： Spammer group detection partially supervised learning positive unlabeled learning reliable negative set extraction Naive Bayesian model em algorithm

来源：评论

学校读者我要写书评

暂无评论

On the mixtures of Weibull and Pareto (IV) distribution: An alternative to Pareto distribution

引用

COMMUNICATIONS IN STATISTICS-THEORY AND METHODS 2018年第9期47卷 2073-2084页

作者： Ghosh, I. Hamedani, G. G. Bansal, N. Maadooliat, M. Univ N Carolina Dept Math & Stat 601 S Coll Rd Wilmington NC 28403 USA Marquette Univ Dept Math Stat & Comp Sci Milwaukee WI 53233 USA

Finite mixture models have provided a reasonable tool to model various types of observed phenomena, specially those which are random in nature. In this article, a finite mixture of Weibull and Pareto (IV) distribution is considered and studied. Some structural properties of the resulting model are discussed including estimation of the model parameters via expectation maximization (em) algorithm. A real-life data application exhibits the fact that in certain situations, this mixture model might be a better alternative than the rival popular models.

关键词： em algorithm Maximum likelihood Mixtures of Weibull and Pareto

来源：评论

学校读者我要写书评

暂无评论

Best unbiased prediction of order statistics in stable distributions

引用

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION 2018年第8期47卷 2424-2435页

作者： Almasi, Isaac Mohammadpour, Adel Mohammadi, Mohammad Amirkabir Univ Technol Fac Math & Comp Sci Dept Stat Tehran Polytech 424 Hafez Ave Tehran Iran Behbahan Khatam Alanbia Univ Technol Fac Basic Sci Dept Stat Behbahan Iran

We introduce the best unbiased prediction of missing order statistics of a stable distribution, based on conditional expected value. We present necessary and sufficient conditions for the existence of conditional moments of stable order statistics. These conditions enable us to compute unknown parameters using the expectation-maximization algorithm. We reveal the efficiency of the presented method through a simulation study.

关键词： em algorithm Order statistics Prediction Stable distribution 62G30 60E07

来源：评论

学校读者我要写书评

暂无评论

Soft-Information Aided Channel Estimation With IQ Imbalance for Alternate-Relaying OFDM Cooperative Systems

引用

IEEE WIRELESS COMMUNICATIONS LETTERS 2018年第3期7卷 308-311页

作者： Marey, Mohamed Menoufia Univ Fac Elect Engn Menoufia 32952 Egypt Prince Sultan Univ Coll Engn Riyadh 11586 Saudi Arabia

In this letter, we exploit the feature of data redundancy associated with alternate-relaying cooperative systems to develop an iterative channel estimation algorithm in the context of orthogonal frequency division multiplexing (OFDM) transmission. Our attention is also focused on the problem of in-phase/quadrature-phase (IQ) imbalance which is typically associated with OFDM transmission. Analytical analysis indicates that instead of estimating a family of parameters including IQ imbalance occurring at the source, relays, and destination, and channel impulse responses (CIRs) between the source-destination link, and relays-destination links, we can estimate one parameter called the equivalent CIR. In addition, we illustrate how to perform data detection using the estimated parameter. By employing expectation-maximization algorithm, we show that soft information provided by the detector can be combined with pilot symbols in an efficient way to enhance the estimation process. Simulations experiments have confirmed the efficiency of the proposed approach.

关键词： Cooperative systems OFDM em algorithm

来源：评论

学校读者我要写书评

暂无评论

Weighted Weibull distribution: Bivariate and multivariate cases

引用

BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS 2018年第1期32卷 20-43页

作者： Al-Mutairi, D. K. Ghitany, M. E. Kundu, Debasis Kuwait Univ Fac Sci Dept Stat & Operat Res POB 5969 Safat 13060 Kuwait Indian Inst Technol Kanpur Dept Math & Stat Kanpur 208016 Uttar Pradesh India

Gupta and Kundu (Statistics 43 (2009) 621-643) introduced a new class of weighted exponential distribution and established its several properties. The probability density function of the proposed weighted exponential distribution is unimodal and it has an increasing hazard function. Following the same line Shahbaz, Shahbaz and Butt (Pak. J. Stat. Oper. Res. VI (2010) 53-59) introduced weighted Weibull distribution, and we derive several new properties of this weighted Weibull distribution. The main aim of this paper is to introduce bivariate and multivariate distributions with weighted Weibull marginals and establish their several properties. It is shown that the hazard function of the weighted Weibull distribution can have increasing, decreasing and inverted bathtub shapes. The proposed multivariate model has been obtained as a hidden truncation model similarly as the univariate weighted Weibull model. It is observed that to compute the maximum likelihood estimators of the unknown parameters for the proposed p-variate distribution, one needs to solve (p + 2) non-linear equations. We propose to use the em algorithm to compute the maximum likelihood estimators of the unknown parameters. We obtain the observed Fisher information matrix, which can be used for constructing asymptotic confidence intervals. One data analysis has been performed for illustrative purposes, and it is observed that the proposed em algorithm is very easy to implement, and the performance is quite satisfactory.

关键词： Hidden truncation model maximum likelihood estimator failure rate em algorithm bootstrap confidence intervals

来源：评论

学校读者我要写书评

暂无评论

Estimation for finite mixture of simplex models: applications to biomedical data

引用

STATISTICAL MODELLING 2018年第2期18卷 129-148页

作者： Lopez Quintero, Freddy Omar Contreras-Reyes, Javier E. Univ Tecn Federico Santa Maria Dept Matemat Ave Espana 1680 Valparaiso 2390123 Chile Inst Fomento Pesquero Div Invest Pesquera Valparaiso Chile Univ Valparaiso Inst Estadist Valparaiso Chile

Simplex distribution has been proved useful for modelling double-bounded variables in data directly. Yet, it is not sufficient for multimodal distributions. This article addresses the problem of estimating a density when data is restricted to the (0,1) interval and contains several modes. Particularly, we propose a simplex mixture model approach to model this kind of data. In order to estimate the parameters of the model, an Expectation Maximization (em) algorithm is developed. The parameter estimation performance is evaluated through simulation studies. Models are explored using two real datasets: i) gene expressions data of patients' survival times and the relation to adenocarcinoma and ii) magnetic resonant images (MRI) with a view in segmentation. In the latter case, given that data contains zeros, the main model is modified to consider the zero-inflated setting.

关键词： simplex distribution Finite mixture zero-inflated models simulation em algorithm MRI

来源：评论

学校读者我要写书评

暂无评论

Measurement Error Models for Replicated Data Under Asymmetric Heavy-Tailed Distributions

引用

COMPUTATIONAL ECONOMICS 2018年第2期52卷 531-553页

作者： Cao, Chunzheng Wang, Yahui Shi, Jian Qing Lin, Jinguan Nanjing Univ Informat Sci & Technol Sch Math & Stat Nanjing 210044 Jiangsu Peoples R China Seoul Natl Univ Dept Stat Seoul 151742 South Korea Univ Newcastle Sch Math & Stat Newcastle NE1 7RU England Nanjing Audit Univ Dept Stat Nanjing 211815 Jiangsu Peoples R China

Replicated data with measurement errors are frequently presented in economical, environmental, chemical, medical and other fields. In this paper, we discuss a replicated measurement error model under the class of scale mixtures of skew-normal distributions, which extends symmetric heavy and light tailed distributions to asymmetric cases. We also consider equation error in the model for displaying the matching degree between the true covariate and response. Explicit iterative expressions of maximum likelihood estimates are provided via the expectation-maximization type algorithm. empirical Bayes estimates are conducted for predicting the true covariate and response. We study the effectiveness as well as the robustness of the maximum likelihood estimations through two simulation studies. The method is applied to analyze a continuing survey data of food intakes by individuals on diet habits.

关键词： em algorithm Equation error Food intakes by individuals Replicated measurement Robustness Scale mixtures of skew-normal distributions

来源：评论

学校读者我要写书评

暂无评论

Probability distribution of wave periods in combined sea states with finite mixture models

引用

APPLIED OCEAN RESEARCH 2019年 92卷 1页

作者： Huang, Weinan Dong, Sheng Ocean Univ China Coll Engn Qingdao 266100 Shandong Peoples R China

The short-term distribution of wave periods is very important for ocean and coastal engineering applications. At present, the vast majority of research studies are confined to single-wave systems. Most available theoretical distributions of wave periods, which are based on the narrowband approximation, are inapplicable to actual sea states. This study focuses on the probability distribution of individual wave periods in combined sea states with two parametric mixture distribution models. The expectation-maximisation (em) algorithm is used to calculate the maximum likelihood estimates of the mixture models. Further, the mixture distributions are compared with other two models: a theoretical and a parametric model. In situ-measured data with two-peaked spectra and simulated data obtained with the six-parameter Ochi-Hubble model allow for a thorough assessment of the distribution models. The patterns of the distributions of wave periods in nine types of mixed sea states are considered and discussed. According to the results, the theoretical distribution model is unsuitable for the description of the distributions in mixed sea states;in particular, when the patterns exhibit bimodal characters. By contrast, despite having a higher calculation complexity, the mixture distribution models provide an improved performance for all combined-sea state cases.

关键词： Wave periods Short-term distribution Parametric model Mixture model em algorithm Combined sea states

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：