Search Results - Inner Mongolia University Library

Multivariate semiparametric control charts for mixed-type data

Statistical Methods in Medical Research, 2023, Vol. 32, No. 4, pp. 671-690

Authors: Sofikitou, Elisavet M.; Markatou, Marianthi; Koutras, Markos V.

Affiliations: SUNY Buffalo, Sch Publ Hlth & Hlth Profess, Dept Biostat, Buffalo, NY, USA; Univ Piraeus, Sch Finance & Stat, Dept Stat & Insurance Sci, Piraeus, Greece; US FDA, Ctr Devices & Radiol Health (CDRH), Off Prod Evaluat & Qual (OPEQ), Silver Spring, MD, USA; SUNY Buffalo, Sch Publ Hlth & Hlth Profess, Dept Biostat, 726 Kimball Hall, Buffalo, NY 14214, USA

A useful tool that has gained popularity in the Quality Control area is the control chart, which monitors a process over time, identifies potential changes, helps explain variation, and ultimately supports improving the quality and performance of the process. This article introduces a new class of multivariate semiparametric control charts for monitoring multivariate mixed-type data, which comprise both continuous and discrete random variables (rvs). Our methodology leverages ideas from clustering and Statistical Process Control to develop control charts for mixed-type data. We propose four control chart schemes based on modified versions of the KAy-means for MIxed LArge data (KAMILA) clustering algorithm, where we assume that the two existing clusters represent the reference and the test sample. The charts are semiparametric: the continuous rvs follow a distribution that belongs to the class of elliptical distributions, while the categorical rvs follow a multinomial distribution. We present the algorithmic procedures and study the characteristics of the new control charts. The performance of the proposed schemes is evaluated on the basis of the False Alarm Rate and the in-control Average Run Length. Finally, we demonstrate the effectiveness and applicability of the proposed methods using real-world data.

Keywords: Artificial intelligence; average run length; clustering; false alarm rate; KAMILA algorithm; kernel density estimation


Optimization of the Numeric and Categorical Attribute Weights in KAMILA Mixed Data Clustering Algorithm

Citations: 1

20th International Conference on Intelligent Data Engineering and Automated Learning (IDEAL)

Authors: Martarelli, Nadia Junqueira; Nagano, Marcelo Seido

Affiliation: Univ Sao Paulo, Lab Appl Operat Res, BR-13566590 Sao Paulo, Brazil

ISBN (digital): 9783030336073

ISBN (print): 9783030336073; 9783030336066

Mixed data clustering algorithms have been emerging slowly since the end of the last century. One of the most recent algorithms proposed for this data type is the KAMILA (KAy-means for MIxed LArge data) algorithm. While KAMILA has outperformed previous mixed data algorithms, it has some gaps. Among them is the definition of the numerical and categorical variable weights, which are user-defined parameters or, by default, equal to one for all features. Hence, we propose an optimization algorithm called Biased Random-Key Genetic Algorithm for Features Weighting (BRKGAFW) to weight the numerical and categorical variables in the KAMILA algorithm. The experiment relied on six real-world mixed data sets and two baseline algorithms for comparison: KAMILA with the default weight definition, and KAMILA with weights defined by a traditional genetic algorithm. The results show that the proposed algorithm outperformed the baseline algorithms on all data sets.

Keywords: Attributes weighting; Mixed data clustering; Biased Random-Key Genetic algorithm; KAMILA algorithm
