ISBN: (print) 9781538669563
In data mining, the EM algorithm is widely applied to incomplete data because of its numerical stability, simplicity of implementation, and reliable global convergence. Its main disadvantages are slow convergence and strong dependence on the choice of initial values. In this paper, the clustering result of the K-means algorithm is used as the initial value of the EM algorithm, with the initialization chosen according to the characteristics of the mining task; the incremental EM algorithm (IEM) then refines the estimate step by step through repeated EM iterations, obtaining optimal values for filling missing data quickly and efficiently. Experimental results show that the proposed algorithm speeds up the convergence rate, strengthens the stability of clustering, and achieves a remarkable data-filling effect.
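The scheme described in this abstract can be sketched roughly as follows. The data set, cluster count, missingness rate, and the use of scikit-learn's `KMeans` and `GaussianMixture` are illustrative assumptions, not details taken from the paper: K-means centers seed the EM fit, and missing entries are re-imputed from the fitted mixture on each pass.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Two well-separated Gaussian clusters (illustrative data)
X_true = np.vstack([rng.normal(0.0, 0.5, (100, 2)),
                    rng.normal(5.0, 0.5, (100, 2))])

# Knock out ~10% of entries to simulate incomplete data
X = X_true.copy()
mask = rng.random(X.shape) < 0.10
X[mask] = np.nan

# Start from global-mean imputation, then alternate:
# K-means -> EM (Gaussian mixture) -> re-impute with the
# assigned component's mean
X_filled = np.where(mask, np.nanmean(X, axis=0), X)
for _ in range(5):
    km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X_filled)
    gmm = GaussianMixture(n_components=2,
                          means_init=km.cluster_centers_,  # K-means seeds EM
                          random_state=0).fit(X_filled)
    comp = gmm.predict(X_filled)            # most likely component per row
    X_filled = np.where(mask, gmm.means_[comp], X)
```

On data like this, imputing from the assigned component's mean recovers the masked entries far better than a single global-mean fill, which is the intuition behind using the clustering structure to drive the imputation.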
The EM algorithm is a popular method for parameter estimation in situations where the data can be viewed as incomplete. Because each E-step visits every data point on a given iteration, the EM algorithm requires considerable computation time when applied to large data sets. Two variants, the incremental EM (IEM) algorithm and a sparse version of the EM algorithm, were proposed by Neal R.M. and Hinton G.E. (in Jordan M.I. (Ed.), Learning in Graphical Models, Kluwer, Dordrecht, 1998, pp. 355-368) to reduce the computational cost of applying the EM algorithm. With the IEM algorithm, the available n observations are divided into B (B ≤ n) blocks, and the E-step is implemented for only one block of observations at a time before the next M-step is performed. With the sparse version of the EM algorithm for fitting mixture models, only those posterior probabilities of component membership that are above a specified threshold are updated; the remaining component-posterior probabilities are held fixed. In this paper, simulations are performed to assess the relative performance of the IEM algorithm with various numbers of blocks against the standard EM algorithm. In particular, we propose a simple rule for choosing the number of blocks for the IEM algorithm. For the IEM algorithm in the extreme case of one observation per block, we provide efficient updating formulas that avoid the direct calculation of the inverses and determinants of the component-covariance matrices. Moreover, a sparse version of the IEM algorithm (SPIEM) is formulated by combining the sparse E-step of the EM algorithm with the partial E-step of the IEM algorithm. This SPIEM algorithm further reduces the computation time of the IEM algorithm.
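The block-wise IEM idea can be illustrated with a minimal sketch for a one-dimensional two-component Gaussian mixture; the data, block count B, and starting values here are illustrative assumptions, not the paper's simulation setup. Each partial E-step recomputes responsibilities for one block only and swaps that block's contribution into running sufficient statistics, after which an M-step updates the parameters:

```python
import numpy as np

rng = np.random.default_rng(1)
# Two-component 1-D Gaussian mixture data (illustrative)
x = np.concatenate([rng.normal(-2.0, 1.0, 150), rng.normal(3.0, 1.0, 150)])
n, K, B = len(x), 2, 5
blocks = np.array_split(np.arange(n), B)   # the B blocks of observations

w = np.full(K, 1.0 / K)                    # mixing weights
mu = np.array([-1.0, 1.0])                 # initial component means
var = np.array([1.0, 1.0])                 # initial component variances

def resp(xi):
    """Posterior component-membership probabilities (E-step quantities)."""
    dens = np.exp(-0.5 * (xi[:, None] - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)
    p = w * dens
    return p / p.sum(axis=1, keepdims=True)

# One full E-step to initialise the sufficient statistics
R = resp(x)                                # n x K responsibilities
S0 = R.sum(axis=0)                         # sum of responsibilities
S1 = (R * x[:, None]).sum(axis=0)          # weighted sum of x
S2 = (R * x[:, None] ** 2).sum(axis=0)     # weighted sum of x^2

for sweep in range(10):
    for idx in blocks:
        xb = x[idx]
        Rb = resp(xb)                      # partial E-step: this block only
        # Swap the block's old contribution for the new one
        S0 += Rb.sum(axis=0) - R[idx].sum(axis=0)
        S1 += (Rb * xb[:, None]).sum(axis=0) - (R[idx] * xb[:, None]).sum(axis=0)
        S2 += (Rb * xb[:, None] ** 2).sum(axis=0) - (R[idx] * xb[:, None] ** 2).sum(axis=0)
        R[idx] = Rb
        # M-step after every block, using the up-to-date statistics
        w, mu, var = S0 / n, S1 / S0, S2 / S0 - (S1 / S0) ** 2
```

A sparse SPIEM-style variant would additionally freeze responsibilities below a threshold inside the partial E-step; that refinement is omitted from this sketch.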