检索结果-内蒙古大学图书馆

Fast and Secure distributed nonnegative matrix factorization

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 2022年第2期34卷 653-666页

作者： Qian, Yuqiu Tan, Conghui Ding, Danhao Li, Hui Mamoulis, Nikos Tencent Shenzhen 518057 Guangdong Peoples R China WeBank Shenzhen 518000 Guangdong Peoples R China Univ Hong Kong Dept Comp Sci Hong Kong Peoples R China Xiamen Univ Sch Informat Xiamen 361005 Fujian Peoples R China Univ Ioannina Dept Comp Sci & Engn Ioannina 45110 Epirus Greece

nonnegative matrix factorization (NMF) has been successfully applied in several data mining tasks. Recently, there is an increasing interest in the acceleration of NMF, due to its high cost on large matrices. On the other hand, the privacy issue of NMF over federated data is worthy of attention, since NMF is prevalently applied in image and text analysis which may involve leveraging privacy data (e.g, medical image and record) across several parties (e.g., hospitals). In this paper, we study the acceleration and security problems of distributed NMF. First, we propose a distributed sketched alternating nonnegative least squares(DSANLS) framework for NMF, which utilizes a matrix sketching technique to reduce the size of nonnegative least squares subproblems with a convergence guarantee. For the second problem, we show that DSANLS with modification can be adapted to the security setting, but only for one or limited iterations. Consequently, we propose four efficient distributed NMF methods in both synchronous and asynchronous settings with a security guarantee. We conduct extensive experiments on several real datasets to show the superiority of our proposed methods. The implementation of our methods is available at https://***/qianyuqiu79/DSANLS.

关键词： Servers distributed databases Linear systems Convergence Data privacy Manganese Elbow distributed nonnegative matrix factorization matrix sketching privacy

来源：评论

学校读者我要写书评

暂无评论

distributed nonnegative matrix factorization with HALS Algorithm on Apache Spark 17th

Distributed Nonnegative Matrix Factorization with HALS Algor...

引用

17th International Conference on Artificial Intelligence and Soft Computing (ICAISC)

作者： Fonal, Krzysztof Zdunek, Rafal Wroclaw Univ Technol Dept Elect Wybrzeze Wyspianskiego 27 PL-50370 Wroclaw Poland

ISBN: (纸本)9783319912622;9783319912615

nonnegative matrix factorization (NMF) is a commonlyused unsupervised learning method for extracting parts-based features and dimensionality reduction from nonnegative data. Many computational algorithms exist for updating the latent nonnegative factors in NMF. In this study, we propose an extension of the Hierarchical Alternating Least Squares (HALS) algorithm to a distributed version using the state-of-the-art framework - Apache Spark. Spark gains its popularity among other distributed computational frameworks because of its in-memory approach which works much faster than well-known Apache Hadoop. The scalability and efficiency of the proposed algorithm is confirmed in the numerical experiments, performed on real data as well as synthetic ones.

关键词： distributed nonnegative matrix factorization Large-scale NMF HALS algorithm Spark Recommendation systems

来源：评论

学校读者我要写书评

暂无评论

distributed nonnegative matrix factorization with HALS Algorithm on MapReduce 1

引用

17th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP)

作者： Zdunek, Rafal Fonal, Krzysztof Wroclaw Univ Sci & Technol Dept Elect Wybrzeze Wyspianskiego 27 PL-50370 Wroclaw Poland

ISBN: (数字)9783319654829

ISBN: (纸本)9783319654829;9783319654812

nonnegative matrix factorization (NMF) is a commonly used method in machine learning and data analysis for feature extraction and dimensionality reduction of nonnegative data. Recently, we observe its increasing popularity in processing massive data, and advances in developing various distributed algorithms for NMF. In the paper, we propose a computational strategy for implementation of the Hierarchical Alternating Least Squares (HALS) algorithm using the MapReduce programming paradigm. Due to this approach, the scalable HALS NMF, which can be implemented on parallel and distributed computer architectures, is obtained. The scalability and efficiency of the proposed algorithm is confirmed in the numerical experiments, performed on largescale synthetic and recommendation system datasets.

关键词： distributed nonnegative matrix factorization Large-scale NMF HALS algorithm Mapreduce paradigm Recommendation systems

来源：评论

学校读者我要写书评

暂无评论

distributed geometric nonnegative matrix factorization and hierarchical alternating least squares-based nonnegative tensor factorization with the MapReduce paradigm

引用

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2018年第17期30卷

作者： Zdunek, Rafal Fonal, Krzysztof Wroclaw Univ Sci & Technol Dept Elect Wybrzeze Wyspianskiego 27 PL-50370 Wroclaw Poland

nonnegative matrix factorization and its multilinear extension known as nonnegative tensor factorization are commonly used methods in machine learning and data analysis for feature extraction and dimensionality reduction for nonnegative high-dimensional data. Dimensionality reduction for massive amounts of data usually involves distributed computation across multi-node computer architectures. In this study, we propose various computational strategies for parallel and distributed computation of the latent factors in both factorization models, all of which are based on partitioning the computational tasks according to the MapReduce paradigm. We extend the previously reported distributed hierarchical alternating least squares algorithm to the multi-way array factorization model, where we assume that the observed multi-way data can be partitioned into chunks along one mode. Moreover, we propose a new geometry-based distributed computational strategy for solving nonnegative matrix factorization problems. Numerical experiments performed using various large-scale data sets demonstrated that these algorithms are efficient and robust to noisy data.

关键词： distributed nonnegative matrix factorization distributed nonnegative tensor factorization geometric nonnegative matrix factorization hierarchical alternating least squares algorithm MapReduce paradigm

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：