检索结果-内蒙古大学图书馆

Approximation Schemes for Low-rank binary matrix Approximation Problems

ACM TRANSACTIONS ON ALGORITHMS 2020年第1期16卷 12-12页

作者： Fomin, Fedor, V Golovach, Petr A. Lokshtanov, Daniel Panolan, Fahad Saurabh, Saket Univ Bergen Dept Informat PB 7803 N-5020 Bergen Norway Univ Calif Santa Barbara Dept Comp Sci Santa Barbara CA 93106 USA IIT Hyderabad Dept Comp Sci & Engn Kandi 502285 Sangareddy India HBNI Inst Math Sci 4th CrossStCIT Campus Chennai 600113 Tamil Nadu India

We provide a randomized linear time approximation scheme for a generic problem about clustering of binary vectors subject to additional constraints. The new constrained clustering problem generalizes a number of problems and by solving it, we obtain the first linear time-approximation schemes for a number of well-studied fundamental problems concerning clustering of binary vectors and low-rank approximation of binary matrices. Among the problems solvable by our approach are Low GF(2)-RANK APPROXIMATION, Low BOOLEAN-RANK APPROXIMATION, and various versions of binary CLUSTERING. For example, for Low GF(2)-RANK APPROXIMATION problem, where for an m x n binary matrix A and integer r > 0, we seek for a binary matrix B of GF(2) rank at most r such that the l(0)-norm of matrix A - B is minimum, our algorithm, for any epsilon > 0 in time f (r, epsilon) . n . m, where f is some computable function, outputs a (1 + epsilon)-approximate solution with probability at least (1 - 1/e). This is the first linear time approximation scheme for these problems. We -7 also give (deterministic) PTASes for these problems running in time n(f(r)()1/)(epsilon 2)( log 1/epsilon), where f is some function depending on the problem. Our algorithm for the constrained clustering problem is based on a novel sampling lemma, which is interesting on its own.

关键词： binary matrix factorization clustering approximation scheme random sampling

来源：评论

学校读者我要写书评

暂无评论

LIBMF: a library for parallel matrix factorization in shared-memory systems

The Journal of Machine Learning Research

引用

The Journal of Machine Learning Research 2016年第1期17卷

作者： Kevin Murphy Bernhard Schölkopf Wei-Sheng Chin Bo-Wen Yuan Meng-Yuan Yang Yong Zhuang Yu-Chin Juan Chih-Jen Lin Google MPI for Intelligent Systems Department of Computer Science National Taiwan University Taipei Taiwan

matrix factorization (MF) plays a key role in many applications such as recommender systems and computer vision, but MF may take long running time for handling large matrices commonly seen in the big data era. Many parallel techniques have been proposed to reduce the running time, but few parallel MF packages are available. Therefore, we present an open source library, LIBMF, based on recent advances of parallel MF for shared-memory systems. LIBMF includes easy-to-use command-line tools, interfaces to C/C++ languages, and comprehensive documentation. Our experiments demonstrate that LIBMF outperforms state of the art packages. LIBMF is BSD-licensed, so users can freely use, modify, and redistribute the code.

关键词： adaptive learning rate binary matrix factorization logistic matrix factorization matrix factorization non-negative matrix factorization one-class matrix factorization parallel computation stochastic gradient method

来源：评论

学校读者我要写书评

暂无评论

Minimum-overlap Clusterings and the Sparsity of Overcomplete Decompositions of binary Matrices

引用

Procedia Computer Science 2015年 51卷 2967-2971页

作者： Victor Mireles Tim O.F. Conrad Mathematics and Computer Science Department - Freie Universiẗat Berlin International Max Planck Research School for Computational Biology and Scientific Computing

Given a set of n binary data points, a widely used technique is to group its features into k clusters (e.g. [7] ). In the case where n < k , the question of how overlapping are the clusters becomes of interest. In this paper we approach the question through matrix decomposition, and relate the degree of overlap with the sparsity of one of the resulting matrices. We present analytical results regarding bounds on this sparsity, and a heuristic to estimate the minimum amount of overlap that an exact grouping of features into k clusters must have. As shown below, adding new data will not alter this minimum amount of overlap.

关键词： binary matrix factorization Overcomplete decompositions Feature clustering

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：