We consider the problem of learning an unknown large-margin halfspace in the context of parallel computation, giving both positive and negative results. As our main positive result, we give a parallel algorithm for learning a large-margin halfspace, based on an algorithm of Nesterov's that performs gradient descent with a momentum term. We show that this algorithm can learn an unknown γ-margin halfspace over n dimensions using n · poly(1/γ) processors and running in time Õ(1/γ) + O(log n). In contrast, naive parallel algorithms that learn a γ-margin halfspace in time that depends polylogarithmically on n have an inverse quadratic running time dependence on the margin parameter γ. Our negative result deals with boosting, which is a standard approach to learning large-margin halfspaces. We prove that in the original PAC framework, in which a weak learning algorithm is provided as an oracle that is called by the booster, boosting cannot be parallelized. More precisely, we show that, if the algorithm is allowed to call the weak learner multiple times in parallel within a single boosting stage, this ability does not reduce the overall number of successive stages of boosting needed for learning by even a single stage. Our proof is information-theoretic and does not rely on unproven assumptions.
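The core positive result builds on gradient descent with a momentum term. As a minimal sketch of that idea (not the paper's analyzed algorithm), the following learns a separating halfspace by minimizing a smooth logistic surrogate with Nesterov-style look-ahead momentum; all hyperparameter values here are illustrative assumptions:

```python
import numpy as np

def nesterov_halfspace(X, y, lr=0.1, momentum=0.9, epochs=200):
    """Sketch: learn a halfspace normal w by minimizing a smooth logistic
    surrogate of the 0/1 loss with Nesterov-style momentum. Hyperparameters
    are illustrative, not the values analyzed in the paper."""
    n, d = X.shape
    w = np.zeros(d)
    v = np.zeros(d)  # momentum (velocity) term
    for _ in range(epochs):
        # gradient of the logistic loss evaluated at the look-ahead point w + momentum*v
        z = y * (X @ (w + momentum * v))
        grad = -(X * (y / (1.0 + np.exp(z)))[:, None]).mean(axis=0)
        v = momentum * v - lr * grad
        w = w + v
    return w
```

The gradient step itself is embarrassingly parallel across the n examples, which is where the processor count in the abstract's bound enters.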
Self-organizing maps (SOMs) have become popular for tasks in data visualization, pattern classification, and natural language processing, and can be seen as one of the major concepts for artificial neural networks of today. Their general idea is to approximate a high-dimensional and previously unknown input distribution by a lower-dimensional neural network structure, with the goal of modeling the topology of the input space as closely as possible. Classical SOMs read the input values in random but sequential order, one by one, and thus adjust the network structure over space: the network is built while reading larger and larger parts of the input. In contrast to this approach, we present a SOM that processes the whole input in parallel and organizes itself over time. The main reason for parallel input processing lies in the fact that knowledge can be used to recognize parts of patterns in the input space that have already been learned. This way, networks can be developed that do not reorganize their structure from scratch every time a new set of input vectors is presented, but rather adjust their internal architecture in accordance with previous mappings. One basic application could be a modeling of the whole-part relationship through layered architectures. (c) 2004 Elsevier B.V. All rights reserved.
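For contrast with the parallel variant the abstract proposes, the classical sequential SOM it describes can be sketched as follows: inputs are read one by one, and the best-matching unit plus its grid neighborhood are pulled toward each sample. Grid size, decay schedules, and all parameter values are illustrative assumptions, not the paper's settings:

```python
import numpy as np

def som_train(data, grid_h=8, grid_w=8, epochs=10, lr0=0.5, sigma0=3.0, seed=0):
    """Sketch of the classical sequential SOM: one sample at a time,
    best-matching unit (BMU) search, Gaussian neighborhood update."""
    rng = np.random.default_rng(seed)
    d = data.shape[1]
    weights = rng.random((grid_h, grid_w, d))
    # grid coordinates, used to measure neighborhood distance on the map
    ys, xs = np.mgrid[0:grid_h, 0:grid_w]
    coords = np.stack([ys, xs], axis=-1).astype(float)
    steps = epochs * len(data)
    t = 0
    for _ in range(epochs):
        for x in rng.permutation(data):  # random but sequential order
            lr = lr0 * (1 - t / steps)            # decaying learning rate
            sigma = sigma0 * (1 - t / steps) + 0.5  # shrinking neighborhood
            # best-matching unit: grid cell whose weight is closest to x
            dists = np.linalg.norm(weights - x, axis=2)
            bmu = np.unravel_index(np.argmin(dists), dists.shape)
            # Gaussian neighborhood on the grid, centered at the BMU
            g = np.exp(-np.sum((coords - np.array(bmu)) ** 2, axis=2)
                       / (2 * sigma ** 2))
            weights += lr * g[..., None] * (x - weights)
            t += 1
    return weights
```

The proposed parallel SOM replaces the inner per-sample loop with a single update over the whole input set, organizing the map over time rather than over the input sequence.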
ISBN:
(Print) 0889865078
The self-organizing map (SOM) is a common methodology used to capture and represent data patterns, and it plays an increasingly significant role in the development of neural networks. The primary objective of an SOM is to determine an approximate representation of data with an unknown probability distribution, from a multi-dimensional input space, using a lower-dimensional neural network. The approximation by the network corresponds to the topological structure inherent in the data distribution. The classical SOM, and many of its variations such as the growing grid, construct the network based on randomly selected pieces of the input space, where the number of pieces increases over time. We give an overview of a parallel algorithm for the SOM (ParaSOM), which alternatively examines the entire input in each step, leading to a more accurate representation of input patterns after only a fraction of the iterations, albeit requiring significantly more time. Both growing grid and ParaSOM, unlike the classical SOM, do not maintain a fixed number of neurons. Instead, their networks may grow and increase in density to match the input space. We present a comparison of results generated by implementations of ParaSOM and growing grid, making apparent their considerable performance differences despite having the growth feature in common.
Efficient parallel learning algorithms are proposed for training a powerful modular neural network, the hierarchical mixture of experts (HME). The parallelizations are based on the concept of modular parallelism, i.e. parallel execution of network modules. From modeling the speed-up as a function of the number of processors and the number of training examples, several improvements are derived, such as pipelining the training examples by packets. The theoretical models are accurate when compared to experimental measurements. For regular topologies, an analysis of the models shows that the parallel algorithms are highly scalable when the size of the experts grows from linear units to multi-layer perceptrons (MLPs). These results are confirmed experimentally, achieving near-linear speedups for HME-MLP. Although this work can be viewed as a case study in the parallelization of HME neural networks, both the algorithms and the theoretical models can be extended to different learning rules or less regular tree architectures. (C) 2002 Elsevier Science B.V. All rights reserved.
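The modular parallelism the abstract describes can be illustrated with a minimal sketch, assuming a one-level mixture of experts with linear experts standing in for MLPs: each expert module runs on its own worker, and a softmax gate combines the outputs. All names and the linear-expert choice are hypothetical, not the paper's implementation:

```python
import numpy as np
from concurrent.futures import ThreadPoolExecutor

def hme_forward(x, experts, gate_w):
    """Sketch of modular parallelism: each expert (a weight matrix here,
    standing in for an MLP module) is evaluated on its own worker, then
    the results are blended by a softmax gating network."""
    with ThreadPoolExecutor(max_workers=len(experts)) as pool:
        outs = list(pool.map(lambda W: W @ x, experts))  # parallel expert execution
    g = gate_w @ x
    g = np.exp(g - g.max())
    g /= g.sum()  # softmax gating weights, one per expert
    return sum(gi * oi for gi, oi in zip(g, outs))
```

In a full HME this structure nests: each expert may itself be a gated subtree, and the packet-pipelining improvement amounts to streaming batches of training examples through the modules so workers stay busy.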