ISBN (print): 9780769536422
Graphics processing units (GPUs) are powerful computational devices tailored to the needs of the 3-D gaming industry for high-performance, real-time graphics engines. Nvidia Corporation released a new generation of GPUs designed for general-purpose computing in 2006, and it released a GPU programming language called CUDA in 2007. DNA microarray technology is a high-throughput tool for assaying mRNA abundance in cell samples. In data analysis, scientists often apply hierarchical clustering of the genes, where a fundamental operation is to calculate all pairwise distances. If there are n genes, this takes O(n^2) time. In this work, GPUs and the CUDA language are used to calculate pairwise distances. For Manhattan distance, GPU/CUDA achieves a 40 to 90 times speed-up compared to the central processing unit implementation; for the Pearson correlation coefficient, the speed-up is 28 to 38 times.
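As a rough illustration of the core operation this abstract describes (not the paper's CUDA kernels), the all-pairs computation over n gene-expression rows can be sketched in plain Python; the nested pairwise loop is the O(n^2) work the GPU parallelizes. The data and function names here are illustrative only.

```python
import math

def manhattan(a, b):
    """Manhattan (L1) distance between two expression profiles."""
    return sum(abs(x - y) for x, y in zip(a, b))

def pearson(a, b):
    """Pearson correlation coefficient between two expression profiles."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    sa = math.sqrt(sum((x - ma) ** 2 for x in a))
    sb = math.sqrt(sum((y - mb) ** 2 for y in b))
    return cov / (sa * sb)

def pairwise(rows, measure):
    """All-pairs matrix: the O(n^2) step that the paper moves onto the GPU."""
    n = len(rows)
    return [[measure(rows[i], rows[j]) for j in range(n)] for i in range(n)]

genes = [[1.0, 2.0, 3.0],   # toy expression profiles, one row per gene
         [2.0, 4.0, 6.0],
         [3.0, 2.0, 1.0]]
D = pairwise(genes, manhattan)  # D[0][1] == |1-2| + |2-4| + |3-6| == 6
R = pairwise(genes, pearson)    # R[0][1] == 1.0, R[0][2] == -1.0
```

On a GPU each (i, j) cell of the matrix is an independent task, which is why the operation parallelizes so well.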
ISBN (print): 9783319780542; 9783319780535
Computations based on graphs are very common problems, but their complexity, the increasing size of the analyzed graphs, and the large amount of communication involved make this analysis a challenging task. In this paper, we present a comparison of two parallel BFS (Breadth-First Search) implementations: MapReduce run on Hadoop infrastructure, and the PGAS (Partitioned Global Address Space) model. The latter implementation has been developed with the help of PCJ (Parallel Computations in Java), a library for parallel and distributed computations in Java. Both implementations realize the level-synchronous strategy: the Hadoop algorithm assumes iterative MapReduce jobs, whereas PCJ uses explicit synchronization after each level. The scalability of both solutions is similar. However, the PCJ implementation is much faster (about 100 times) than the MapReduce Hadoop solution.
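A minimal serial sketch of the level-synchronous strategy both implementations share may help: the whole frontier is expanded, then everyone synchronizes before the next level begins. This is not the Hadoop or PCJ code, just the shared idea; the toy graph is illustrative.

```python
def level_synchronous_bfs(adj, source):
    """Level-synchronous BFS: expand the entire frontier, then synchronize.

    Each iteration of the while-loop corresponds to one MapReduce job in the
    Hadoop version, or one explicit barrier in the PCJ version.
    """
    level = {source: 0}
    frontier = [source]
    depth = 0
    while frontier:
        depth += 1
        next_frontier = []
        for u in frontier:        # in the parallel versions this loop is
            for v in adj[u]:      # partitioned across workers
                if v not in level:
                    level[v] = depth
                    next_frontier.append(v)
        frontier = next_frontier  # "barrier": all workers finished this level
    return level

adj = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2, 4], 4: [3]}
levels = level_synchronous_bfs(adj, 0)  # {0: 0, 1: 1, 2: 1, 3: 2, 4: 3}
```

The barrier after each level is exactly where the two systems differ in cost: Hadoop pays a full job startup per level, while PCJ pays only a lightweight synchronization, which is consistent with the reported speed difference.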
ISBN (print): 9781424451081
Motivated by a peer-to-peer estimation algorithm in which adaptive weights are optimized to minimize the estimation error variance, we formulate and solve a novel non-convex Lipschitz optimization problem that guarantees global stability of a large class of peer-to-peer consensus-based algorithms for wireless sensor networks. Because of packet losses, the solution of this optimization problem cannot be achieved efficiently with either traditional centralized methods or distributed Lagrangian message passing. We prove that the optimal solution can be obtained by solving a set of nonlinear equations. A fast distributed algorithm, which requires only local computations, is presented for solving these equations. Analysis and computer simulations illustrate the algorithm and its application to various network topologies.
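For orientation, a generic consensus iteration of the kind this class of algorithms builds on can be sketched as below. This toy version uses a single fixed step size and models neither packet losses nor the optimized adaptive weights that are the paper's actual contribution; the graph and names are illustrative.

```python
def consensus_step(x, neighbors, eps):
    """One synchronous consensus iteration: x_i += eps * sum_j (x_j - x_i).

    Converges to the average of the initial values when eps is smaller than
    1 / (maximum node degree); choosing such weights optimally under packet
    loss is the (much harder) problem the paper solves.
    """
    return [xi + eps * sum(x[j] - xi for j in neighbors[i])
            for i, xi in enumerate(x)]

neighbors = {0: [1], 1: [0, 2], 2: [1]}  # a 3-node path graph
x = [0.0, 3.0, 6.0]                      # initial local estimates
for _ in range(200):
    x = consensus_step(x, neighbors, eps=0.3)
# every node converges to the average of the initial values, 3.0
```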
This paper introduces a new PARAFAC algorithm for a class of third-order tensors. Particularly, the proposed algorithm is based on subspace estimation and solving a non-symmetrical joint diagonalization problem. To de...
We propose a new and low per-iteration complexity first-order primal-dual optimization framework for a convex optimization template with broad applications. Our analysis relies on a novel combination of three classic ideas applied to the primal-dual gap function: smoothing, acceleration, and homotopy. The algorithms due to the new approach achieve the best-known convergence rate results, in particular when the template consists only of nonsmooth functions. We also outline a restart strategy for the acceleration to significantly enhance the practical performance. We demonstrate relations with the augmented Lagrangian method and show how to exploit strongly convex objectives with rigorous convergence rate guarantees. We provide representative examples to illustrate that the new methods can outperform the state of the art, including the Chambolle-Pock and alternating direction method of multipliers algorithms. We also compare our algorithms with the well-known Nesterov smoothing method.
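Two of the three ingredients, smoothing and homotopy, can be shown in a one-dimensional cartoon: replace a nonsmooth function by its Nesterov/Moreau smoothing (here the Huber function for |x|), run gradient descent on the smooth surrogate, and shrink the smoothing parameter over time. This is only an illustration of those generic ideas, not the paper's primal-dual framework.

```python
def smoothed_abs_grad(x, mu):
    """Gradient of the Nesterov/Moreau smoothing of |x| (the Huber function):
    quadratic of curvature 1/mu near zero, slope +/-1 elsewhere."""
    return x / mu if abs(x) <= mu else (1.0 if x > 0 else -1.0)

# Minimize the nonsmooth f(x) = |x - 2| by gradient descent on its smoothing.
# The smoothed gradient is (1/mu)-Lipschitz, so step size mu is safe;
# decreasing mu across the outer loop is the "homotopy" ingredient.
x = 10.0
for mu in (1.0, 0.1, 0.01):
    for _ in range(100):
        x -= mu * smoothed_abs_grad(x - 2.0, mu)
# x ends at the true minimizer, 2.0
```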
We present a simulation system which meets the requirements for practical application of inverse modeling in a professional environment. A tool interface for the integration of arbitrary simulation tools at the user level is introduced and a methodology for the formation of simulation networks is described. A Levenberg-Marquardt optimizer automates the inverse modeling procedure. Strategies for the efficient execution of simulation tools are discussed. An example demonstrates the extraction of doping profile information on the basis of electrical measurements.
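The damped Gauss-Newton iteration at the heart of a Levenberg-Marquardt optimizer can be sketched in a few lines for a one-parameter model. This is a generic textbook sketch, not the paper's optimizer; the exponential model and all names are illustrative.

```python
import math

def levenberg_marquardt_1d(residuals, jacobian, b0, iters=50):
    """Minimal one-parameter Levenberg-Marquardt loop: a damped Gauss-Newton
    step, with the damping lam adapted by accept/reject."""
    b, lam = b0, 1e-3
    def cost(b):
        return sum(r * r for r in residuals(b))
    for _ in range(iters):
        r = residuals(b)
        J = jacobian(b)
        JtJ = sum(j * j for j in J)
        Jtr = sum(j * ri for j, ri in zip(J, r))
        step = -Jtr / (JtJ + lam)          # damped Gauss-Newton step
        if cost(b + step) < cost(b):
            b, lam = b + step, lam / 10.0  # accept: trust the quadratic model
        else:
            lam *= 10.0                    # reject: fall back toward gradient descent
    return b

# Fit y = exp(b * t) to data generated with the true value b = 0.5.
t = [0.0, 1.0, 2.0, 3.0]
y = [math.exp(0.5 * ti) for ti in t]
residuals = lambda b: [math.exp(b * ti) - yi for ti, yi in zip(t, y)]
jacobian = lambda b: [ti * math.exp(b * ti) for ti in t]
b_hat = levenberg_marquardt_1d(residuals, jacobian, b0=0.0)
```

In the inverse-modeling setting described above, evaluating `residuals` means running the external simulation tools, which is why efficient execution strategies matter.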
This paper, the third one in a three-paper sequence, presents the results of TOSSIM simulation of a Hopfield neural network acting as a static optimizer, configured to solve the maximum independent set (MIS) problem using a wireless sensor network as a fully parallel and distributed computing hardware platform. TinyOS with its default protocol stack, along with nesC, was used to develop the simulation model. Simulations were realized for mote counts of 10, 50, 100, and 182; messaging complexity, memory, and simulation time costs were measured. The most prominent finding was that the neural optimization algorithm was able to compute solutions to the MIS problem. The memory footprint of the TOSSIM process in a Windows XP environment was about 20 MB for the range of sensor networks considered. The messaging complexity, as measured by the total number of messages transmitted, and the simulation time both increased rather quickly, indicating a need to optimize and tune certain aspects of the simulation environment if wireless sensor networks with higher mote counts are to be simulated.
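A standard Hopfield-style formulation of MIS, sketched serially below, may clarify what each mote computes. The energy is E(v) = -sum_i v_i + penalty * sum over edges of v_i * v_j; each asynchronous update flips a neuron to whatever lowers E, and with penalty > 1 the fixed points are maximal independent sets. This is a generic textbook formulation run serially, not the paper's TinyOS/nesC implementation, where each neuron runs on its own mote.

```python
def hopfield_mis(adj, penalty=2.0, sweeps=20):
    """Discrete Hopfield-style search for an independent set: each neuron
    turns on (gain 1) unless an active neighbor makes the penalty exceed it."""
    n = len(adj)
    v = [0] * n
    for _ in range(sweeps):
        changed = False
        for i in range(n):
            # local energy test: activating i lowers E iff penalty * (number
            # of active neighbors) < 1
            new_vi = 1 if penalty * sum(v[j] for j in adj[i]) < 1 else 0
            if new_vi != v[i]:
                v[i], changed = new_vi, True
        if not changed:
            break  # fixed point reached: a maximal independent set
    return [i for i in range(n) if v[i]]

# 5-cycle: the maximum independent set has size 2
adj = {0: [1, 4], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [0, 3]}
mis = hopfield_mis(adj)  # [0, 2] for this sweep order
```

In the sensor-network setting, the "active neighbor" sum is exactly the information exchanged in the radio messages whose count the paper measures.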