检索结果-内蒙古大学图书馆

parallel algorithms for solving linear systems with block-tridiagonal matrices on multi-core CPU with GPU

JOURNAL OF COMPUTATIONAL SCIENCE 2012年第6期3卷 445-449页

作者： Akimova, Elena N. Belousov, Dmitry V. RAS Ural Branch Inst Math & Mech Moscow Russia

For solving systems of linear algebraic equations with block-tridiagonal matrices arising in geoelectrics problems, the parallel matrix sweep algorithm, conjugate gradient method with preconditioner, and square root method are proposed and implemented numerically on multi-core CPU Intel with graphics processors NVIDIA. Investigation of efficiency and optimization of parallel algorithms for solving the problem with quasi-model data are performed. Crown Copyright (C) 2012 Published by Elsevier B.V. All rights reserved.

关键词： parallel algorithms Block-tridiagonal SLAE Direct and iterative numerical methods Multi-core CPU and graphics processors NVIDIA

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms FOR MODELLING TWO-DIMENSIONAL NON-EQUILIBRIUM SALT TRANSFER PROCESSES ON THE BASE OF FRACTIONAL DERIVATIVE MODEL

引用

FRACTIONAL CALCULUS AND APPLIED ANALYSIS 2018年第3期21卷 654-671页

作者： Bohaienko, Vsevolod Natl Acad Sci Ukraine VM Glushkov Inst Cybernet Acad Glushkov Ave 40 UA-03187 Kiev Ukraine

Modelling of salt transfer processes in fractal structured media has been considered on the base of fractional derivative equations with Caputo-Gerasimov derivatives with respect to space variables. Initial-boundary problem has been solved using locally one-dimensional finite difference scheme. Procedure of fractional derivative approximation has been proposed to lower computational complexity of solution process. parallel algorithms for distributed memory systems and GPU have been considered. Analysis of using one-dimensional and red-black data partitioning schemes is presented and new parametric scheme which have better characteristics in the determined conditions has been proposed.

关键词： parallel algorithms GPU fractional order equations approximated solutions finite difference schemes salt transfer

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for reducing derivation time of distinguishing experiments for nondeterministic finite state machines

引用

INTERNATIONAL JOURNAL OF parallel EMERGENT AND DISTRIBUTED SYSTEMS 2018年第2期33卷 197-210页

作者： El-Fakih, Khaled Barlas, Gerassimos Ali, Mustafa Yevtushenko, Nina Amer Univ Sharjah Dept Comp Sci & Engn Sharjah U Arab Emirates Tomsk State Univ Dept Informat Technol Tomsk Russia

Many approaches have been proposed for deriving tests from finite state machine (FSM) specifications with respect to some established coverage criteria. A fundamental core problem in FSM-based testing relates to the derivation of input sequences that can distinguish states of an FSM specification, aka distinguishing sequences. A major effort in the construction of these sequences is based on the derivation of a successors search-tree labeled by sets of pairs of states of the given machine. We aim at reducing the time associated with such constructions through the use of state-of-the-art parallel technologies. Namely, we propose a parallel algorithm that we implement and evaluate on multicore CPUs and on many-core GPUs. We evaluate two alternative GPU implementations that use the CUDA and Thrust software platforms and a network of workstations based solution. The latter sports a workload partitioning based on Divisible Load Theory. A rigorous set of experiments highlights the differences of the proposed implementations in terms of execution time and speedup. [GRAPHICS] We aim at reducing the time associated with the construction of the successors of all state pairs of a given non-deterministic finite state machine. We propose a parallel algorithm that we implement and evaluate on multicore CPUs and on many-core GPUs. We evaluate two alternative GPU implementations that use the CUDA and Thrust software platforms. Additionally, we propose and evaluate a Network of Workstations solution based on Divisible Load Theory. A rigorous set of experiments highlights the differences of the proposed implementations in terms of execution time and speedup.

关键词： Conformance testing distinguishing experiments nondeterministic finite state machines divisible load theory parallel algorithms CUDA Thrust network of workstations

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms FOR CONTOUR EXTRACTION AND CODING ON AN EREW PRAM COMPUTER

引用

PATTERN RECOGNITION LETTERS 1990年第2期11卷 87-93页

作者： DINSTEIN, IH LANDAU, GM POLYTECH INST NEW YORK DEPT COMP SCIBROOKLYNNY 11201

A parallel approach to contour extraction and coding on an Exclusive Read Exclusive Write (EREW) parallel Random Access Machine (PRAM) is presented and analyzed. The algorithm is intended for binary images. The labeled contours can be represented by lists of coordinates, and/or chain codes, and/or any other user designed codes. Using O( n 2 /log n ) processors, the algorithm runs in O(log n ) time, where, n by n is the size of the processed binary image.

关键词： Image contour analysis image shape analysis parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for LQ optimal control of discrete-time periodic linear systems

引用

JOURNAL OF parallel AND DISTRIBUTED COMPUTING 2002年第2期62卷 306-325页

作者： Benner, P Byers, R Mayo, R Quintana-Ortí, ES Hernández, V Univ Bremen Fachbereich Math & Informat 3 Zentrum Technomath D-28334 Bremen Germany Univ Kansas Dept Math Lawrence KS 66045 USA Univ Jaume 1 Dept Ingn & Ciencia Comp Castellon de La Plana 12080 Spain Univ Politecn Valencia Dept Sistemas Informat & Computac E-46071 Valencia Spain

This paper analyzes the performance of two parallel algorithms for solving the linear-quadratic optimal control problem arising in discrete-time periodic linear systems. The algorithms perform a sequence of orthogonal reordering transformations on formal matrix products associated with the periodic linear system and then employ the so-called matrix disk function to solve the resulting discrete-time periodic algebraic Riccati equations needed to determine the optimal periodic feedback. We parallelize these solvers using two different approaches, based on a coarse-grain and a medium-grain distribution of the computational load. The experimental results report the high performance and scalability of the parallel algorithms on a Beowulf cluster. (C) 2002 Elsevier Science (USA).

关键词： parallel algorithms LINEAR systems

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms FOR ADDITION AND MULTIPLICATION ON PROCESSOR ARRAYS WITH RECONFIGURABLE BUS SYSTEMS

引用

INFORMATION PROCESSING LETTERS 1993年第2期46卷 89-94页

作者： THANGAVEL, P MUTHUSWAMY, VP Department of Mathematics Bharathidasan University Tiruchirapalli 620 024 India

Binary addition and multiplication problems are very important as their time dominates computation time of any scientific or engineering problem. Simple algorithms are presented for these 2 problems which take only O(1) time and O(log n) time on a linear PARBS and n x 2n-PARBS respectively, in which each processor has only a constant number of gates and registers. It is believed that these algorithms could be an efficient design for the implementation of an adder and multiplier circuit in a single VLSI chip.

关键词： BINARY ADDITION MULTIPLICATION parallel algorithms RECONFIGURABLE BUS

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms FOR THE ROUNDING EXACT SUMMATION OF FLOATING POINT NUMBERS

引用

COMPUTING 1982年第2期28卷 89-104页

作者： LEUPRECHT, H OBERAIGNER, W Institut für Informatik Universität Innsbruck Tschurtschenthalerstrasse 5/II A-6020 Innsbruck Austria

Pichat and Bohlender studied an algorithm for the rounding exact summation of floating point numbers which can be executed on any floating point arithmetic unit. We propose parallel versions of this algorithm, namely a pipeline version, an algorithm similar to the exchange methods for sorting and a tree-like algorithm, associating a tree to the sum. For all these algorithms we discuss the properties, a multiprocessor architecture should have for an efficient implementation of an algorithm without restricting us to a special architecture.

关键词： parallel algorithms computer arithmetic

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for Hamiltonian problems on quasi-threshold graphs

引用

JOURNAL OF parallel AND DISTRIBUTED COMPUTING 2004年第1期64卷 48-67页

作者： Nikolopoulos, SD Univ Ioannina Dept Comp Sci GR-45110 Ioannina Greece

In this paper we show structural and algorithmic properties on the class of quasi-threshold graphs, or QT-graphs for short, and prove necessary and sufficient conditions for a QT-graph to be Hamiltonian. Based on these properties and conditions, we construct an efficient parallel algorithm for finding a Hamiltonian cycle in a QT-graph;for an input graph on n vertices and in edges, our algorithm takes O(log n) time and requires O(n + m) processors on the CREW PRAM model. In addition, we show that the problem of recognizing whether a QT-graph is a Hamiltonian graph and the problem of computing the Hamiltonian completion number of a nonHamiltonian QT-graph can also be solved in O(log n) time with O(n + in) processors. Our algorithms rely on O(log n)-time parallel algorithms, which we develop here, for constructing tree representations of a QT-graph;we show that a QT-graph G has a unique tree representation, that is, a tree structure which meets the structural properties of G. We also present parallel algorithms for other optimization problems on QT-graphs which run in O(log n) time using a linear number of processors. (C) 2003 Elsevier Inc. All rights reserved.

关键词： parallel algorithms quasi-threshold graphs recognition tree representation Hamiltonian cycles Hamiltonian completion number complexity

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for robot path planning with simpler VLSI architecture

引用

INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY 2006年第3期26卷 157-163页

作者： Arock, Michael Ponalagusamy, R. Natl Inst Technol Dept Comp Applicat Tiruchirappalli 620015 Tamil Nadu India Natl Inst Technol Dept Math Tiruchirappalli 620015 Tamil Nadu India

This paper proposes a parallel algorithm for robot path planning on a linear array with a reconfigurable pipelined bus system (LARPBS) through the construction of a Voronoi diagram on a binary image of the workspace. The algorithm is based on a d(4) distance metric, and it does not incur any additional time or processor requirements compared with those of a previously reported proposal (Tzionas et al., 1997). This paper recommends the same model as the simpler VLSI architecture for the problem in question.

关键词： linear array with are configurable pipelined bus system parallel algorithms robot path planning Voronoi diagram VLSI architecture

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for evaluation of directional Boolean derivatives of multivalued logic functions

引用

CYBERNETICS AND SYSTEMS ANALYSIS 1996年第6期32卷 777-793页

作者： Levashenko, VG Shmerko, VP Yanushkevich, SN

Processing of logical data always requires special software and hardware tools. This is attributable to the specific features of the mathematical apparatus of the algebra of logical functions. Organization of parallel logical computation on the basis of the symbolic mathematical apparatus leads to complex logic programs. A different approach is proposed in this article. It is based on the matrix apparatus. Its use enables us to synthesize parallel and structurally homogeneous algorithms for the evaluation of directional logical derivatives of multivalued logic functions and implement their evaluation using standard matrixalgebra software or homogeneous computing systems. Homogeneous computing systems substantially accelerate the processing speed and can be built using VLSI technology. The operation graphs of the proposed algorithms have the same configuration as the graphs of fast algorithms used in digital signal processing. This result makes it possible to use well-tried standard procedures of digital signal processing, which involve mapping of algorithms into homogeneous computing structures and hardware-software architectures.

关键词： parallel algorithms PROCESSING SPEED DIRECTIONAL proprietary software parallel Lines Function Logic functions logical Heterogeneous multivalued logic

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：