Using a cluster of PCs or workstations or the likes (called nodes) to implement the database server can bring us two great benefits: high scalability and parallelprocessing capability Before such a database server ca...
详细信息
ISBN:
(纸本)0769517609
Using a cluster of PCs or workstations or the likes (called nodes) to implement the database server can bring us two great benefits: high scalability and parallelprocessing capability Before such a database server can be put into actual use, however two problems have to be solved. the one is how we cope withthe data-skew since it can degrade the system performance significantly. the other is how a node is connected to or disconnected from a database server without affecting the users. One general solution to both problems is to redistribute the data. Unfortunately, this would take the data off line for a long time. In fact, numerous applications such as that for reservations, finance, process control, hospitals, police, and armed forces cannot afford the off-line data for any significant amount of time. In this paper we address the subject of balancing data load on line, i.e., balancing data load concurrently with users' reading and writing of the database. Main contributions are an effective approach for this purpose and a comprehensive performance study of the possible alternatives.
the objective of the parallelism-independent (PI) scheduling is minimization of the completion time of a parallel application for any number of processing elements in the computing system. We propose several paralleli...
详细信息
this paper presents an efficient implementation of a VLSI high speed parallel multiplier using the Radix_4 modified Booth algorithm and the Wallace Tree structure the design is structured for a n x in multiplication w...
详细信息
ISBN:
(纸本)0780375785
this paper presents an efficient implementation of a VLSI high speed parallel multiplier using the Radix_4 modified Booth algorithm and the Wallace Tree structure the design is structured for a n x in multiplication where n can reach up to 126 bits. the Wallace Tree structure serves to compress the partial product term by a ratio of 3:2 111 To enhance the speed of operation, carry-look-ahead(CLA) adders are used which is independent on the number of bits of the two operands[2]. An efficient VHDL code was written and successfully simulated and synthesised using Altera's MaxplusII(10.0) and ModelSim3.4 CAD tools.
We study two classical connectivity-preserving parallel shrinking algorithms proposed to recognize and label two-dimensional connected components of binary images. the algorithms we consider were developed by Beyer [R...
详细信息
We study two classical connectivity-preserving parallel shrinking algorithms proposed to recognize and label two-dimensional connected components of binary images. the algorithms we consider were developed by Beyer [Recognition of topological invariants by iterative arrays, Ph.D. thesis, MIT, 1969, p. 144] and Levialdi [Commun. ACM 15 (1) (1972) 7] independently for the purpose of shrinking 4-connected and 8-connected components of binary images in linear time, respectively. It is shown that those two independently developed algorithms are closely related and in a sense they are in a dual relation such that, for any initially given binary image and its inverted one, one algorithm produces, simultaneously, an image which is dual of the one produced by the other, step-by-step. (C) 2002 Elsevier Science B.V. All rights reserved.
Power dissipation reduction is a stringent constraint in modern mobile devices. It can be obtained by supply voltage or frequency reduction, but a strong reduction of number of cycles for operation must be achieved. T...
详细信息
Power dissipation reduction is a stringent constraint in modern mobile devices. It can be obtained by supply voltage or frequency reduction, but a strong reduction of number of cycles for operation must be achieved. To this end, reconfigurable architectures are a valuable solution. In this paper a reconfigurable architecture is designed and successfully tested on GSM coding. An average reduction of 98.2% cycles for specific tasks and of 44.6% cycles for overall computation with respect to standard general purpose processors is obtained.
In this paper, the discrete state space recursive filters are implemented in the form of parallel array processors. the state space description permits the straightforward application of systolic architectures to real...
详细信息
In this paper, the discrete state space recursive filters are implemented in the form of parallel array processors. the state space description permits the straightforward application of systolic architectures to realize recursive filters of 1D and 2D types. We show that the recursivity inherent to the filtering algorithm introduces a latency proportional to the filter order. Moreover, we show that the use of the CTP decomposition technique together withthe cylindrical-type structures reduces significantly this latency and improves the computation throughput of these arrays. the processing cells of the systolic array are designed via switched-capacitor techniques.
parallelprocessing is a vital tool for many scientific and industrial applications where real time constraints apply; in many applications the use of parallelprocessing and multiprocessor platforms seems to be the f...
详细信息
parallelprocessing is a vital tool for many scientific and industrial applications where real time constraints apply; in many applications the use of parallelprocessing and multiprocessor platforms seems to be the favourable solution for achieving acceptable throughput. Hence parallelprocessingalgorithms are vital tools to achieve a good trade off between hardware cost, system efficiency and power. In this paper, the one-dimensional generalised parallel block filter algorithm based on the overlap-add approach is implemented on multi-DSPs platform. the mathematical concept of the input stage, output stage and the generalised direct filter equation are given. Also the 1-D parallel algorithm is shown and a suitable parallel architecture is presented.
this paper describes a parallel divide-and-conquer-algorithm for Delaunay triangulation. this algorithm finds the affected zone that cover the triangulations that may be modified during the merge of two sub-block tria...
详细信息
In heterogeneous sub-surface environments, the evaluation of GPR sections is complicated by the influence of near-field effects, antenna radiation patterns, velocity variations and surveying inconsistencies. Section i...
详细信息
ISBN:
(纸本)0819445223
In heterogeneous sub-surface environments, the evaluation of GPR sections is complicated by the influence of near-field effects, antenna radiation patterns, velocity variations and surveying inconsistencies. Section interpretation can be exceedingly difficult, even with advanced processing methods, and therefore mathematical modelling has become an increasingly popular addition to traditional techniques. the Finite-Difference Time-Domain method (FDTD) is the most common, but to be of practical use the modelling scheme must incorporate realistic antenna configurations, complex sub-surface geometries and accurate material property descriptions. these additional components add computational complexity to the models and, at present, most single processor FDTD schemes are only capable of modelling relatively basic three-dimensional data sets in practical time scales. Modem parallel computing techniques have the potential to overcome these limitations by spreading the computational demand across a number of processors (or individual PC's). these PC 'cluster' machines provide the necessary computational power required to model more complex GPR problems in realistic time-scales. Consequently, the scope and run-time of current GPR FDTD modelling applications can be improved making them an accessible and affordable aid to GPR interpretation.
this paper describes an implementation of a novel line removal Hough transform on a new parallel architectural system, referred to as a hybrid system. the algorithm, which is used for detecting lines in an image, stri...
详细信息
this paper describes an implementation of a novel line removal Hough transform on a new parallel architectural system, referred to as a hybrid system. the algorithm, which is used for detecting lines in an image, strips away major line so that minor lines can be more easily detectable. the algorithm was subsequently implemented on the hybrid system. A hybrid system is a combination of SIMD and MIMD system processingthe data in parallel. In this paper, we also introduce a new SIMD concept.
暂无评论