This paper discusses the parallel solution of symmetric eigenproblems, including both standard and generalized eigenvalue problems. For standard eigenvalue problem and tridiagonal eigenvalue problem is not the key point...
ISBN: (print) 0769515126
Creating portable and automatically scalable parallel software has been a goal for researchers and practitioners since the advent of parallel computing. In this paper we present a programming methodology that reduces parallel programming complexity while creating portable and automatically scalable parallel software. To support this methodology, two separate tools have been developed: the PARSA Software Development Environment and an accompanying thread manager. The development environment addresses programming issues through an object-based graphical programming methodology that automatically transforms a project into portable and scalable source code. The generated source code calls the user-level thread manager, which manages the run-time execution of the parallel software. Two sample applications containing various forms of parallelism have been developed and compiled on three different systems with diverse native threading mechanisms to demonstrate portability. Finally, automatic scalability is demonstrated with the run-time performance of the applications on multiprocessor systems.
As a classical image segmentation method in mathematical morphology, the watershed transform has been applied successfully in fields such as remote sensing image processing, biomedical and computer vision appli...
In this paper some implicit domain decomposition procedures for solving parabolic problems are proposed. In these methods, the classic implicit scheme is used in each sub-domain, and Dirichlet boundary values at the (...
ISBN: (print) 0780375963
Parallel processing is a vital tool for many scientific and industrial applications where real-time constraints apply; in many applications, parallel processing on multiprocessor platforms appears to be the favourable solution for achieving acceptable throughput. Hence parallel processing algorithms are vital tools for achieving a good trade-off between hardware cost, system efficiency and power. In this paper, the one-dimensional generalised parallel block filter algorithm based on the overlap-add approach is implemented on a multi-DSP platform. The mathematical concept of the input stage, the output stage and the generalised direct filter equation are given. The 1-D parallel algorithm is also shown and a suitable parallel architecture is presented.
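The overlap-add approach named in this abstract can be illustrated with a minimal sketch. This is the generic textbook overlap-add scheme, not the paper's generalised parallel block filter or its multi-DSP mapping; the function names `convolve` and `overlap_add` and the sample data are hypothetical. Each block is convolved independently of the others, which is the property that would let blocks be assigned to separate processors.

```python
def convolve(x, h):
    """Direct linear convolution, used here as the reference result."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def overlap_add(x, h, block_len):
    """Filter x with FIR taps h by convolving fixed-size blocks independently."""
    if block_len < 1:
        raise ValueError("block_len must be positive")
    if not x or not h:
        return []
    y = [0.0] * (len(x) + len(h) - 1)
    for start in range(0, len(x), block_len):
        block = x[start:start + block_len]
        # Each partial result depends only on its own block and the taps,
        # so this step could run on a separate processor per block.
        partial = convolve(block, h)
        # The tail of each partial result overlaps the next block's head
        # and is summed into the shared output.
        for k, value in enumerate(partial):
            y[start + k] += value
    return y


# Hypothetical sample data for illustration only.
signal = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
taps = [1.0, -1.0, 0.5]
blocked = overlap_add(signal, taps, block_len=3)
direct = convolve(signal, taps)
```

In this sketch `blocked` and `direct` agree element by element, because linear convolution distributes over the sum of the zero-padded blocks.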
ISBN: (print) 0769515126
Our new architecture, known as the Scheduled DataFlow (SDF) system, deviates from the current trend of building complex hardware to exploit Instruction Level Parallelism (ILP) by exploring a simpler yet powerful execution paradigm based on dataflow, multithreading and the decoupling of memory accesses from execution. A program is partitioned into non-blocking threads. In addition, all memory accesses are decoupled from the thread's execution. Data is pre-loaded into the thread's context (registers), and all results are post-stored after the completion of the thread's execution. Even though multithreading and decoupling are possible with a control-flow architecture, the non-blocking and functional nature of the SDF system makes it easier to coordinate the memory accesses and execution of a thread. In this paper we show some recent improvements to the SDF implementation, whereby threads exchange data directly in register contexts, thus eliminating the need to create thread frames. Thus it is now possible to explore the scalability of our architecture's performance when more register contexts are included on the chip.
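The three-phase thread structure described in this abstract (pre-load, execute, post-store) can be sketched in software as follows. This is only an illustrative model under stated assumptions, not the SDF hardware or its instruction set: the names `SdfThread` and `run_thread`, the dictionary standing in for memory, and the example computation are all hypothetical. It also does not model the direct register-context exchange that the paper introduces; results here still pass through memory.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class SdfThread:
    """A hypothetical non-blocking thread with decoupled memory accesses."""
    inputs: List[str]              # memory locations read during pre-load
    compute: Callable[..., float]  # pure function of the register values
    output: str                    # memory location written during post-store


def run_thread(thread: SdfThread, memory: Dict[str, float]) -> None:
    # Pre-load: every memory read happens before execution begins.
    registers = [memory[address] for address in thread.inputs]
    # Execute: touches only the register values, so it never waits on memory.
    result = thread.compute(*registers)
    # Post-store: the result is written back only after execution completes.
    memory[thread.output] = result


# Hypothetical example: the second thread consumes the first thread's result.
memory = {"a": 3.0, "b": 4.0}
threads = [
    SdfThread(["a", "b"], lambda a, b: a * a + b * b, "sum_sq"),
    SdfThread(["sum_sq"], lambda s: s ** 0.5, "norm"),
]
for thread in threads:
    run_thread(thread, memory)
```

Because `compute` receives all of its operands up front and returns a single result, the execute phase has no memory dependencies to block on, which is the property the abstract attributes to non-blocking threads.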
ISBN: (print) 0769514413
Various calculations on matrices and vectors are used in many digital signal processing systems. Although these calculations simply repeat multiplication and addition, the repeated processing is usually heavy. Therefore, parallel processing is necessary to perform them at high speed. Although in a digital LSI this raises the further issue of a large circuit area, the proposed analog circuit can realize multiplication and addition simultaneously with a simple structure that arranges capacitors in a matrix form. Furthermore, a vowel speech recognition system is designed using this circuit.
ISBN: (print) 3540440496
In recent years multimedia technology has emerged as a key technology, mainly because of its ability to represent information in disparate forms as a bit-stream. This enables everything from text to video and sound to be stored, processed, and delivered in digital form. A great part of the research community's current effort has emphasized the delivery of the data as an important issue of multimedia technology. However, the creation, processing, and management of multimedia forms are the issues most likely to dominate scientific interest in the long run. The aim of dealing with information coming from video, text, and sound will result in a data explosion. This requirement to store, process, and manage large data sets naturally leads to the consideration of programmable parallel processing systems as strong candidates for supporting and enabling multimedia technology. This fact, taken together with the inherent data parallelism in these data types, makes multimedia computing a natural application area for parallel and distributed processing. In addition, the concepts developed for parallel and distributed algorithms are quite useful for the implementation of distributed multimedia systems and applications. Thus, the adaptation of these methods for distributed multimedia systems is an interesting topic to be studied.
A data-driven formant-type TTS system is proposed. The formant-type speech synthesizer is one of the most promising architectures for enabling flexible control of various voice qualities. By applying the ARX-based speech analysis method, source and formant parameters are obtained automatically. It is shown that a TTS system can be built using these parameters, without requiring any heuristic rules to control vocal tract characteristics.
ISBN: (print) 981238121X
In this paper, a high-accuracy Analog Random Access Memory (ARAM) for programmable vision chips is addressed. In the context of large retinas, the main sources of error are analyzed. Several architectures are presented and compared. The design of an improved ARAM is detailed.