Low latency image processing and high FPS (frames per second) is significant for high resolution decision making in many object recognition applications. Reading frames in between processing of a video is too slow and...
详细信息
ISBN:
(纸本)9781538651636
Low latency image processing and high FPS (frames per second) is significant for high resolution decision making in many object recognition applications. Reading frames in between processing of a video is too slow and sluggish as the corresponding reading and decoding the frames are done in the main processingthread. Packages, such as imutils provide such off-the-shelf image processing algorithms which apply multi-threading to achieve low latency. However, the algorithms are unable to perform computationally expensive image processing operations. In this paper, we apply a parallelprocessing technique based on co-incident multi-threading to decrease the latency for computationally expensive cases. the technique is evaluated using a prototype of smart car to show that FPS rate is increased and time complexity of algorithms is reduced by an order of n.
A parallel simulated annealing method, IIP, is applied to the n-queen problem. By this method, identical multiple copies of the single process algorithm are independently run in parallel. this technique gives superlin...
详细信息
Refined descriptive sampling is designed to improve upon the descriptive sampling method for experimentation in simulation. the former reduces significantly the risk of sampling bias generated by descriptive sampling ...
详细信息
ISBN:
(纸本)9789077381489
Refined descriptive sampling is designed to improve upon the descriptive sampling method for experimentation in simulation. the former reduces significantly the risk of sampling bias generated by descriptive sampling and eliminates its problem related to the sample size. In this paper, we propose an optimal parallel Monte Carlo simulation algorithm using refined descriptive sampling and evaluate in parallel architecture, performance measures of a Pert network and the Newsboy problem.
In this work MASIPE, a tool for monitoring parallel applications, is presented. MASIPE is a distributed tool that gives support to user-defined mobile agents, including functionalities for creating and transferring th...
详细信息
ISBN:
(纸本)9783540681052
In this work MASIPE, a tool for monitoring parallel applications, is presented. MASIPE is a distributed tool that gives support to user-defined mobile agents, including functionalities for creating and transferring these agents through different compute nodes. In each node, the mobile agent can access the node information as well as the memory space of the parallel program that is being monitored. In addition, MASIPE includes functionalities for managing and graphically displaying the agent data. In this work, its internal structure is detailed and an example of a monitored scientific application is shown. We also perform a study of the MASIPE requirements (in terms of CPU and memory) and we evaluate its overhead during the program execution. Experimental results show that MASIPE can be efficiently used with minimum impact on the program performance.
the key technology of realizing Synthetic Aperture Radar (SAR) imaging is real-time processing of echo signal and a lot of data memory. However, it is very difficult to finish the work of SAR imaging system with singl...
详细信息
ISBN:
(纸本)7121002159
the key technology of realizing Synthetic Aperture Radar (SAR) imaging is real-time processing of echo signal and a lot of data memory. However, it is very difficult to finish the work of SAR imaging system with single DSP chip. In order to resolve those problems. the DSP parallelprocessing system was designed. ADSP2106x is a high-speed and real-time digital signal processor that was produced by AD Company. Using the Links port of ADSP2106x, the DSP parallelprocessing structure was devised. In the meantime, SAR imaging processing structure and the performance and characteristics of ADSP2106x were introduced. besides, the advantages and disadvantages of these methods were described. DSP parallelprocessing system can satisfy the real-time requirement of the SAR imaging signal processing and has widely applied expectation in radar, communication and image processing etc fields.
this paper describes and compares two algorithms for the Ordered Successive Interference Cancellation (OS IQ decoding procedure proposed in V-BLAST wireless MIMO systems. they are based on algorithms that solve the Re...
详细信息
ISBN:
(纸本)9783540681052
this paper describes and compares two algorithms for the Ordered Successive Interference Cancellation (OS IQ decoding procedure proposed in V-BLAST wireless MIMO systems. they are based on algorithms that solve the Recursive Least Squares (RLS) problem, and are derived from the square root version of the Kalman Filter and the square root version of the Information Filter, respectively. For OSIC decoding, the latter is a novel and faster solution than the former because a matrix multiplication and some rotation applications are avoided. the algorithm has been formulated as a block algorithm, observing an optimum block size that minimizes execution time. It has been parallelized as a pipeline, and so is very efficient.
Skeleton based libraries are considered one of the alternatives to reduce the distance between end users and parallel architectures. Algorithmic skeletons are based in general procedures describing the method to be im...
详细信息
ISBN:
(纸本)9783540681052
Skeleton based libraries are considered one of the alternatives to reduce the distance between end users and parallel architectures. Algorithmic skeletons are based in general procedures describing the method to be implemented. Although a gap between general formalizations for dynamic programming and software components can be found, we develop a skeleton tool for dynamic programming problems. the design strategy is general enough to consider a wide rage of dynamic programming recurrences. As usual in skeleton approaches, the parallelism is provided in a transparent manner, so that, sequential users may access to the system. A set of tests problems representative of different classes of dynamic programming formulations has been used to validate the distributed memory implementation on an IBM-SP.
A language for semi-structured documents, XML has emerged as the core of the web services architecture, and is playing crucial roles in messaging systems, databases, and document processing. However, the processing of...
详细信息
ISBN:
(纸本)9781424403431
A language for semi-structured documents, XML has emerged as the core of the web services architecture, and is playing crucial roles in messaging systems, databases, and document processing. However, the processing of XML documents has a reputation for poor performance, and a number of optimizations have been developed to address this performance problem from different perspectives, none of which have been entirely satisfactory. In this paper, we present a seemingly quixotic, but novel approach: parallel XML parsing. parallel XML parsing leverages the growing prevalence of multicore architectures in all sectors of the computer market, and yields significant performance improvements. this paper presents our design and implementation of parallel XML parsing. Our design consists of an initial preparsing phase to determine the structure of the XML document, followed by a full, parallel parse. the results of the preparsing phase are used to help partition the XML document for data parallelprocessing. Our parallel parsing phase is a modification of the libxml2 [1] XML parser, which shows that our approach applies to real-world, production quality parsers. Our empirical study shows our parallel XML parsing algorithm can improved the XML parsing performance significantly and scales well.
this work deals withthe efficient numerical solution of nonlinear transient flow problems posed on two-dimensional porous media of general geometry. We first consider a spatial semidiscretization of such problems by ...
详细信息
ISBN:
(纸本)9783540681052
this work deals withthe efficient numerical solution of nonlinear transient flow problems posed on two-dimensional porous media of general geometry. We first consider a spatial semidiscretization of such problems by using a cell-centered finite difference scheme on a logically rectangular grid. the resulting nonlinear stiff initial-value problems are then integrated in time by means of a fractional step method, combined with a decomposition of the flow domain into a set of overlapping sub-domains and a linearization procedure which involves suitable Taylor expansions. the proposed algorithm reduces the original problem to the solution of several linear systems per time step. Moreover, each one of such systems can be directly decomposed into a set of uncoupled linear subsystems which can be solved in parallel. A numerical example illustrates the unconditionally convergent behaviour of the method in the last section of the paper.
For efficient execution of parallel irregular computations, dynamic load balancing must, be applied. If the computational work is associated with data sets, which must be separately processed by an algorithm, then loa...
详细信息
ISBN:
(纸本)9783540681052
For efficient execution of parallel irregular computations, dynamic load balancing must, be applied. If the computational work is associated with data sets, which must be separately processed by an algorithm, then load balancing can be performed most efficiently by transfering the data sets between processes using application level messages. Such a situation exists in parallel branch and bound (B&B) computations. A parallel B&B algorithm has been implemented in a novel parallel programming environment. this environment facilitates an infrastructure for parallel application control. Application consistent global states are continuously monitored. Control decisions are taken based on the monitored states and the decisions are communicated to the application processes. this infrastructure has been used for load balancing strategy implementation in parallel B&B computations. An analysis of the characteristics of the control infrastructure and the application resulted in a choice of a global load balancing strategy working with many simple and small steps executed frequently. Experiments have shown, that this strategy works well. the chosen strategy is much more efficient (shortening the application runtime by more than 3 times), if the prediction of the results of an already taken load balancing decision is used for subsequent load balancing decisions.
暂无评论