In order to run Computational Fluid Dynamics (CFD) codes on large scale infrastructures, parallel computing has to be used because of the computational intensive nature of the problems. In this paper we investigate th...
详细信息
ISBN:
(纸本)9783319495835;9783319495828
In order to run Computational Fluid Dynamics (CFD) codes on large scale infrastructures, parallel computing has to be used because of the computational intensive nature of the problems. In this paper we investigate the ADAPT platform where we couple flow Partial Differential Equations and a Poisson equation. this leads to a linear system which we solve using direct methods. the implementation deals withthe MUMPS parallel multi-frontal direct solver and mesh partitioning methods using METIS to improve the performance of the framework. We also investigate, in this paper, how the mesh partitioning methods are able to optimize the mesh cell distribution for the ADAPT solver. the experience gained in this paper facilitates the move to a Service Oriented view of ADAPT as future work.
An approach to very rapid and real-time ellipse detection is described, Based on a modified RANSAC (RANdom Sample Consensus), and considering the parallelprocessing capabilities of a programmable graphics processing ...
详细信息
After the emergence of the new High Efficiency Video Coding standard, several strategies have been followed in order to take advantage of the parallel features available in it. Many of the parallelization approaches i...
详细信息
ISBN:
(纸本)9783319495835;9783319495828
After the emergence of the new High Efficiency Video Coding standard, several strategies have been followed in order to take advantage of the parallel features available in it. Many of the parallelization approaches in the literature have been performed in the decoder side, aiming at achieving real-time decoding. However, the most complex part of the HEVC codec is the encoding side. In this paper, we perform a comparative analysis of two parallelization proposals. One of them is based on tiles, employing shared memory architectures and the other one is based on Groups Of Pictures, employing distributed shared memory architectures. the results show that good speed-ups are obtained for the tile-based proposal, especially for high resolution video sequences, but the scalability decreases for low resolution video sequences. the GOP-based proposal outperforms the tile-based proposal when the number of processes increases. this benefit grows up when low resolution video sequences are compressed.
In this paper we describe the parallelization of two nearest neighbour classification algorithms. Nearest neighbour methods are well-known machine learning techniques. they have been successfully applied to Text Categ...
详细信息
ISBN:
(纸本)9783540744658
In this paper we describe the parallelization of two nearest neighbour classification algorithms. Nearest neighbour methods are well-known machine learning techniques. they have been successfully applied to Text Categorization task. Based on standard parallel techniques we propose two versions of each algorithm on message passing architectures. We also include experimental results on a cluster of personal computers using a large text collection. Our algorithms attempt to balance the load among the processors, they are portable, and obtain very good speedups and scalability.
For a long period in the development of computers and computing efficient applications were only characterized by computational - and memory complexity or in more practical terms elapsed computing time and required ma...
详细信息
ISBN:
(纸本)9783642400476
For a long period in the development of computers and computing efficient applications were only characterized by computational - and memory complexity or in more practical terms elapsed computing time and required main memory capacity. the history of Euro-Par and its predecessor-organizations stands for research on the development of ever more powerful computer architecturesthat shorten the compute time both by faster clocking and by parallel execution as well as the development of algorithmsthat can exhibit these parallel architectural features. the success of enhancing architectures and algorithms is best described by exponential curves regarding the peak computing power of architectures and the efficiency of algorithms. As microprocessor parts get more and more power hungry and electricity gets more and more expensive, "energy to solution" is a new optimization criterion for large applications. this calls for energy aware solutions.
the Pattern Matching with Swaps problem is a variation of the classical pattern matching problem in which a match is allowed to include disjoint local swaps. In 2009, Cantone and Faro devised a new dynamic programming...
详细信息
ISBN:
(纸本)9783642131189
the Pattern Matching with Swaps problem is a variation of the classical pattern matching problem in which a match is allowed to include disjoint local swaps. In 2009, Cantone and Faro devised a new dynamic programming algorithm for this problem that runs in time O(nm), where n is the length of the text and m is the length of the pattern. In this paper, first, we present an improved dynamic programming formulation of the approach of Cantone and Faro. then, we present an optimal parallelization of our algorithm, based on a linear array model, that runs in time O(m(2)) using [n/m-1] processors.
In recent years ″software engineering″ has emerged as a discipline of programming. It includes the conceptualization, design, implementation, testing and modification of software systems. Related issues are language...
详细信息
In recent years ″software engineering″ has emerged as a discipline of programming. It includes the conceptualization, design, implementation, testing and modification of software systems. Related issues are languages, standards, distributed and parallelprocessing, and total programming environments. the fact that more than 80 percent of system development costs were in software rather than hardware helps one appreciate the importance of any effort to understand and enhance the software production process. Many pattern recognition projects involve fairly large software efforts. It makes sense not only for researchers to make use of the latest software tools and methodologies but also to anticipate future changes.
Multishift QR, algorithms are efficient for solving the symmetric tridiagonal eigenvalue problem on a parallel computer. In this paper, we focus on three variants of the multishift QR. algorithm, namely, the conventio...
详细信息
ISBN:
(纸本)9783642131356
Multishift QR, algorithms are efficient for solving the symmetric tridiagonal eigenvalue problem on a parallel computer. In this paper, we focus on three variants of the multishift QR. algorithm, namely, the conventional multishift QR algorithm, the deferred shift QR, algorithm and the fully pipelined multishift QR, algorithm, and construct performance models for them. Our models are designed for shared-memory parallel machines, and given the basic performance characteristics of the target;machine and the problem size, predict the execution time of these algorithms. Experimental results show that our models can predict the relative performance of these algorithms to the accuracy of 10% in many cases. thus our models are useful for choosing the best algorithm to solve a given problem in a specified computational enviromnent, as well as for finding the best value of the performance parameters.
this paper describes an insertion opportunity for optical interconnection technology in embedded massively parallel processors for radar signal processing.
ISBN:
(纸本)0818685727
this paper describes an insertion opportunity for optical interconnection technology in embedded massively parallel processors for radar signal processing.
Software architecture is important to software development and it affects the cost and quality of software. Today the diverse Question Answering (QA) system is pervasive, except for simple architectures developed for ...
详细信息
ISBN:
(纸本)9780769528670
Software architecture is important to software development and it affects the cost and quality of software. Today the diverse Question Answering (QA) system is pervasive, except for simple architectures developed for industry use there is almost no universal software architecture which exists for question answering system, especially for research use. In this paper, we proposed a semantic pattern based user-interactive question answering system and new research-intended software architecture for the question answering system. Here the semantic pattern is used for question and answer analysis to improve the correctness of extracted answers. the architecture of the question answering system is a three dimensional one, with one dimension used for system optimization and experimentation. through the implementation of the architecture, we can clearly comprehend the principles and software architecture of our QA system.
暂无评论