The proceedings contain 48 papers. The topics discussed include: reconfigurable platform for digital convergence terminals;European research in embedded systems;interface overheads in embedded multimedia software;auto...
详细信息
ISBN:
(纸本)3540364102
The proceedings contain 48 papers. The topics discussed include: reconfigurable platform for digital convergence terminals;European research in embedded systems;interface overheads in embedded multimedia software;automated distribution of UML 2.0 designed applications to a configurable multiprocessor platform;key research challenges for successfully applying MDD within real-time embedded software development;domain-specific modeling of power aware distributed real-time embedded systems;mining dynamic document spaces with massively parallel embedded processors;designing wireless sensor nodes;design, implementation, and experiment on outdoor deployment of wireless sensor network for environmental monitoring;energy-driven partitioning of signal processing algorithms in sensor networks;security in wireless sensor networks: considerations and experiments;on security of PAN wireless systems;and code size reduction by compiler tuning.
A language for semi-structured documents, XML has emerged as the core of the web services architecture, and is playing crucial roles in messaging systems, databases, and document processing. However, the processing of...
详细信息
ISBN:
(纸本)9781424403431
A language for semi-structured documents, XML has emerged as the core of the web services architecture, and is playing crucial roles in messaging systems, databases, and document processing. However, the processing of XML documents has a reputation for poor performance, and a number of optimizations have been developed to address this performance problem from different perspectives, none of which have been entirely satisfactory. In this paper, we present a seemingly quixotic, but novel approach: parallel XML parsing. parallel XML parsing leverages the growing prevalence of multicore architectures in all sectors of the computer market, and yields significant performance improvements. This paper presents our design and implementation of parallel XML parsing. Our design consists of an initial preparsing phase to determine the structure of the XML document, followed by a full, parallel parse. The results of the preparsing phase are used to help partition the XML document for data parallel processing. Our parallel parsing phase is a modification of the libxml2 [1] XML parser, which shows that our approach applies to real-world, production quality parsers. Our empirical study shows our parallel XML parsing algorithm can improved the XML parsing performance significantly and scales well.
The development of scalable parallel database systems requires the design of efficient algorithms for the join operation which is the most frequent and expensive operation in relational database systems. The join is a...
详细信息
ISBN:
(纸本)3540285660
The development of scalable parallel database systems requires the design of efficient algorithms for the join operation which is the most frequent and expensive operation in relational database systems. The join is also the most vulnerable operation to data skew and to the high cost of communication in distributed architectures. In this paper, we present a new parallel algorithm for join and multi-join operations on distributed architectures based on an efficient semi-join computation technique. This algorithm is proved to have optimal complexity and deterministic perfect load balancing. Its tradeoff between balancing overhead and speedup is analyzed using the BSP cost model which predicts a negligible join product skew and a linear speed-up. This algorithm improves our fa-join and sfa-join algorithms by reducing their communication and synchronization cost to a minimum while offering the same load balancing properties even for highly skewed data.
A hybrid adder-based distributed arithmetic (DA) architecture targeting a reconfigurable System-on-Chip (rSoC) platform has been presented. The work exemplifies hardware comparisons of three DA based discrete cosine t...
详细信息
This paper first identifies some of the key concerns about the techniques and algorithms developed for parallel model checking;specifically, the inherent problem with load balancing and large queue sizes resultant in ...
详细信息
The proceedings contain 33 papers from the Language and Compilers for High Performance Computing: 17th internationalworkshop, LCPC 2004, Revised Selected Papers. The topics discussed include: an overview of the open ...
详细信息
The proceedings contain 33 papers from the Language and Compilers for High Performance Computing: 17th internationalworkshop, LCPC 2004, Revised Selected Papers. The topics discussed include: an overview of the open research compiler;phase-based miss rate prediction across program inputs;speculative subword register allocation in embedded processors;implementation of parallel numerical algorithms using hierarchically tiled arrays;a geometric approach for partitioning n-dimensional non-rectangular iteration spaces;the use of traces for inlining in java programs;compiling high-level languages for vector architectures;and applying loop optimizations to object-oriented abstractions through general classification of array semantics.
A new program graph structuring algorithm for dynamically reconfigurable multi-processor systems based on the look-ahead dynamic connection reconfiguration is presented. This architectural model enables elimination of...
详细信息
ISBN:
(纸本)0769522106
A new program graph structuring algorithm for dynamically reconfigurable multi-processor systems based on the look-ahead dynamic connection reconfiguration is presented. This architectural model enables elimination of connection reconfiguration time overheads. It consists in preparing link connections in advance in redundant connection switches in parallel with program execution. An application program is partitioned into sections, which are executed using such connections. parallel program scheduling in this environment incorporates graph partitioning problem. The new algorithm is based on list scheduling and a new iterative clustering heuristics for graph partitioning. The experimental results are presented, which compare performance of several graph partitioning heuristics for such environment.
In this paper we present our joint efforts towards the development of a parallel version of the GNU Scientific Library for heterogeneous systems. Two well-known operations arising in discrete mathematics and sparse li...
详细信息
ISBN:
(纸本)0769522106
In this paper we present our joint efforts towards the development of a parallel version of the GNU Scientific Library for heterogeneous systems. Two well-known operations arising in discrete mathematics and sparse linear algebra allow us to describe the design and the implementation of the library, and to report experimental results on heterogeneous clusters of personal computers.
There are two main approaches for designing parallel language. The first approach states that parallel computing demands new programming concepts and radical intellectual changes regarding the way we think about progr...
详细信息
ISBN:
(纸本)0769522106
There are two main approaches for designing parallel language. The first approach states that parallel computing demands new programming concepts and radical intellectual changes regarding the way we think about programming, as compared to sequential computing. Therefore, the design of such a parallel language must present new constructs and new programming methodologies. The second approach states that there is no need to reinvent the wheel, and serial languages can be extended to support parallelism. The motivation behind this approach is to keep the language as friendly as possible for the programmer who is the main bridge toward wider acceptance of the new language. In this paper we present a qualitative evaluation of two contemporary parallel languages: OpenMP-C and Unified parallel C (UPC). Both are explicit parallel programming languages based on the ANSI C standard. OpenMP-C was designed for shared-memory architectures and extends the base-language by using compiler directives that annotate the original source-code. On the other hand, UPC was designed for distribute-shared memory architectures and extends the base-language by new parallel constructs. We deconstruct each parallel language into its basic components, show examples, make a detailed analysis, compare them, and finally draw some conclusions.
The application fields of bytecode virtual machines and VLIW processors overlap in the area of embedded and mobile systems, where the two technologies offer different benefits, namely high code portability, low power ...
详细信息
暂无评论