This paper presents a methodology, which enables an optimized execution of FDTD (Finite Difference Time Domain) computations in a streaming model architecture (Cell BE processors). A dataflow graph that represents th...
详细信息
ISBN:
(纸本)9783642143892
This paper presents a methodology, which enables an optimized execution of FDTD (Finite Difference Time Domain) computations in a streaming model architecture (Cell BE processors). A dataflow graph that represents the FDTD computations in an irregular wave propagation area is transformed into a set of tiles. Each tile represents regular computations for a small part of a given computational area. Tiles are injected into computational nodes and processed in a pipe-like manner. It will be shown that such approach enables solving the FDTD problem with a speedup almost equal to the ideal one. Several computation optimization methods are presented. Efficiency of streaming computations for various simulation parameters is discussed. Experimental results obtained for the streaming model on a physical PS3 machine are presented as well.
Services are highly reusable, flexible and loosely coupled, which makes the evolution and the maintenance of composite services more complex. Evolution of BPEL composite service covers changes of processes, bindings a...
详细信息
ISBN:
(纸本)9781424486298
Services are highly reusable, flexible and loosely coupled, which makes the evolution and the maintenance of composite services more complex. Evolution of BPEL composite service covers changes of processes, bindings and interfaces. In this paper, an approach is proposed to select and generate test cases during the evolution of BPEL composite service. The approach identifies the changes by using control-flow analysis technique and comparing the paths in new composite service version and the old one using extensible BPEL flow graph (or XBFG). Message flow is appended to the control flow so that XBFG can describe the behavior of composite service integrally. The binding and predicate constraint information added in XBFG elements can be used in path selection and test case generation. Theory analysis and case study both show that the approach is effective, and test cases coverage rate is high for the changes of processes, bindings and interfaces.
This paper presents a method for concurrent BIST cost estimation during testable data path allocation. The method integrates testability in the design process and generates a distributed test controller that aims to m...
详细信息
ISBN:
(纸本)9781424453771
This paper presents a method for concurrent BIST cost estimation during testable data path allocation. The method integrates testability in the design process and generates a distributed test controller that aims to minimize area and power. The system has been implemented and favorable results are reported.
The impressive progress of Systems on Chip (SoC) design enables a revival of efficient massively parallel systems based on many Chip Multiprocessor (CM P) modules interconnected by global networks. The paper presents ...
详细信息
ISBN:
(纸本)9783642143892
The impressive progress of Systems on Chip (SoC) design enables a revival of efficient massively parallel systems based on many Chip Multiprocessor (CM P) modules interconnected by global networks. The paper presents methods for the optimized program execution control for such modular CMP systems. At the CMP module level, communication through shared memory is applied, improved by a novel efficient group communication mechanism (reads on, the fly). The inter module global communication is implemented as a message passing between module memories placed in a shared address space. A two-phase structuring algorithm is described for programs represented as macro data-flowgraphs. In the first phase, program tasks inside the CMP modules are scheduled, using an algorithm based on the notion of moldable tasks. In the next phase, the moldable task graph is structured for optimized communication execution in the global interconnection network according to the look-ahead link connection setting paradigm. Simulation experiments evaluate the efficiency and other properties of the proposed architectural solutions.
This paper describes an extension to the Taylor Expansion Diagrams (TED), called Timed TEDs, which makes it possible to represent sequential arithmetic datapaths. Timed TEDs enable register and clock period minimizati...
详细信息
ISBN:
(纸本)9781424478064
This paper describes an extension to the Taylor Expansion Diagrams (TED), called Timed TEDs, which makes it possible to represent sequential arithmetic datapaths. Timed TEDs enable register and clock period minimization while performing factorizations and common sub expression eliminations in the dataflow graph (DFG). Specifically, timed TEDs allow a wider range of retiming options as the computations in the DFG can be modified while performing retiming. In this paper we discuss the formalism of timed TEDs and the restrictions it imposes on the TED variable ordering.
Template-based task partitioning and scheduling can dramatically improve the scheduling efficiency and resource utilization in the coarse-grained reconfigurable system. However, when the number of templates is large, ...
详细信息
ISBN:
(纸本)9781424472369
Template-based task partitioning and scheduling can dramatically improve the scheduling efficiency and resource utilization in the coarse-grained reconfigurable system. However, when the number of templates is large, the effective organization and management of the templates is a hard problem. Here a template library based method is employed to address the above problem. Various templates are organized and managed within the database;an improved canonical labeling algorithm for DFG (dataflow graph) is provided to speed up the template insertion and query by avoiding numerous graph isomorphism operations;an automatic tool is also designed and implemented for the template execution on the RCA (reconfigurable cell array), including the effective methods for context words generation, storage and matching. Experiments are carried out and the results show that the provided method can effectively organize and manage the templates. Moreover, the experiments also indicate that the methods can speed up the template insertion and query operations dramatically.
The paper presents new methods for data communication for modular CMP systems interconnected by a global network. A special group communication called communication on the fly is applied between processor core data ca...
详细信息
Synchronous data flow graphs (SDFGs) are a useful tool for modeling and analyzing embedded dataflow applications, both in a single processor and a multiprocessing context or for application mapping on platforms. Thro...
详细信息
We give an overview of methods for modeling and system level design of mixed HW/SW/Analog systems. For abstract, functional modeling we combine Kahn Process Networks and Timed data flow graphs. In order to model concr...
详细信息
This paper proposes a new design methodology to reduce the logic complexity of reconfigurable multiplier block (ReMB). The minimization problem is modeled as a scheduled time-multiplexed dataflow graph (TDFG). To red...
详细信息
ISBN:
(纸本)9781424438273
This paper proposes a new design methodology to reduce the logic complexity of reconfigurable multiplier block (ReMB). The minimization problem is modeled as a scheduled time-multiplexed dataflow graph (TDFG). To reduce the number of operators to be scheduled in the DFG, the most dominant common subexpressions are greedily identified and eliminated based on the subexpressions' frequencies which are updated dynamically in the optimization process. High level synthesis algorithm is then employed to perform the scheduling of operators to control steps. By binding the compatible operators in the same control steps, more operators can be saved. Two design examples are used to demonstrate the effectiveness of the proposed algorithm. On average, the logic complexity of the proposed ReMB design is about 19% lower than that of the classical ReMB methods, and 7% lower than that of the latest and most competitive ReMB design methodology.
暂无评论