In this paper we present a new approach for generating high-speed optimized event-driven instruction set level simulators for adaptive massively parallel processor architectures. the simulator generator is part of a m...
详细信息
ISBN:
(纸本)9781450378345
In this paper we present a new approach for generating high-speed optimized event-driven instruction set level simulators for adaptive massively parallel processor architectures. the simulator generator is part of a methodology for the systematic mapping, evaluation, and exploration of massively parallel processor architecturesthat are designed for special purpose applications in the world of embeddedcomputers. the generation of high-speed cycle-accurate simulators is of utmost importance here, because they are directly used both for parallel processor architecture debugging and evaluation purposes, as well as during time-consuming architecture/compiler co-exploration. We developed a modeling environment which automatically generates a C++ simulation model either from a graphical input or directly from an XML-based architecture description. Here, we focus on the underlying event-driven simulation model and present our modeling environment, in particular the features of the graphical parallel processor architecture editor and the automatic instruction set level simulator generator. Finally, in a case-study, we demonstrate the pertinence of our approach by simulating different processor arrays. the superior performance of the generated simulators compared to existing simulators and simulator generation approaches is shown.
the latest image compression standard, JPEG 2000 is well tuned for diverse applications, thus raising various throughput demands on its building blocks. therefore, a JPEG 2000 encoder withthe feature of scalability i...
详细信息
ISBN:
(纸本)354026969X
the latest image compression standard, JPEG 2000 is well tuned for diverse applications, thus raising various throughput demands on its building blocks. therefore, a JPEG 2000 encoder withthe feature of scalability is favorable for its ability of meeting different throughput requirements. On the other hand, the large amounts of data streams underline the importance of bandwidth optimization in designing the encoder. the initial specification, especially in terms of loop organization and array indices, describes the data manipulations and, subsequently, influences the outcome of the architecture implementation. therefore, there is a clear need for the exploiting support, and we believe the emphasis should lie on the loop level steering. In this paper, we apply loop transformation techniques to a scalable embedded JPEG 2000 encoder design during the architectural exploration stage, considering not only the balance of throughput among different blocks, but also the reduction of data transfer. the architecture is prototyped onto Xilinx FPGA. (c) 2006 Elsevier B.V. All rights reserved.
Contemporary SoC designs ask for system-level debugging tools suitable to heterogeneous platforms. Such tools will have to rely on some low-level model-driven debugging engine that must be retargetable, since embedded...
详细信息
Although SoC design space exploration requires retargetable tools and real-time constraint awareness, conventional compiler infrastructure barely provides both. this paper proposes a novel, automatically retargetable,...
详细信息
In this paper, we propose a library for the system level modeling and simulation of dynamically reconfigurable architectures (DRAs). Using the proposed library, the designer can model the system specifications includi...
详细信息
ISBN:
(纸本)9780769527628;0769527620
In this paper, we propose a library for the system level modeling and simulation of dynamically reconfigurable architectures (DRAs). Using the proposed library, the designer can model the system specifications including modules for the dynamic generation and elimination that are needed in the design of dynamically reconfigurable systems. In addition, we evaluate the proposed library by the modeling and simulation of sample circuits, such as multi-context DRA and partially DRA. Using the proposed library, we can model the system specifications as much the same amount as a detailed description, such as one using multiplexers and de-multiplexers, which is a modeling formula for describing multi-context DRA. Under some conditions, higher-speed simulation is possible using the proposed library
the Instruction-Set extension problem has been one of the major topics in the last years and it is the addition of a set of new complex instructions to a given Instruction-Set. this problem in its general formulation ...
详细信息
SystemCdouble-struck F signdouble-struck L sign is the formalisation of a reasonable subset of SystemC based on classical process algebras. During the last few years, SystemC double-struck F signdouble-struck L sign h...
详细信息
this paper proposes a theory of transistor short faults and their detection in logic test environment. the transistor short models were defined, and the characteristics of equivalent faults and redundant faults were r...
详细信息
ISBN:
(纸本)9780769527628;0769527620
this paper proposes a theory of transistor short faults and their detection in logic test environment. the transistor short models were defined, and the characteristics of equivalent faults and redundant faults were revealed. Also presented were a stuck-at fault simulation method and a test generation method that uses only the gate-level description of the circuits while dealing with transistor short faults. Experimental results for ISCAS benchmark circuits were presented to demonstrate the effectiveness of the methodology proposed in this paper
the H.264 standard achieves higher compression efficiency than previous video coding standards withthe rate-distortion optimized (RDO) method for mode decision. the outstanding coding performance of H.264, however, c...
详细信息
the H.264 standard achieves higher compression efficiency than previous video coding standards withthe rate-distortion optimized (RDO) method for mode decision. the outstanding coding performance of H.264, however, comes at the cost of significantly increased complexity. In this paper, we present algorithm-level optimization methods: input parameter selection, fast inter-mode decision, and efficient combination of motion estimation and mode decision. simulation results show that our optimized H.264 encoder achieves realtime encoding for video graphics array (VGA) and extended graphics array (XGA) on a commercial personal computer without introducing serious quality degradations.
暂无评论