The Aho-Corasick (AC) algorithm is a very flexible and efficient but memory-hungry pattern matching algorithm that can scan the existence of a query string among multiple test strings looking at each character exactly...
详细信息
This paper proposes a new integration technique, called IRIS (Integrated Register allocation and Instruction Scheduling), to combine instruction scheduling and register allocation. Both register allocation and instruc...
详细信息
ISBN:
(纸本)3540364102
This paper proposes a new integration technique, called IRIS (Integrated Register allocation and Instruction Scheduling), to combine instruction scheduling and register allocation. Both register allocation and instruction scheduling are performed simultaneously at each variable reference where the selection between serialization by scheduling and spilling by register allocation is determined. To make a right selection, the costs of serialization and spilling are estimated with a cost model proposed to reduce the complexity of the estimation. Experiments show that IRIS achieves significant improvements when compared to widely-used existing techniques.
Reliability and flexibility are among the key required features of a framework used to model a system. Existing approaches to design resource-constrained, soft-real time systems either provide guarantees for output qu...
详细信息
ISBN:
(纸本)9781479901036
Reliability and flexibility are among the key required features of a framework used to model a system. Existing approaches to design resource-constrained, soft-real time systems either provide guarantees for output quality or account for loss in the system, but not both. We propose two independent solutions where each modeling technique has both the above mentioned characteristics. We present a probabilistic analytical framework and a statistical model checking approach to design system-on-chips for low-cost multimedia systems. We apply the modeling techniques to size the output buffer in a video decoder. The results shows that, for our stochastic design metric, the analytical framework upper bounds (and relatively accurate) compare to the statistical model checking technique. Also, we observed significant reduction in resource usage (such as output buffer size) with tolerable loss in output quality.
PYTHON has become the de facto programming language in machine learning and scientific computing, but high performance implementations are challenging to create especially for embeddedsystems with limited resources. ...
详细信息
ISBN:
(数字)9783030609399
ISBN:
(纸本)9783030609399;9783030609382
PYTHON has become the de facto programming language in machine learning and scientific computing, but high performance implementations are challenging to create especially for embeddedsystems with limited resources. We address the challenge of compiling and optimizing PYTHON source code for a low-level target by introducing RUST as an intermediate source code step. We show that pre-existing PYTHON implementations that depend on optimized libraries, such as NumPy, can be transpiled to RUST semi-automatically, with potential for further automation. We use two representative test cases, Black-Scholes for financial options pricing and robot trajectory optimization. The results show up to 12x speedup and 1.5x less memory use on PC, and the same performance but 4x less memory use on an ARM processor on PYNQ SoC FPGA. We also present a comprehensive list of factors for the process, to show the potential for fully automated transpilation. Our findings are generally applicable and can improve the performance of many PYTHON applications while keeping their easy programmability.
Current integrated circuits exhibit an impressive and increasing power density. In this scenario, thermal modelling plays a key role in the design of next generation cooling and thermal management solutions. However, ...
详细信息
ISBN:
(纸本)9783030275624;9783030275617
Current integrated circuits exhibit an impressive and increasing power density. In this scenario, thermal modelling plays a key role in the design of next generation cooling and thermal management solutions. However, extending existing thermal models, or designing new ones to account for new cooling solutions, requires parameter identification as well as a validation phase to ensure correctness of the results. In this paper, we propose a flexible solution to the validation issue, in the form of a hardware platform based on a Thermal Test Chip (TTC). The proposed platform allows to test a heat dissipation solution under realistic conditions, including fast spatial and temporal power gradients as well as hot spots, while collecting a temperature map of the active silicon layer. The combined power/temperature map is the key input to validate a thermal model, in both the steady state and transient case. This paper presents the current development of the platform, and provides a first validation dataset for the case of a commercial heat sink.
Markov Decision Processes (MDPs) provide a powerful decision making framework, which is increasingly being used in the design of embedded Computing systems (ECSs). This paper presents a detailed accounting of the use ...
详细信息
ISBN:
(纸本)9783030275624;9783030275617
Markov Decision Processes (MDPs) provide a powerful decision making framework, which is increasingly being used in the design of embedded Computing systems (ECSs). This paper presents a detailed accounting of the use of MDPs in this context across research groups, including reference implementations, common datasets, file formats and platforms. Inspired by recent results showing the promising outlook of using embedded GPUs to solve MDPs on ECSs, we detail the many challenges that designers currently face and present GEMBench (the Gpu accelerated embedded Mdp testBench) in order to facilitate experimental research in this area. GEMBench is targeted to a specific embedded GPU platform, the NVIDIA Jetson platform, and is designed for future retargetability to other platforms. GEMBench is a novel open source software package that is intended to run on the target platform. The package contains libraries of MDP solvers, parsers, datasets and reference solutions, which provide a comprehensive infrastructure for understanding trade-offs among existing embedded MDP techniques, and experimenting with novel techniques.
Traditional design techniques for embeddedsystems apply transformations on the source code to optimize hardware-related cost factors. Unfortunately, such transformations cannot adequately deal with the highly dynamic...
详细信息
Traditional design techniques for embeddedsystems apply transformations on the source code to optimize hardware-related cost factors. Unfortunately, such transformations cannot adequately deal with the highly dynamic nature of today's multimedia applications. Therefore, we go one step back in the design process. Starting from a conceptual UML model, we first transform the model before refining it into executable code. This paper presents: various model transformations, an estimation technique for the steering cost parameters, and three case studies that show how our model transformations result in factors improvement in memory footprint and performance with respect to the initial implementation. (c) 2006 Elsevier B.V. All rights reserved.
3D stacking and integration can provide significant system advantages. Following a brief technology review, this abstract explores application drivers, design and CAD for 3D ICs. The main 3D exploitation explored in d...
详细信息
This book constitutes the proceedings of the 21;international conference on embedded computer systems: architectures, modeling, and simulation, SAMOS 2021, which took place in July 2021. Due to COVID-19 pandemic the c...
详细信息
ISBN:
(数字)9783031045806
ISBN:
(纸本)9783031045790
This book constitutes the proceedings of the 21;international conference on embedded computer systems: architectures, modeling, and simulation, SAMOS 2021, which took place in July 2021. Due to COVID-19 pandemic the conference was held virtually.
暂无评论